FULL-LENGTH HUMAN cDNAs ENCODING POTENTIALLY SECRETED
PROTEINS

CROSS REFERENCE TO RELATED APPLICATIONS 

The present application is related to and claims priority from US provisional Application
Nos. 60/197,873, filed April 18, 2000, 60/224,009, filed August 7, 2000, 60/260,328, filed January
8, 2001, and 60/224,006, filed August 4, 2000, the entire disclosures of which are herein
incorporated by reference.

FIELD OF THE INVENTION

The present invention is directed to GENSET polypeptides, fragments thereof, and the 
regulatory regions located in the 5'- and 3'-ends of the genes encoding the polypeptides. The 
invention also concerns polypeptides encoded by GENSET genes and fragments thereof. The 
present invention also relates to recombinant vectors including the polynucleotides of the present
invention, particularly recombinant vectors comprising a GENSET gene regulatory region or a
sequence encoding a GENSET polypeptide, and to host cells containing the polynucleotides of the 
invention, as well as to methods of making such vectors and host cells. The present invention 
further relates to the use of these recombinant vectors and host cells in the production of the 
polypeptides of the invention. The invention further relates to antibodies that specifically bind to
the polypeptides of the invention and to methods for producing such antibodies and fragments
thereof. The invention also provides methods of detecting the presence of the polynucleotides and
polypeptides of the present invention in a sample, methods of diagnosis and screening of abnormal 
GENSET polypeptide expression and/or biological activity, methods of screening compounds for 
their ability to modulate the activity or expression of the GENSET polypeptides, and uses of such
compounds.

BACKGROUND OF THE INVENTION 

The estimated 30,000-120,000 genes scattered along the human chromosomes offer
tremendous promise for the understanding, diagnosis, and treatment of human diseases. In addition, 
probes capable of specifically hybridizing to loci distributed throughout the human genome find 
application in the construction of high resolution chromosome maps and in the identification of
individuals. 

In the past, the characterization of even a single human gene was a painstaking process, 
requiring years of effort. Recent developments in the areas of cloning vectors, DNA sequencing, 
and computer technology have merged to greatly accelerate the rate at which human genes can be 
isolated, sequenced, mapped, and characterized.

Currently, two different approaches are being pursued for identifying and characterizing the
genes distributed along the human genome. In one approach, large fragments of genomic DNA are 
isolated, cloned, and sequenced. Potential open reading frames in these genomic sequences are 
identified using bio-informatics software. However, this approach entails sequencing large 
stretches of human DNA which do not encode proteins in order to find the protein encoding
sequences scattered throughout the genome. In addition to requiring extensive sequencing, the
bio-informatics software may mischaracterize the genomic sequences obtained, i.e., labeling non-coding
DNA as coding DNA and vice versa. 
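
The identification of potential open reading frames in a genomic sequence can be illustrated by the following minimal Python sketch. It is not the bio-informatics software referred to above; it is a naive scan, offered only as an illustration, that reports every stretch running from an ATG codon to the first in-frame stop codon on either strand. The function names and the `min_codons` threshold are assumptions introduced for the example. Such a scan cannot distinguish true coding sequence from chance ATG...stop stretches, which is one way the mischaracterization noted above can arise.

```python
# Naive ORF scan: illustrative only, not the software referred to in the text.
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_orfs(seq, min_codons=3):
    """Return (strand, start, end) for each ATG...stop stretch on both strands.

    start and end are 0-based, end-exclusive offsets on the scanned strand.
    min_codons is a hypothetical length threshold that counts the stop codon.
    """
    orfs = []
    forward = seq.upper()
    for strand, s in (("+", forward), ("-", reverse_complement(forward))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first in-frame ATG opens a candidate ORF
                elif start is not None and codon in STOP_CODONS:
                    if (i + 3 - start) // 3 >= min_codons:
                        orfs.append((strand, start, i + 3))
                    start = None  # resume searching after the stop codon
    return orfs
```

For example, `find_orfs("CCATGAAATAGCC")` reports a single candidate on the forward strand spanning offsets 2 to 11 (ATG AAA TAG).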

An alternative approach takes a more direct route to identifying and characterizing human
genes. In this approach, complementary DNAs (cDNAs) are synthesized from isolated messenger
RNAs (mRNAs) which encode human proteins. Using this approach, sequencing is only performed 
on DNA which is derived from protein coding fragments of the genome. Often, only short stretches 
of the cDNAs are sequenced to obtain sequences called expressed sequence tags (ESTs). The ESTs 
may then be used to isolate or purify cDNAs which include sequences adjacent to the EST
sequences. The cDNAs may contain all of the sequence of the EST which was used to obtain them
or only a fragment of the sequence of the EST which was used to obtain them. In addition, the 
cDNAs may contain the full coding sequence of the gene from which the EST was derived or, 
alternatively, the cDNAs may include fragments of the coding sequence of the gene from which the 
EST was derived. It will be appreciated that there may be several cDNAs which include the EST
sequence as a result of alternate splicing or the activity of alternative promoters.

In the past, these short EST sequences were often obtained from oligo-dT primed cDNA 
libraries. Accordingly, they mainly corresponded to the 3' untranslated region of the mRNA. In 
part, the prevalence of EST sequences derived from the 3' end of the mRNA is a result of the fact 
that typical techniques for obtaining cDNAs are not well suited for isolating cDNA sequences
derived from the 5' ends of mRNAs (Adams et al., Nature 377:3-174, 1996; Hillier et al., Genome
Res. 6:807-828, 1996). In addition, in those reported instances where longer cDNA sequences have 
been obtained, the reported sequences typically correspond to coding sequences and do not include 
the full 5' untranslated region (5'UTR) of the mRNA from which the cDNA is derived. Indeed, 
5'UTRs have been shown to affect either the stability or translation of mRNAs. Thus, regulation of
gene expression may be achieved through the use of alternative 5'UTRs as shown, for instance, for
the translation of the tissue inhibitor of metalloprotease mRNA in mitogenically activated cells
(Waterhouse et al., J Biol Chem. 265:5585-9, 1990). Furthermore, modification of 5'UTR through
mutation, insertion or translocation events may even be implicated in pathogenesis. For instance, the
Fragile X syndrome, the most common cause of inherited mental retardation, is partly due to an
insertion of multiple CGG trinucleotides in the 5'UTR of the Fragile X mRNA resulting in the
inhibition of protein synthesis via ribosome stalling (Feng et al., Science 268:731-4, 1995). An
aberrant mutation in regions of the 5'UTR known to inhibit translation of the proto-oncogene c-myc
was shown to result in upregulation of c-myc protein levels in cells derived from patients with
multiple myelomas (Willis et al., Curr Top Microbiol Immunol 224:269-76, 1997). In addition, the
use of oligo-dT primed cDNA libraries does not allow the isolation of complete 5'UTRs since such
incomplete sequences obtained by this process may not include the first exon of the mRNA, 
particularly in situations where the first exon is short. Furthermore, they may not include some 
exons, often short ones, which are located upstream of splicing sites. Thus, there is a need to obtain
sequences derived from the 5' ends of mRNAs.

Moreover, despite the great amount of EST data that large-scale sequencing projects have 
yielded (Adams et al., Nature 377:174, 1996; Hillier et al., Genome Res. 6:807-828, 1996),
information concerning the biological function of the mRNAs corresponding to such obtained
cDNAs has proven to be limited. Indeed, whereas the knowledge of the complete coding
sequence is absolutely necessary to investigate the biological function of mRNAs, ESTs yield only
partial coding sequences. So far, large-scale full-length cDNA cloning has been achieved only with 
limited success because of the poor efficiency of methods for constructing full-length cDNA 
libraries. Indeed, such methods require either a large amount of mRNA (Ederly et al., 1995), thus 
resulting in non-representative full-length libraries when small amounts of tissue are available, or
require PCR amplification (Maruyama et al., 1994; CLONTECHniques, 1996) to obtain a
reasonable number of clones, thus yielding strongly biased cDNA libraries where rare and long 
cDNAs are lost. Thus, there is a need to obtain full-length cDNAs, i.e. cDNAs containing the full 
coding sequence of their corresponding mRNAs. 

While many sequences derived from human chromosomes have practical applications,
approaches based on the identification and characterization of those chromosomal sequences which
encode a protein product are particularly relevant to diagnostic and therapeutic uses. Of the 30,000- 
120,000 protein coding genes, those genes encoding proteins which are secreted from the cell in 
which they are synthesized, as well as the secreted proteins themselves, are particularly valuable as 
potential therapeutic agents. Such proteins are often involved in cell to cell communication and
may be responsible for producing a clinically relevant response in their target cells. In fact, several
secretory proteins, including tissue plasminogen activator, G-CSF, GM-CSF, erythropoietin, human
growth hormone, insulin, interferon-α, interferon-β, interferon-γ, and interleukin-2, are currently in
clinical use. These proteins are used to treat a wide range of conditions, including acute myocardial 
infarction, acute ischemic stroke, anemia, diabetes, growth hormone deficiency, hepatitis, kidney
carcinoma, chemotherapy induced neutropenia and multiple sclerosis. For these reasons, cDNAs
encoding secreted proteins or fragments thereof represent a particularly valuable source of 
therapeutic agents. Thus, there is a need for the identification and characterization of secreted 
proteins and the nucleic acids encoding them. 

In addition to being therapeutically useful themselves, secretory proteins include short
peptides, called signal peptides, at their amino termini which direct their secretion. These signal
peptides are encoded by the signal sequences located at the 5' ends of the coding sequences of genes 
encoding secreted proteins. Because these signal peptides will direct the extracellular secretion of 
any protein to which they are operably linked, the signal sequences may be exploited to direct the 
efficient secretion of any protein by operably linking the signal sequences to a gene encoding the 
protein for which secretion is desired. In addition, fragments of the signal peptides called
membrane-translocating sequences may also be used to direct the intracellular import of a peptide 
or protein of interest. This may prove beneficial in gene therapy strategies in which it is desired to 
deliver a particular gene product to cells other than the cells in which it is produced. Signal
sequences encoding signal peptides also find application in simplifying protein purification
techniques. In such applications, the extracellular secretion of the desired protein greatly facilitates
purification by reducing the number of undesired proteins from which the desired protein must be 
selected. Thus, there exists a need to identify and characterize the 5' fragments of the genes for 
secretory proteins which encode signal peptides. 

Sequences coding for secreted proteins may also find application as therapeutics or
diagnostics. In particular, such sequences may be used to determine whether an individual is likely
to express a detectable phenotype, such as a disease, as a consequence of a mutation in the coding 
sequence for a secreted protein. In instances where the individual is at risk of suffering from a 
disease or other undesirable phenotype as a result of a mutation in such a coding sequence, the
undesirable phenotype may be corrected by introducing a normal coding sequence using gene
therapy. Alternatively, if the undesirable phenotype results from overexpression of the protein 
encoded by the coding sequence, expression of the protein may be reduced using antisense or triple 
helix based strategies. 

The secreted human polypeptides encoded by the coding sequences may also be used as
therapeutics by administering them directly to an individual having a condition, such as a disease,
resulting from a mutation in the sequence encoding the polypeptide. In such an instance, the 
condition can be cured or ameliorated by administering the polypeptide to the individual. 

In addition, the secreted human polypeptides or fragments thereof may be used to generate 
antibodies useful in determining the tissue type or species of origin of a biological sample. The
antibodies may also be used to determine the cellular localization of the secreted human
polypeptides or the cellular localization of polypeptides which have been fused to the human 
polypeptides. In addition, the antibodies may also be used in immunoaffinity chromatography 
techniques to isolate, purify, or enrich the human polypeptide or a target polypeptide which has 
been fused to the human polypeptide. 

Public information on the number of human genes for which the promoters and upstream
regulatory regions have been identified and characterized is quite limited. In part, this may be due
to the difficulty of isolating such regulatory sequences. Upstream regulatory sequences such as 
transcription factor binding sites are typically too short to be utilized as probes for isolating 
promoters from human genomic libraries. Recently, some approaches have been developed to
isolate human promoters. One of them consists of making a CpG island library (Cross et al., Nature
Genetics 6: 236-244, 1994). The second consists of isolating human genomic DNA sequences
containing SP1 binding sites by the use of SP1 binding protein (Mortlock et al., Genome Res.
6:327-335, 1996). Both of these approaches have their limits due to a lack of specificity and of
comprehensiveness. Thus, there exists a need to identify and systematically characterize the 5'
fragments of the genes. 

cDNAs including the 5' ends of their corresponding mRNA may be used to efficiently 
identify and isolate 5'UTRs and upstream regulatory regions which control the location,
developmental stage, rate, and quantity of protein synthesis, as well as the stability of the mRNA
(Theil et al., BioFactors 4:87-93, 1993). Once identified and characterized, these regulatory
regions may be utilized in gene therapy or protein purification schemes to obtain the desired amount 
and locations of protein synthesis or to inhibit, reduce, or prevent the synthesis of undesirable gene 
products. 

In addition, cDNAs containing the 5' ends of secretory protein genes may include
sequences useful as probes for chromosome mapping and the identification of individuals. Thus,
there is a need to identify and characterize the sequences upstream of the 5' coding sequences of
genes encoding secretory proteins. 

SUMMARY OF THE INVENTION 

The present invention provides a purified or isolated polynucleotide comprising, consisting
of, or consisting essentially of a nucleotide sequence selected from the group consisting of: (a) the
sequences of SEQ ID NOs: 1-169, 339-455, 561-784; (b) the sequences of clone inserts of the
deposited clone pool; (c) the coding sequences of SEQ ID NOs: 1-169, 339-455, 561-784; (d) the
coding sequences of the clone inserts of the deposited clone pool; (e) the sequences encoding one of
the polypeptides of SEQ ID NOs: 170-338, 456-560, 785-918; (f) the sequences encoding one of the
polypeptides encoded by the clone inserts of the deposited clone pool; (g) the genomic sequences
coding for the GENSET polypeptides; (h) the 5' transcriptional regulatory regions of GENSET
genes; (i) the 3' transcriptional regulatory regions of GENSET genes; (j) the polynucleotides
comprising the nucleotide sequence of any combination of (g)-(i); (k) the variant polynucleotides of
any of the polynucleotides of (a)-(j); (l) the polynucleotides comprising a nucleotide sequence of
(a)-(k), wherein the polynucleotide is single stranded, double stranded, or a portion is single
stranded and a portion is double stranded; and (m) the polynucleotides comprising a nucleotide
sequence complementary to any of the single stranded polynucleotides of (l). The invention further
provides for fragments of the nucleic acid molecules of (a)-(m) described above.

Further embodiments of the invention include purified or isolated polynucleotides that
comprise, consist of, or consist essentially of a nucleotide sequence at least 70% identical, more
preferably at least 75%, and even more preferably at least 80%, 85%, 90%, 95%, 96%, 97%, 98%,
or 99% identical, to any of the nucleotide sequences in (a)-(m) above, e.g. over a region of at least
about 25, 50, 100, 150, 250, 500, 1000, or more contiguous nucleotides, or a polynucleotide which
hybridizes under stringent hybridization conditions to a polynucleotide in (a)-(m) above.
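
The notion of percent identity over a region of contiguous nucleotides can be made concrete with the following minimal Python sketch. It assumes, as a simplification, that the two sequences are already aligned position by position without gaps; an actual comparison would first align the sequences with a dedicated algorithm. The function names, and the default threshold and window values (taken from the lowest figures recited above), are assumptions introduced for the example.

```python
def percent_identity(a, b):
    """Percent of matching positions between two equal-length, pre-aligned sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def meets_identity(a, b, threshold=70.0, window=25):
    """True if some run of `window` contiguous aligned positions reaches `threshold` % identity.

    Assumes ungapped, position-by-position alignment (a simplification).
    """
    n = min(len(a), len(b))
    if n < window:
        return False
    a, b = a.upper(), b.upper()
    # Count matches in the first window, then slide one position at a time.
    matches = sum(a[i] == b[i] for i in range(window))
    best = matches
    for i in range(window, n):
        matches += (a[i] == b[i]) - (a[i - window] == b[i - window])
        best = max(best, matches)
    return 100.0 * best / window >= threshold
```

With these definitions, two 30-nucleotide sequences that agree at their first 20 positions and differ at the last 10 satisfy an 80% threshold over a 25-nucleotide window, but not a 90% threshold.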

The present invention also relates to recombinant vectors, which include the purified or 
isolated polynucleotides of the present invention, and to host cells recombinant for the 
polynucleotides of the present invention, as well as to methods of making such vectors and host 
cells. The present invention further relates to the use of these recombinant vectors and recombinant
host cells in the production of GENSET polypeptides. 

The invention further provides a purified or isolated polypeptide comprising, consisting of, 
or consisting essentially of an amino acid sequence selected from the group consisting of: (a) the
polypeptides of SEQ ID NOs: 170-338, 456-560, 785-918; (b) the polypeptides encoded by the
clone inserts of the deposited clone pool; (c) the epitope-bearing fragments of the polypeptides of 
SEQ ID NOs: 170-338, 456-560, 785-918; (d) the epitope-bearing fragments of the polypeptides 
encoded by the clone inserts contained in the deposited clone pool; (e) the domains of the 
polypeptides of SEQ ID NOs: 170-338, 456-560, 785-918; (f) the domains of the polypeptides
encoded by the clone inserts contained in the deposited clone pool; and (g) the allelic variant
polypeptides of any of the polypeptides of (a)-(f). The invention further provides for fragments of 
the polypeptides of (a)-(g) above, such as those having biological activity or comprising 
biologically functional domain(s). 

The present invention further includes polypeptides with an amino acid sequence with at
least 70% similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or
99% similarity to those polypeptides described in (a)-(g), as well as polypeptides having an amino
acid sequence at least 70% identical, more preferably at least 75% identical, and still more
preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to those polypeptides described
in (a)-(g), e.g. over a region of at least about 25, 50, 100, 150, 250, 500, 1000, or more amino acids.

The invention further relates to methods of making the polypeptides of the present invention.

The present invention further relates to transgenic plants or animals, wherein said
transgenic plant or animal is transgenic for a polynucleotide of the present invention and expresses a 
polypeptide of the present invention. 

The invention further relates to antibodies that specifically bind to GENSET polypeptides
of the present invention and fragments thereof as well as to methods for producing such antibodies
and fragments thereof. 

The invention also provides kits, uses and methods for detecting GENSET gene expression 
and/or biological activity in a biological sample. One such method involves assaying for the 
expression of a GENSET polynucleotide in a biological sample using the polymerase chain reaction
(PCR) to amplify and detect GENSET polynucleotides or Southern and Northern blot hybridization
to detect GENSET genomic DNA, cDNA or mRNA. Alternatively, GENSET gene expression in a
test sample can be detected using a compound which binds to a
GENSET polypeptide of the present invention or a portion of a GENSET polypeptide.

The present invention also relates to diagnostic methods and uses of GENSET
polynucleotides and polypeptides for identifying individuals or non-human animals having elevated
or reduced levels of GENSET gene products, which individuals are likely to benefit from therapies 
to suppress or enhance GENSET gene expression, respectively, and to methods of identifying 
individuals or non-human animals at increased risk for developing, or at present having, certain 
diseases/disorders associated with GENSET polypeptide expression or biological activity.

The present invention also relates to kits, uses and methods of screening compounds for
their ability to modulate (e.g. increase or inhibit) the activity or expression of GENSET 
polypeptides including compounds that interact with GENSET gene regulatory sequences and 
compounds that interact directly or indirectly with a GENSET polypeptide. Uses of such
compounds are also within the scope of the present invention.

The present invention also relates to pharmaceutical or physiologically acceptable 
compositions comprising, as an active agent, the polypeptides, polynucleotides or antibodies of the
present invention, as well as, typically, a pharmaceutically acceptable carrier.

The present invention also relates to computer systems containing cDNA codes and 
polypeptide codes of sequences of the invention and to computer-related methods of comparing
sequences, identifying homology or features using GENSET polypeptides or GENSET 
polynucleotide sequences of the invention. 

In another aspect, the present invention provides an isolated polynucleotide, the 
polynucleotide comprising a nucleic acid sequence encoding: i) a polypeptide comprising an amino
acid sequence having at least about 80% identity to any one of the sequences shown as SEQ ID
NOs: 170-338, 456-560, 785-918 or any one of the sequences of polypeptides encoded by the clone
inserts of the deposited clone pool; or ii) a biologically active fragment of the polypeptide.

In one embodiment, the polypeptide comprises any one of the sequences shown as SEQ ID 
NOs: 170-338, 456-560, 785-918 or any one of the sequences of the polypeptides encoded by the
clone inserts of the deposited clone pool. In another embodiment, the polypeptide comprises a
signal peptide. In another embodiment, the polypeptide is a mature protein. In another
embodiment, the nucleic acid sequence has at least about 80% identity over at least about 100
contiguous nucleotides to any one of the sequences shown as SEQ ID NOs: 1-169, 339-455,
561-784 or any one of the sequences of the clone inserts of the deposited clone pool. In another
embodiment, the polynucleotide hybridizes under stringent conditions to a polynucleotide
comprising any one of the sequences shown as SEQ ID NOs: 1-169, 339-455, 561-784 or any one of
the sequences of the clone inserts of the deposited clone pool. In another embodiment, the nucleic
acid sequence comprises any one of the sequences shown as SEQ ID NOs: 1-169, 339-455, 561-784
or any one of the sequences of the clone inserts of the deposited clone pool. In another embodiment,
the polynucleotide is operably linked to a promoter.

In another aspect, the present invention provides an expression vector comprising any of the 
herein-described polynucleotides, operably linked to a promoter. In another aspect, the present 
invention provides a host cell recombinant for any of the herein-described polynucleotides. In 
another aspect, the present invention provides a non-human transgenic animal comprising the host
cell.

In another aspect, the present invention provides a method of making a GENSET 
polypeptide, the method comprising a) providing a population of host cells comprising a herein- 
described polynucleotide and b) culturing the population of host cells under conditions conducive to 
the production of the polypeptide within said host cells.

In one embodiment, the method further comprises purifying the polypeptide from the
population of host cells. 

In another aspect, the present invention provides a method of making a GENSET 
polypeptide, the method comprising a) providing a population of cells comprising a polynucleotide
encoding a herein-described polypeptide; b) culturing the population of cells under conditions
conducive to the production of the polypeptide within the cells; and c) purifying the polypeptide 
from the population of cells. 

In another aspect, the present invention provides an isolated polynucleotide, the 
polynucleotide comprising a nucleic acid sequence having at least about 80% identity over at least
about 100 contiguous nucleotides to any one of the sequences shown as SEQ ID NOs: 1-169,
339-455, 561-784 or any one of the sequences of the clone inserts of the deposited clone pool.

In one embodiment, the polynucleotide hybridizes under stringent conditions to a 
polynucleotide comprising any one of the sequences shown as SEQ ID NOs: 1-169, 339-455, 561- 
784 or any one of the sequences of the clone inserts of the deposited clone pool. In another
embodiment, the polynucleotide comprises any one of the sequences shown as SEQ ID NOs: 1-169,
339-455, 561-784 or any one of the sequences of the clone inserts of the deposited clone pool.

In another aspect, the present invention provides a biologically active polypeptide encoded 
by any of the herein-described polynucleotides. 

In another aspect, the present invention provides an isolated polypeptide or biologically
active fragment thereof, the polypeptide comprising an amino acid sequence having at least about
80% sequence identity to any one of the sequences shown as SEQ ID NOs: 170-338, 456-560,
785-918 or any one of the sequences of polypeptides encoded by the clone inserts of the deposited clone
pool.

In one embodiment, the polypeptide is selectively recognized by an antibody raised against
an antigenic polypeptide, or an antigenic fragment thereof, the antigenic polypeptide comprising
any one of the sequences shown as SEQ ID NOs: 170-338, 456-560, 785-918 or any one of the
sequences of polypeptides encoded by the clone inserts of the deposited clone pool. In another
embodiment, the polypeptide comprises any one of the sequences shown as SEQ ID NOs: 170-338,
456-560, 785-918 or any one of the sequences of polypeptides encoded by the clone inserts of the
deposited clone pool. In another embodiment, the polypeptide comprises a signal peptide. In
another embodiment, the polypeptide is a mature protein. 

In another aspect, the present invention provides an antibody that specifically binds to any 
of the herein-described polypeptides.

In another aspect, the present invention provides a method of determining whether a
GENSET gene is expressed within a mammal, the method comprising the steps of: a) providing a
biological sample from said mammal; b) contacting said biological sample with either of: i) a 
polynucleotide that hybridizes under stringent conditions to any of the herein-described 
polynucleotides; or ii) a polypeptide that specifically binds to any of the herein-described 
polypeptides; and c) detecting the presence or absence of hybridization between the polynucleotide 
and an RNA species within the sample, or the presence or absence of binding of the polypeptide to a
protein within the sample; wherein a detection of the hybridization or of the binding indicates that 
the GENSET gene is expressed within the mammal. 

In one embodiment, the polynucleotide is a primer, and the hybridization is detected by
detecting the presence of an amplification product comprising the sequence of the primer. In
another embodiment, the polypeptide is an antibody. 

In another aspect, the present invention provides a method of determining whether a 
mammal has an elevated or reduced level of GENSET gene expression, the method comprising the 
steps of: a) providing a biological sample from the mammal; and b) comparing the amount of any of
the herein-described polypeptides, or of an RNA species encoding the polypeptide, within the
biological sample with a level detected in or expected from a control sample; wherein an increased 
amount of the polypeptide or the RNA species within the biological sample compared to the level 
detected in or expected from the control sample indicates that the mammal has an elevated level of 
the GENSET gene expression, and wherein a decreased amount of the polypeptide or the RNA
species within the biological sample compared to the level detected in or expected from the control
sample indicates that the mammal has a reduced level of the GENSET gene expression. 
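
The comparison step of this method can be sketched in Python as follows. The sketch is illustrative only: the function name, the returned labels, and the `tolerance` margin (a relative band around the control level within which the level is treated as unchanged) are assumptions introduced for the example and are not specified above.

```python
def classify_expression(sample_level, control_level, tolerance=0.0):
    """Compare a measured polypeptide or RNA level with a control level.

    tolerance is a hypothetical relative margin (e.g. 0.2 for 20%) inside
    which the sample is reported as unchanged relative to the control.
    """
    if control_level <= 0:
        raise ValueError("control_level must be positive")
    ratio = sample_level / control_level
    if ratio > 1.0 + tolerance:
        return "elevated"
    if ratio < 1.0 - tolerance:
        return "reduced"
    return "unchanged"
```

In practice, the margin would be chosen from the measurement variability of the assay used; with the default of zero, any difference from the control is reported as elevated or reduced.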

In another aspect, the present invention provides a method of identifying a candidate 
modulator of a GENSET polypeptide, the method comprising: a) contacting any of the herein- 
described polypeptides with a test compound; and b) determining whether the compound
specifically binds to the polypeptide; wherein a detection that the compound specifically binds to
the polypeptide indicates that the compound is a candidate modulator of the GENSET polypeptide. 

In one embodiment, the method further comprises testing the biological activity of the 
GENSET polypeptide in the presence of the candidate modulator, wherein an alteration in the 
biological activity of the GENSET polypeptide in the presence of the compound in comparison to
the activity in the absence of said compound indicates that the compound is a modulator of the
GENSET polypeptide. 

In another aspect, the present invention provides a method for the production of a 
pharmaceutical composition, the method comprising a) identifying a modulator of a GENSET 
polypeptide using any of the herein-described methods; and b) combining the modulator with a
pharmaceutically acceptable carrier.

BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 is a block diagram of an exemplary computer system. 

Figure 2 is a flow diagram illustrating one embodiment of a process 200 for comparing a
new nucleotide or protein sequence with a database of sequences in order to determine the identity
levels between the new sequence and the sequences in the database. 

Figure 3 is a flow diagram illustrating one embodiment of a process 250 in a computer for 
determining whether two sequences are homologous.

Figure 4 is a flow diagram illustrating one embodiment of an identifier process 300 for
detecting the presence of a feature in a sequence. 



BRIEF DESCRIPTION OF THE TABLES 

Table I provides the SEQ ID Nos in the present application (with the SEQ ID Nos
corresponding to nucleic acid sequences preceded by "NUC", and the SEQ ID Nos corresponding to
the encoded polypeptide sequences preceded by "PRT") that correspond to a SEQ ID NO in priority 
application number 60/197,873. Applicants' internal designation number (Clone ID) corresponding 
to each sequence identification (SEQ ID) number is also provided. 

Table II lists the putative chromosomal location of the polynucleotides of the present
invention. The SEQ ID NO listed for each polynucleotide is that from the priority application
60/197,873; the corresponding SEQ ID NOs for the sequence in the present application can be 
determined by referring to Table I. 

Table III lists the number of hits in Genset's cDNA libraries of tissues and cell types for
polynucleotides of the invention. The following abbreviations are used to refer to each cell or tissue
type: A=Brain; B=Fetal brain; C=Fetal kidney; D=Fetal liver; E=Pituitary gland; F=Liver;
G=Placenta; H=Prostate; I=Salivary gland; J=Stomach/Intestine; and K=Testis. The SEQ ID NO
listed for each polynucleotide is that from the priority application 60/197,873; the corresponding 
SEQ ID NOs for the sequence in the present application can be determined by referring to Table I. 

Table IV lists the number of hits in publicly available libraries of tissues and cell types for
polynucleotides of the invention. The SEQ ID NO listed for each polynucleotide is that from the 
priority application 60/197,873; the corresponding SEQ ID NOs for each sequence in the present 
application can be determined by referring to Table I. 

Table V lists the tissues and cell types in which the polynucleotide sequences of the present
invention are over- or under-represented. The SEQ ID NO listed for each polynucleotide is that
from the priority application 60/197,873; the corresponding SEQ ID NOs for each sequence in the 
present application can be determined by referring to Table I. 



BRIEF DESCRIPTION OF THE SEQUENCE LISTING 

SEQ ID NOs: 1-169, 339-455, 561-784 are the nucleotide sequences of cDNAs, with open
reading frames as indicated as features (CDS). When appropriate, the locations of the potential 
polyadenylation site and polyadenylation signal are also indicated. 

SEQ ID NOs: 170-338, 456-560, 785-918 are the amino acid sequences of proteins encoded 
by the cDNAs of SEQ ID NOs: 1-169, 339-455, 561-784.

SEQ ID NOs: 1-85, 339-400, 406-407, 413-415, 561-594, and 634-651 are the nucleotide
sequences of cDNAs encoding a potentially secreted protein. The locations of the ORFs and 
sequences encoding signal peptides are listed in the accompanying Sequence Listing. In addition, 
the von Heijne score of the signal peptide computed as described below is listed as the "score" in
the accompanying Sequence Listing. The sequence of the signal peptide is listed as "seq" in the
accompanying Sequence Listing. The "/" in the signal peptide sequence indicates the location
where proteolytic cleavage of the signal peptide occurs to generate a mature protein. When
appropriate, the locations of the first and last nucleotides of the coding sequences, eventually the
locations of the first and last nucleotides of the polyA and the locations of the first and last 
nucleotides of the polyA sites are indicated. 

SEQ ID NOs:86-169, 401-405, 408-412, 416-455, 595-633, 652-784 are the nucleotide 
sequences of cDNAs in which no sequence encoding a signal peptide has been identified to date.
However, it remains possible that subsequent analysis will identify a sequence encoding a signal
peptide in these nucleic acids. The locations of the ORFs are listed in the accompanying Sequence 
Listing. When appropriate, the locations of the first and last nucleotides of the coding sequences, 
eventually the locations of the first and last nucleotides of the polyA and the locations of the first 
and last nucleotides of the polyA sites are indicated. 

SEQ ID NOs: 170-254, 456-517, 520-521, 527-529, 785-818, and 858-875 are the amino
acid sequences of polypeptides which contain a signal peptide. These polypeptides are encoded by
the cDNAs of SEQ ID NOs: 1-85, 339-400, 406-407, 413-415, 561-594, and 634-651. The location
of the signal peptide is listed in the accompanying Sequence Listing. 

SEQ ID NOs: 255-338, 517-519, 522-526, 530-560, 819-857, 876-918 are the amino acid
sequences of polypeptides in which no signal peptide has been identified to date. However, it
remains possible that subsequent analysis will identify a signal peptide in these polypeptides. These
polypeptides are encoded by the nucleic acids of SEQ ID NOs: 86-169, 401-405, 408-412, 416-455, 
595-633, 652-784. 

In accordance with the regulations relating to Sequence Listings, the following codes have
been used in the Sequence Listing to describe nucleotide sequences. The code "r" in the sequences
indicates that the nucleotide may be a guanine or an adenine. The code "y" in the sequences
indicates that the nucleotide may be a thymine or a cytosine. The code "m" in the sequences
indicates that the nucleotide may be an adenine or a cytosine. The code "k" in the sequences
indicates that the nucleotide may be a guanine or a thymine. The code "s" in the sequences
indicates that the nucleotide may be a guanine or a cytosine. The code "w" in the sequences
indicates that the nucleotide may be an adenine or a thymine. In addition, all instances of the
symbol "n" in the nucleic acid sequences mean that the nucleotide can be adenine, guanine, cytosine
or thymine.
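
By way of illustration only, the ambiguity codes listed above can be expressed as a simple lookup table. The following is a minimal sketch in Python; the names AMBIGUITY_CODES, expand and matches are hypothetical and are not part of the Sequence Listing or of any particular software package.

```python
from itertools import product

# Ambiguity codes exactly as described in the paragraph above
# (lower case, as used in the Sequence Listing).
AMBIGUITY_CODES = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ga",    # guanine or adenine
    "y": "tc",    # thymine or cytosine
    "m": "ac",    # adenine or cytosine
    "k": "gt",    # guanine or thymine
    "s": "gc",    # guanine or cytosine
    "w": "at",    # adenine or thymine
    "n": "agct",  # any of the four bases
}


def matches(code, base):
    """Return True if `base` is one of the nucleotides allowed by `code`."""
    return base.lower() in AMBIGUITY_CODES[code.lower()]


def expand(sequence):
    """Return every unambiguous sequence consistent with `sequence`."""
    choices = [AMBIGUITY_CODES[symbol] for symbol in sequence.lower()]
    return ["".join(combination) for combination in product(*choices)]
```

For example, under this sketch the two-letter sequence "ar" expands to "ag" and "aa". The number of expansions grows multiplicatively with each ambiguous position, so full expansion is practical only for short sequences.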

In some instances, the polypeptide sequences in the Sequence Listing contain the symbol 
"Xaa." These "Xaa" symbols indicate either (1) a residue which cannot be identified because of
nucleotide sequence ambiguity or (2) a stop codon in the determined sequence where applicants 
believe one should not exist (if the sequence were determined more accurately). In some instances, 
several possible identities of the unknown amino acids may be suggested by the genetic code. 

In the case of secreted proteins, it should be noted that, in accordance with the regulations
governing Sequence Listings, in the appended Sequence Listing the encoded protein (i.e. the protein 
containing the signal peptide and the mature protein or part thereof) extends from an amino acid 
residue having a negative number through a positively numbered amino acid residue. Thus, the first 
amino acid of the mature protein resulting from cleavage of the signal peptide is designated as
amino acid number 1, and the first amino acid of the signal peptide is designated with the 
appropriate negative number. 
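
The numbering convention described above may be sketched as follows. This is an illustrative example only; the function name number_residues and the example sequence are hypothetical, and the sketch assumes that no residue is numbered 0, so that the last residue of the signal peptide is -1 and the first residue of the mature protein is 1.

```python
def number_residues(precursor, signal_length):
    """Pair each residue of a precursor protein with its position number.

    Residues of the signal peptide receive negative numbers ending at -1;
    the first residue of the mature protein is numbered 1.
    Assumption: there is no residue numbered 0.
    """
    if not 0 <= signal_length <= len(precursor):
        raise ValueError("signal_length must lie within the precursor")
    numbered = []
    for index, residue in enumerate(precursor):
        offset = index - signal_length
        # Negative offsets belong to the signal peptide; others shift up by one.
        number = offset if offset < 0 else offset + 1
        numbered.append((number, residue))
    return numbered
```

With a hypothetical five-residue precursor "MKLAQ" and a three-residue signal peptide, the residues are numbered -3, -2, -1, 1 and 2.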



DETAILED DESCRIPTION OF THE INVENTION AND PREFERRED
EMBODIMENTS

DEFINITIONS

Before describing the invention in greater detail, the following definitions are set forth to 
illustrate and define the meaning and scope of the terms used to describe the invention herein. 

The term "GENSET gene," when used herein, encompasses genomic, mRNA and cDNA 
sequences encoding a GENSET polypeptide, including the 5' and 3' untranslated regions of said
sequences. 

The term "GENSET polypeptide biological activity" or "GENSET biological activity" is 
intended for polypeptides exhibiting any activity similar, but not necessarily identical, to an activity 
of a GENSET polypeptide of the invention. The GENSET polypeptide biological activity of a 
given polypeptide may be assessed using any suitable biological assay, a number of which are
known to those skilled in the art. In contrast, the term "biological activity" refers to any activity 
that any polypeptide may have. 

The term "corresponding mRNA" refers to mRNA which was or can be a template for 
cDNA synthesis for producing a cDNA of the present invention. 

The term "corresponding genomic DNA" refers to genomic DNA which encodes an mRNA
of interest, e.g. corresponding to a cDNA of the invention, which genomic DNA includes the 
sequence of one of the strands of the mRNA, in which thymidine residues in the sequence of the 
genomic DNA (or cDNA) are replaced by uracil residues in the mRNA. 

The term "deposited clone pool" is used herein to refer to the pool of clones entitled cDNA-8-2000,
deposited with the ATCC on September 27, 2000, or the pool of clones entitled cDNA-11-2000,
deposited with the ATCC on November 27, 2000, or any other deposited clone pool
containing a clone corresponding to any of the herein-described sequences. 

The term "heterologous", when used herein, is intended to designate any polynucleotide or 
polypeptide other than a GENSET polynucleotide or GENSET polypeptide of the invention, 
respectively.

"Providing" with respect to, e.g. a biological sample, population of cells, etc. indicates that 
the sample, population of cells, etc. is somehow used in a method or procedure. Significantly, 
"providing" a biological sample or population of cells does not require that the sample or cells are
specifically isolated or obtained for the purposes of the invention, but can instead refer, for 
example, to the use of a biological sample obtained by another individual, for another purpose. 

An "amplification product" refers to a product of any amplification reaction, e.g. PCR, RT-PCR,
LCR, etc.

A "modulator" of a protein or other compound refers to any agent that has a functional 
effect on the protein, including physical binding to the protein, alterations of the quantity or quality 
of expression of the protein, altering any measurable or detectable activity, property, or behavior of 
the protein, or in any way interacts with the protein or compound. 

"A test compound" can be any molecule that is evaluated for its ability to modulate a
protein or other compound. 

An antibody or other compound that specifically binds to a polypeptide or polynucleotide of 
the invention is also said to "selectively recognize" the polypeptide or polynucleotide. 

The term "isolated" with respect to a molecule requires that the molecule be removed from
its original environment (e.g., the natural environment if it is naturally occurring). For example, a
naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, but the
same polynucleotide or DNA or polypeptide, separated from some or all of the coexisting materials
in the natural system, is isolated. Such polynucleotide could be part of a vector and/or such
polynucleotide or polypeptide could be part of a composition, and still be isolated in that the vector
or composition is not part of its natural environment. For example, a naturally-occurring
polynucleotide present in a living animal is not isolated, but the same polynucleotide, separated
from some or all of the coexisting materials in the natural system, is isolated. Specifically excluded
from the definition of "isolated" are: naturally-occurring chromosomes (such as chromosome
spreads), artificial chromosome libraries, genomic libraries, and cDNA libraries that exist either as
an in vitro nucleic acid preparation or as a transfected/transformed host cell preparation, wherein the
host cells are either an in vitro heterogeneous preparation or plated as a heterogeneous population of
single colonies. Also specifically excluded are the above libraries wherein a specified
polynucleotide makes up less than 5% of the number of nucleic acid inserts in the vector molecules.
Further specifically excluded are whole cell genomic DNA or whole cell RNA preparations
(including said whole cell preparations which are mechanically sheared or enzymatically digested).
Further specifically excluded are the above whole cell preparations as either an in vitro preparation
or as a heterogeneous mixture separated by electrophoresis (including blot transfers of the same)
wherein the polynucleotide of the invention has not further been separated from the heterologous
polynucleotides in the electrophoresis medium (e.g., further separating by excising a single band
from a heterogeneous band population in an agarose gel or nylon blot).

The term "purified" does not require absolute purity; rather, it is intended as a relative 
definition. Purification of starting material or natural material to at least one order of magnitude, 
preferably two or three orders, and more preferably four or five orders of magnitude is expressly 
contemplated. As an example, purification from 0.1% concentration to 10% concentration is two
orders of magnitude. To illustrate, individual cDNA clones isolated from a cDNA library have been
conventionally purified to electrophoretic homogeneity. The sequences obtained from these clones
could not be obtained directly either from the library or from total human DNA. The cDNA clones
are not naturally occurring as such, but rather are obtained via manipulation of a partially purified
naturally occurring substance (messenger RNA). The conversion of mRNA into a cDNA library
involves the creation of a synthetic substance (cDNA) and pure individual cDNA clones can be
isolated from the synthetic library by clonal selection. Thus, creating a cDNA library from
messenger RNA and subsequently isolating individual clones from that library results in an
approximately 10^4-10^6 fold purification of the native message.
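
The arithmetic behind the "orders of magnitude" example given above can be checked with a short calculation. The following sketch is illustrative only; the function name orders_of_magnitude is hypothetical, and the sketch assumes that purification is measured as the ratio of final to starting concentration.

```python
import math


def orders_of_magnitude(start_percent, end_percent):
    """Return the purification factor expressed as a power of ten.

    Assumption: purification is the ratio of the final concentration to the
    starting concentration, both given in the same units (here, percent).
    """
    if start_percent <= 0 or end_percent <= 0:
        raise ValueError("concentrations must be positive")
    return math.log10(end_percent / start_percent)
```

Under this sketch, going from 0.1% to 10% is a 100-fold enrichment, i.e. two orders of magnitude, in agreement with the example in the text.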

The term "purified" is further used herein to describe a polypeptide or polynucleotide of the
invention which has been separated from other compounds including, but not limited to,
polypeptides or polynucleotides, carbohydrates, lipids, etc. The term "purified" may be used to
specify the separation of monomeric polypeptides of the invention from oligomeric forms such as
homo- or hetero- dimers, trimers, etc. The term "purified" may also be used to specify the
separation of covalently closed (i.e. circular) polynucleotides from linear polynucleotides. A
polynucleotide is substantially pure when at least about 50%, preferably 60 to 75% of a sample
exhibits a single polynucleotide sequence and conformation (linear versus covalently closed). A
substantially pure polypeptide or polynucleotide typically comprises about 50%, preferably 60 to
90% weight/weight of a polypeptide or polynucleotide sample, respectively, more usually about
95%, and preferably is over about 99% pure. Polypeptide and polynucleotide purity, or
homogeneity, is indicated by a number of means well known in the art, such as agarose or
polyacrylamide gel electrophoresis of a sample, followed by visualizing a single band upon staining
the gel. For certain purposes higher resolution can be provided by using HPLC or other means well
known in the art. As an alternative embodiment, purification of the polypeptides and polynucleotides
of the present invention may be expressed as "at least" a percent purity relative to heterologous
polypeptides and polynucleotides (DNA, RNA or both). As a preferred embodiment, the polypeptides
and polynucleotides of the present invention are at least 10%, 20%, 30%, 40%, 50%, 60%, 70%,
80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% pure relative to heterologous polypeptides and
polynucleotides, respectively. As a further preferred embodiment the polypeptides and
polynucleotides have a purity ranging from any number, to the thousandth position, between 90% and
100% (e.g., a polypeptide or polynucleotide at least 99.995% pure) relative to either heterologous
polypeptides or polynucleotides, respectively, or as a weight/weight ratio relative to all compounds
and molecules other than those existing in the carrier. Each number representing a percent purity,
to the thousandth position, may be claimed as individual species of purity.

As used interchangeably herein, the terms "nucleic acid molecule(s)", "oligonucleotide(s)",
and "polynucleotide(s)" include RNA or DNA (either single or double stranded, coding,
complementary or antisense), or RNA/DNA hybrid sequences of more than one nucleotide in either
single chain or duplex form (although each of the above species may be particularly specified). The
term "nucleotide" is used herein as an adjective to describe molecules comprising RNA, DNA, or
RNA/DNA hybrid sequences of any length in single-stranded or duplex form. More precisely, the
expression "nucleotide sequence" encompasses the nucleic material itself and is thus not restricted 
to the sequence information (i.e. the succession of letters chosen among the four base letters) that 
biochemically characterizes a specific DNA or RNA molecule. The term "nucleotide" is also used 
herein as a noun to refer to individual nucleotides or varieties of nucleotides, meaning a molecule,
or individual unit in a larger nucleic acid molecule, comprising a purine or pyrimidine, a ribose or 
deoxyribose sugar moiety, and a phosphate group, or phosphodiester linkage in the case of 
nucleotides within an oligonucleotide or polynucleotide. The term "nucleotide" is also used herein 
to encompass "modified nucleotides" which comprise at least one modification such as (a) an
alternative linking group, (b) an analogous form of purine, (c) an analogous form of pyrimidine, or
(d) an analogous sugar. For examples of analogous linking groups, purine, pyrimidines, and sugars, 
see, for example, PCT publication No. WO 95/04064, which disclosure is hereby incorporated by 
reference in its entirety. Preferred modifications of the present invention include, but are not 
limited to, 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine,
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine,
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine,
2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine,
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine,
5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine,
uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine,
5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid
methylester, uracil-5-oxyacetic acid, 5-methyl-2-thiouracil,
3-(3-amino-3-N-2-carboxypropyl) uracil, and 2,6-diaminopurine. The
polynucleotide sequences of the invention may be prepared by any known method, including
synthetic, recombinant, ex vivo generation, or a combination thereof, as well as utilizing any
purification methods known in the art. Methylenemethylimino linked oligonucleosides as well as 
mixed backbone compounds having, may be prepared as described in U.S. Pat. Nos. 5,378,825; 
5,386,023; 5,489,677; 5,602,240; and 5,610,289, which disclosures are hereby incorporated by 
reference in their entireties. Formacetal and thioformacetal linked oligonucleosides may be prepared
as described in U.S. Pat. Nos. 5,264,562 and 5,264,564, which disclosures are hereby incorporated
by reference in their entireties. Ethylene oxide linked oligonucleosides may be prepared as 
described in U.S. Pat. No. 5,223,618, which disclosure is hereby incorporated by reference in its 
entirety. Phosphinate oligonucleotides may be prepared as described in U.S. Pat. No. 5,508,270, 
which disclosure is hereby incorporated by reference in its entirety. Alkyl phosphonate
oligonucleotides may be prepared as described in U.S. Pat. No. 4,469,863, which disclosure is
hereby incorporated by reference in its entirety. 3'-Deoxy-3'-methylene phosphonate 
oligonucleotides may be prepared as described in U.S. Pat. Nos. 5,610,289 or 5,625,050 which 
disclosures are hereby incorporated by reference in their entireties. Phosphoramidite 
oligonucleotides may be prepared as described in U.S. Pat. No. 5,256,775 or U.S. Pat. No. 
5,366,878 which disclosures are hereby incorporated by reference in their entireties.
Alkylphosphonothioate oligonucleotides may be prepared as described in published PCT
applications WO 94/17093 and WO 94/02499 which disclosures are hereby incorporated by 
reference in their entireties. 3'-Deoxy-3'-amino phosphoramidate oligonucleotides may be prepared
as described in U.S. Pat. No. 5,476,925, which disclosure is hereby incorporated by reference in its
entirety. Phosphotriester oligonucleotides may be prepared as described in U.S. Pat. No. 5,023,243, 
which disclosure is hereby incorporated by reference in its entirety. Borano phosphate 
oligonucleotides may be prepared as described in U.S. Pat. Nos. 5,130,302 and 5,177,198 which 
disclosures are hereby incorporated by reference in their entireties. 

The term "upstream" is used herein to refer to a location which is toward the 5' end of the
polynucleotide from a specific reference point. 

The terms "base paired" and "Watson & Crick base paired" are used interchangeably herein 
to refer to nucleotides which can be hydrogen bonded to one another by virtue of their sequence 
identities in a manner like that found in double-helical DNA with thymine or uracil residues linked
to adenine residues by two hydrogen bonds and cytosine and guanine residues linked by three
hydrogen bonds (see Stryer, 1995, which disclosure is hereby incorporated by reference in its 
entirety). 

The terms "complementary" or "complement thereof" are used herein to refer to the
sequences of polynucleotides which are capable of forming Watson & Crick base pairing with
another specified polynucleotide throughout the entirety of the complementary region. For the
purpose of the present invention, a first polynucleotide is deemed to be complementary to a second
polynucleotide when each base in the first polynucleotide is paired with its complementary base.
Complementary bases are, generally, A and T (or A and U), or C and G. "Complement" is used
herein as a synonym for "complementary polynucleotide", "complementary nucleic acid" and
"complementary nucleotide sequence". These terms are applied to pairs of polynucleotides based
solely upon their sequences and not any particular set of conditions under which the two
polynucleotides would actually bind. Unless otherwise stated, all complementary polynucleotides
are fully complementary on the whole length of the considered polynucleotide.
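
As a non-limiting illustration, the sequence-only test of complementarity described above may be sketched as follows. The names PAIRS and is_fully_complementary are hypothetical. The sketch assumes that both sequences are written 5' to 3' and that the strands pair in antiparallel orientation, as in double-helical DNA.

```python
# Complementary bases as stated in the text: A and T (or A and U), C and G.
PAIRS = {
    "A": {"T", "U"},
    "T": {"A"},
    "U": {"A"},
    "C": {"G"},
    "G": {"C"},
}


def is_fully_complementary(first, second):
    """Return True if every base of `first` pairs with a base of `second`.

    Assumptions: both sequences are written 5' to 3', pairing is antiparallel,
    and full complementarity requires the two sequences to have equal length.
    """
    first, second = first.upper(), second.upper()
    if len(first) != len(second):
        return False
    return all(
        partner in PAIRS.get(base, set())
        for base, partner in zip(first, reversed(second))
    )
```

Consistent with the definition above, this check depends only on the two sequences and not on any hybridization conditions.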

The terms "polypeptide" and "protein", used interchangeably herein, refer to a polymer of
amino acids without regard to the length of the polymer; thus, peptides, oligopeptides, and proteins
are included within the definition of polypeptide. This term also does not specify or exclude
chemical or post-expression modifications of the polypeptides of the invention, although chemical
or post-expression modifications of these polypeptides may be included or excluded as specific
embodiments. Therefore, for example, modifications to polypeptides that include the covalent
attachment of glycosyl groups, acetyl groups, phosphate groups, lipid groups and the like are
expressly encompassed by the term polypeptide. Further, polypeptides with these modifications
may be specified as individual species to be included or excluded from the present invention. The
natural or other chemical modifications, such as those listed in examples above can occur anywhere
in a polypeptide, including the peptide backbone, the amino acid side-chains and the amino or
carboxyl termini. It will be appreciated that the same type of modification may be present in the
same or varying degrees at several sites in a given polypeptide. Also, a given polypeptide may
contain many types of modifications. Polypeptides may be branched, for example, as a result of
ubiquitination, and they may be cyclic, with or without branching. Modifications include
acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of flavin, covalent
attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent
attachment of a lipid or lipid derivative, covalent attachment of phosphatidylinositol, cross-linking,
cyclization, disulfide bond formation, demethylation, formation of covalent cross-links, formation
of cysteine, formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI
anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation,
proteolytic processing, phosphorylation, prenylation, racemization, selenoylation, sulfation,
transfer-RNA mediated addition of amino acids to proteins such as arginylation, and ubiquitination.
(See, for instance Creighton (1993); Seifter et al., (1990); Rattan et al., (1992)). Also included
within the definition are polypeptides which contain one or more analogs of an amino acid
(including, for example, non-naturally occurring amino acids, amino acids which only occur
naturally in an unrelated biological system, modified amino acids from mammalian systems, etc.),
polypeptides with substituted linkages, as well as other modifications known in the art, both
naturally occurring and non-naturally occurring.

As used herein, the terms "recombinant polynucleotide" and "polynucleotide construct" are
used interchangeably to refer to linear or circular, purified or isolated polynucleotides that have
been artificially designed and which comprise at least two nucleotide sequences that are not found 
as contiguous nucleotide sequences in their initial natural environment. In particular, these terms 
mean that the polynucleotide or cDNA is adjacent to "backbone" nucleic acid to which it is not 
adjacent in its natural environment. Additionally, to be "enriched" the cDNAs will represent 5% or
more of the number of nucleic acid inserts in a population of nucleic acid backbone molecules.
Backbone molecules according to the present invention include nucleic acids such as expression 
vectors, self-replicating nucleic acids, viruses, integrating nucleic acids, and other vectors or nucleic 
acids used to maintain or manipulate a nucleic acid insert of interest. Preferably, the enriched 
cDNAs represent 15% or more of the number of nucleic acid inserts in the population of
recombinant backbone molecules. More preferably, the enriched cDNAs represent 50% or more of
the number of nucleic acid inserts in the population of recombinant backbone molecules. In a 
highly preferred embodiment, the enriched cDNAs represent 90% or more (including any number 
between 90 and 100%, to the thousandth position, e.g., 99.5%) of the number of nucleic acid inserts 
in the population of recombinant backbone molecules. 

The term "recombinant polypeptide" is used herein to refer to polypeptides that have been
artificially designed and which comprise at least two polypeptide sequences that are not found as 
contiguous polypeptide sequences in their initial natural environment, or to refer to polypeptides 
which have been expressed from a recombinant polynucleotide. 

As used herein, the term "operably linked" refers to a linkage of polynucleotide elements in
a functional relationship. A sequence which is "operably linked" to a regulatory sequence such as a
promoter means that said regulatory element is in the correct location and orientation in relation to
the nucleic acid to control RNA polymerase initiation and expression of the nucleic acid of interest.
For instance, a promoter or enhancer is operably linked to a coding sequence if it affects the
transcription of the coding sequence.

As used herein, the term "non-human animal" refers to any non-human animal, including
insects, birds, rodents and more usually mammals. Preferred non-human animals include: primates;
farm animals such as swine, goats, sheep, donkeys, cattle, horses, chickens, rabbits; and rodents,
preferably rats or mice. As used herein, the term "animal" is used to refer to any species in the
animal kingdom, preferably vertebrates, including birds and fish, and more preferably a mammal.
Both the terms "animal" and "mammal" expressly embrace human subjects unless preceded with
the term "non-human".

The term "domain" refers to an amino acid fragment with specific biological properties.
This term encompasses all known structural and linear biological motifs. Examples of such motifs
include but are not limited to leucine zippers, helix-turn-helix motifs, glycosylation sites,
ubiquitination sites, alpha helices, and beta sheets, signal peptides which direct the secretion of
proteins, sites for post-translational modification, enzymatic active sites, substrate binding sites, and
enzymatic cleavage sites.

Although each of these terms has a distinct meaning, the terms "comprising", "consisting
of" and "consisting essentially of" may be interchanged for one another throughout the instant
application. The term "having" has the same meaning as "comprising" and may be replaced with
either the term "consisting of" or "consisting essentially of".

Unless otherwise specified in the application, nucleotides and amino acids of
polynucleotides and polypeptides, respectively, of the present invention are contiguous and not
interrupted by heterologous sequences. 

Identity Between Nucleic Acids Or Polypeptides 

The terms "percentage of sequence identity" and "percentage homology" are used
interchangeably herein to refer to comparisons among polynucleotides and polypeptides, and are
determined by comparing two optimally aligned sequences over a comparison window, wherein the
portion of the polynucleotide or polypeptide sequence in the comparison window may comprise
additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise
additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by
determining the number of positions at which the identical nucleic acid base or amino acid residue
occurs in both sequences to yield the number of matched positions, dividing the number of matched
positions by the total number of positions in the window of comparison and multiplying the result
by 100 to yield the percentage of sequence identity. Homology is evaluated using any of the variety
of sequence comparison algorithms and programs known in the art. Such algorithms and programs
include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, CLUSTALW,
FASTDB (Pearson and Lipman, 1988; Altschul et al., 1990; Thompson et al., 1994; Higgins et
al., 1996; Altschul et al., 1990; Altschul et al., 1993; Brutlag et al., 1990), the disclosures of which
are incorporated by reference in their entireties.
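
The percentage calculation described above may be illustrated by the following minimal sketch. It assumes that the two sequences have already been optimally aligned by one of the programs listed above, that gaps are written as "-", and that the comparison window is the full length of the alignment; the function name percent_identity is hypothetical and the sketch does not reproduce any of the cited programs.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of sequence identity over an existing alignment.

    Matched positions are those where both sequences carry the identical
    base or residue; the total is the number of positions in the window.
    Assumption: the comparison window is the whole alignment.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matched = sum(
        1
        for a, b in zip(aligned_a, aligned_b)
        if a == b and a != gap
    )
    return 100.0 * matched / len(aligned_a)
```

For instance, for the hypothetical alignment "ACGT-A" against "ACCTGA", four of the six positions match, giving a value of about 66.7%.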

In a particularly preferred embodiment, protein and nucleic acid sequence homologies are
evaluated using the Basic Local Alignment Search Tool ("BLAST") which is well known in the art
(see, e.g., Karlin and Altschul, 1990; Altschul et al., 1990, 1993, 1997), the disclosures of which
are incorporated by reference in their entireties. In particular, five specific BLAST programs are
used to perform the following tasks:
(1) BLASTP and BLAST3 compare an amino acid query sequence against a protein
sequence database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence 
database; 

(3) BLASTX compares the six-frame conceptual translation products of a query nucleotide 
sequence (both strands) against a protein sequence database;

(4) TBLASTN compares a query protein sequence against a nucleotide sequence database 
translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence against 
the six-frame translations of a nucleotide sequence database. 
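
The five tasks listed above can be summarized as a lookup from query type and database type to program name. The sketch below is illustrative only: the names BLAST_PROGRAMS and choose_blast_program and the translate_both flag are hypothetical, BLAST3 is omitted for brevity, and the sketch does not invoke any BLAST software.

```python
# Program chosen for each (query type, database type) pair, per the list above.
BLAST_PROGRAMS = {
    ("protein", "protein"): "BLASTP",
    ("nucleotide", "nucleotide"): "BLASTN",
    ("nucleotide", "protein"): "BLASTX",
    ("protein", "nucleotide"): "TBLASTN",
}


def choose_blast_program(query_type, database_type, translate_both=False):
    """Return the program name matching the comparison to be performed.

    `translate_both` (a hypothetical flag) selects TBLASTX, which compares
    six-frame translations of a nucleotide query and a nucleotide database.
    """
    key = (query_type, database_type)
    if translate_both:
        if key != ("nucleotide", "nucleotide"):
            raise ValueError("TBLASTX requires nucleotide query and database")
        return "TBLASTX"
    if key not in BLAST_PROGRAMS:
        raise ValueError("unknown query or database type")
    return BLAST_PROGRAMS[key]
```

For example, a nucleotide query searched against a protein database corresponds to BLASTX under this mapping.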

The BLAST programs identify homologous sequences by identifying similar segments,
which are referred to herein as "high-scoring segment pairs," between a query amino or nucleic acid
sequence and a test sequence which is preferably obtained from a protein or nucleic acid sequence
database. High-scoring segment pairs are preferably identified (i.e., aligned) by means of a scoring
matrix, many of which are known in the art. Preferably, the scoring matrix used is the BLOSUM62
matrix (Gonnet et al., 1992; Henikoff and Henikoff, 1993, the disclosures of which are
incorporated by reference in their entireties). Less preferably, the PAM or PAM250 matrices may
also be used (see, e.g., Schwartz and Dayhoff, eds., 1978, the disclosure of which is incorporated by
reference in its entirety). The BLAST programs evaluate the statistical significance of all
high-scoring segment pairs identified, and preferably select those segments which satisfy a
user-specified threshold of significance, such as a user-specified percent homology. Preferably, the
statistical significance of a high-scoring segment pair is evaluated using the statistical significance
formula of Karlin (see, e.g., Karlin and Altschul, 1990), the disclosure of which is incorporated by
reference in its entirety. The BLAST programs may be used with the default parameters or with
modified parameters provided by the user.

Another preferred method for determining the best overall match between a query
nucleotide sequence (a sequence of the present invention) and a subject sequence, also referred to as
a global sequence alignment, uses the FASTDB computer program based on the
algorithm of Brutlag et al. (1990), the disclosure of which is incorporated by reference in its
entirety. In a sequence alignment the query and subject sequences are both DNA sequences. An
RNA sequence can be compared by first converting U's to T's. The result of said global sequence
alignment is expressed as percent identity. Preferred parameters used in a FASTDB alignment of DNA
sequences to calculate percent identity are: Matrix = Unitary, k-tuple = 4, Mismatch Penalty = 1,
Joining Penalty = 30, Randomization Group Length = 0, Cutoff Score = 1, Gap Penalty = 5, Gap
Size Penalty = 0.05, Window Size = 500 or the length of the subject nucleotide sequence,
whichever is shorter. If the subject sequence is shorter than the query sequence because of 5' or 3'
deletions, not because of internal deletions, a manual correction must be made to the results. This is
because the FASTDB program does not account for 5' and 3' truncations of the subject sequence
when calculating percent identity. For subject sequences truncated at the 5' or 3' ends, relative to the
query sequence, the percent identity is corrected by calculating the number of bases of the query
sequence that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent of
the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by
results of the FASTDB sequence alignment. This percentage is then subtracted from the percent
identity, calculated by the above FASTDB program using the specified parameters, to arrive at a
final percent identity score. This corrected score is what is used for the purposes of the present
invention. Only nucleotides outside the 5' and 3' nucleotides of the subject sequence, as displayed
by the FASTDB alignment, which are not matched/aligned with the query sequence, are calculated
for the purposes of manually adjusting the percent identity score. For example, a 90 nucleotide
subject sequence is aligned to a 100 nucleotide query sequence to determine percent identity. The
deletions occur at the 5' end of the subject sequence and therefore, the FASTDB alignment does not
show a match/alignment of the first 10 nucleotides at the 5' end. The 10 unpaired nucleotides
represent 10% of the sequence (number of nucleotides at the 5' and 3' ends not matched/total
number of nucleotides in the query sequence) so 10% is subtracted from the percent identity score
calculated by the FASTDB program. If the remaining 90 nucleotides were perfectly matched the
final percent identity would be 90%. In another example, a 90 nucleotide subject sequence is
compared with a 100 nucleotide query sequence. This time the deletions are internal deletions so
that there are no nucleotides on the 5' or 3' of the subject sequence which are not matched/aligned
with the query. In this case the percent identity calculated by FASTDB is not manually corrected.
Once again, only nucleotides 5' and 3' of the subject sequence which are not matched/aligned with
the query sequence are manually corrected. No other manual corrections are made for the purposes
of the present invention.
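
The manual correction described above reduces to a single subtraction, which can be sketched as follows. This is an illustration under stated assumptions, not part of the FASTDB program: the function names `rna_to_dna` and `corrected_percent_identity` are hypothetical, and the counts of unmatched terminal query bases are assumed to have been read off the FASTDB alignment by the user.

```python
def rna_to_dna(sequence):
    """Convert an RNA sequence to DNA by replacing U's with T's."""
    return sequence.upper().replace("U", "T")


def corrected_percent_identity(fastdb_identity, query_length,
                               unmatched_5prime=0, unmatched_3prime=0):
    """Apply the terminal-truncation correction to a FASTDB percent identity.

    fastdb_identity  -- percent identity reported by FASTDB (0 to 100)
    query_length     -- total number of bases in the query sequence
    unmatched_5prime -- query bases 5' of the subject that are not matched/aligned
    unmatched_3prime -- query bases 3' of the subject that are not matched/aligned
    Internal deletions are NOT counted; they are left to FASTDB itself.
    """
    if query_length <= 0:
        raise ValueError("query_length must be positive")
    overhang = unmatched_5prime + unmatched_3prime
    if overhang < 0 or overhang > query_length:
        raise ValueError("unmatched terminal bases must lie within the query")
    penalty = 100.0 * overhang / query_length
    return fastdb_identity - penalty


# First worked example from the text: 100-base query, 90-base subject,
# first 10 query bases unmatched at the 5' end, remainder perfectly matched.
print(corrected_percent_identity(100.0, 100, unmatched_5prime=10))  # 90.0
```

When the deletions are internal, both terminal counts are zero and the value reported by FASTDB is returned unchanged, as in the second example of the text.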

Another preferred method for determining the best overall match between a query amino
acid sequence (a sequence of the present invention) and a subject sequence, also referred to as a
global sequence alignment, uses the FASTDB computer program based on the
algorithm of Brutlag et al. (1990). In a sequence alignment the query and subject sequences are
both amino acid sequences. The result of said global sequence alignment is expressed as percent identity.
Preferred parameters used in a FASTDB amino acid alignment are: Matrix = PAM 0, k-tuple = 2,
Mismatch Penalty = 1, Joining Penalty = 20, Randomization Group Length = 0, Cutoff Score = 1,
Window Size = sequence length, Gap Penalty = 5, Gap Size Penalty = 0.05, Window Size = 500 or
the length of the subject amino acid sequence, whichever is shorter. If the subject sequence is
shorter than the query sequence due to N- or C-terminal deletions, not because of internal deletions,
the results, in percent identity, must be manually corrected. This is because the FASTDB program
does not account for N- and C-terminal truncations of the subject sequence when calculating global
percent identity. For subject sequences truncated at the N- and C-termini, relative to the query
sequence, the percent identity is corrected by calculating the number of residues of the query
sequence that are N- and C-terminal of the subject sequence, which are not matched/aligned with a
corresponding subject residue, as a percent of the total residues of the query sequence. Whether a
residue is matched/aligned is determined by results of the FASTDB sequence alignment. This
percentage is then subtracted from the percent identity, calculated by the above FASTDB program
using the specified parameters, to arrive at a final percent identity score. This final percent identity
score is what is used for the purposes of the present invention. Only residues to the N- and
C-termini of the subject sequence, which are not matched/aligned with the query sequence, are
considered for the purposes of manually adjusting the percent identity score. That is, only query
amino acid residues outside the farthest N- and C-terminal residues of the subject sequence are
considered. For example, a 90 amino acid residue subject sequence is aligned with a 100-residue query sequence to
determine percent identity. The deletion occurs at the N-terminus of the subject sequence and
therefore, the FASTDB alignment does not match/align the first 10 residues at the N-terminus.
The 10 unpaired residues represent 10% of the sequence (number of residues at the N- and
C-termini not matched/total number of residues in the query sequence) so 10% is subtracted from the
percent identity score calculated by the FASTDB program. If the remaining 90 residues were
perfectly matched the final percent identity would be 90%. In another example, a 90-residue
subject sequence is compared with a 100-residue query sequence. This time the deletions are
internal so there are no residues at the N- or C-termini of the subject sequence which are not
matched/aligned with the query. In this case the percent identity calculated by FASTDB is not
manually corrected. Once again, only residue positions outside the N- and C-terminal ends of the
subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned with the
query sequence are manually corrected. No other manual corrections are made for the purposes of
the present invention.

The term "percentage of sequence similarity" refers to comparisons between polypeptide
sequences and is determined by comparing two optimally aligned sequences over a comparison
window, wherein the portion of the polypeptide sequence in the comparison window may comprise
additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise
additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by
determining the number of positions at which an identical or equivalent amino acid residue occurs
in both sequences to yield the number of matched positions, dividing the number of matched
positions by the total number of positions in the window of comparison and multiplying the result
by 100 to yield the percentage of sequence similarity. Similarity is evaluated using any of the
variety of sequence comparison algorithms and programs known in the art, including those
described above in this section. Equivalent amino acid residues are defined herein in the "Mutated
polypeptides" section.
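
The calculation in the preceding paragraph can be sketched as below. This is a minimal illustration, assuming the two sequences are already optimally aligned, that gaps are written as "-", and that every column of the alignment counts as one position of the comparison window. The groups in `EQUIVALENT_GROUPS` are hypothetical placeholders chosen for the example; the equivalence classes actually intended are those defined in the "Mutated polypeptides" section.

```python
# Hypothetical equivalence groups, for illustration only.
EQUIVALENT_GROUPS = [
    set("ILVM"), set("FWY"), set("KRH"),
    set("DE"), set("NQ"), set("ST"), set("AG"),
]


def percent_similarity(aligned_a, aligned_b, groups=EQUIVALENT_GROUPS):
    """Percentage of window positions holding identical or equivalent residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("comparison window is empty")
    matched = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # a gap position stays in the window but cannot match
        if a == b or any(a in g and b in g for g in groups):
            matched += 1
    return 100.0 * matched / len(aligned_a)


# Six-position window: M/M, K/R, L/L, V/I and A/A match; -/G does not.
print(round(percent_similarity("MKLV-A", "MRLIGA"), 1))  # 83.3
```

Passing an empty `groups` list turns the same function into a percent identity calculation, since only identical residues then count as matched.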



POLYNUCLEOTIDES OF THE INVENTION 

The present invention concerns GENSET genomic and cDNA sequences. The present
invention encompasses GENSET genes, polynucleotides comprising GENSET genomic and cDNA
sequences, as well as fragments and variants thereof. These polynucleotides may be purified, 
isolated, or recombinant. 

Also encompassed by the present invention are allelic variants, orthologs, splice variants,
and/or species homologues of the GENSET genes. Procedures known in the art can be used to
obtain full-length genes and cDNAs, allelic variants, splice variants, full-length coding portions,
orthologs, and/or species homologues of genes and cDNAs corresponding to a nucleotide sequence
selected from the group consisting of sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and
sequences of clone inserts of the deposited clone pool, using information from the sequences
disclosed herein or the clone pool deposited with the ATCC or other depositary authority. For
example, allelic variants, orthologs and/or species homologues may be isolated and identified by
making suitable probes or primers from the sequences provided herein and screening a suitable
nucleic acid source for allelic variants and/or the desired homologue using any technique known to
those skilled in the art, including those described in the section entitled "To find similar
sequences".

In a specific embodiment, the polynucleotides of the invention are at least 15, 30, 50, 100,
125, 500, or 1000 continuous nucleotides. In another embodiment, the polynucleotides are less than
or equal to 300kb, 200kb, 100kb, 50kb, 10kb, 7.5kb, 5kb, 2.5kb, 2kb, 1.5kb, or 1kb in length. In a
further embodiment, polynucleotides of the invention comprise a portion of the coding sequences,
as disclosed herein, but do not comprise all or a portion of any intron. In another embodiment, the
polynucleotides comprising coding sequences do not contain coding sequences of a genomic 
flanking gene (i.e., 5' or 3' to the gene of interest in the genome). In other embodiments, the 
polynucleotides of the invention do not contain the coding sequence of more than 1000, 500, 250, 
100, 75, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 1 naturally occurring genomic flanking gene(s). 

Deposited clone pool of the invention

Expression of GENSET genes has been shown to lead to the production of at least one
mRNA species per GENSET gene, the cDNA sequence of which is set forth in the appended sequence
listing as SEQ ID NOs: 1-169, 339-455, 561-784. The cDNAs (SEQ ID NOs: 1-169, 339-455,
561-784) corresponding to these GENSET mRNA species were cloned either in the vector pBluescript II
SK- (Stratagene) or in a vector called pPT. Cells containing the cloned cDNAs of the present
invention are maintained in permanent deposit by the inventors at Genset, S.A., 24 Rue Royale,
75008 Paris, France. Each cDNA can be removed from the Bluescript vector in which it was
inserted by performing a NotI/PstI double digestion, or from the pPT vector by performing a MunI/HindIII
double digestion, to produce the appropriate fragment for each clone, provided the cDNA
sequence does not contain any of the corresponding restriction sites within its sequence.
Alternatively, other restriction enzymes of the multicloning site of the vector may be used to
recover the desired insert as indicated by the manufacturer.
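
The proviso above, that the cDNA must not itself contain a site for either enzyme of the double digestion, can be checked with a simple string search. The sketch below is illustrative: the dictionary and function names are hypothetical, the vector keys are shorthand for the two vectors named above, and the recognition sequences are the commonly published ones for NotI, PstI, MunI and HindIII. All four sites are palindromic, so scanning one strand is sufficient.

```python
# Commonly published recognition sequences (5' to 3').
RECOGNITION_SITES = {
    "NotI": "GCGGCCGC",
    "PstI": "CTGCAG",
    "MunI": "CAATTG",
    "HindIII": "AAGCTT",
}

# Enzyme pair used for each vector, as described in the text.
VECTOR_DIGESTS = {
    "pBluescript": ("NotI", "PstI"),
    "pPT": ("MunI", "HindIII"),
}


def internal_sites(cdna, vector):
    """List the digestion enzymes that would also cut inside the cDNA insert."""
    sequence = cdna.upper()
    return [enzyme for enzyme in VECTOR_DIGESTS[vector]
            if RECOGNITION_SITES[enzyme] in sequence]


def can_excise(cdna, vector):
    """True if the double digestion releases the insert without cutting it."""
    return not internal_sites(cdna, vector)


insert = "atgAAGCTTcc"  # made-up sequence containing a HindIII site
print(internal_sites(insert, "pPT"))      # ['HindIII']
print(can_excise(insert, "pBluescript"))  # True
```

If `can_excise` returns False, other enzymes of the multicloning site would have to be considered, as the paragraph above notes.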

Pools of cells containing certain cDNAs of the invention, from which cells containing a
particular polynucleotide are obtainable, have also been deposited with the American Type Culture
Collection (ATCC), 10801 University Boulevard, Manassas, VA 20110-2209, United States. These
cDNA clones have been transfected into separate bacterial cells (E. coli) for these composite
deposits.

Bacterial cells containing a particular clone can be obtained from the composite deposit as
follows:

An oligonucleotide probe or probes should be designed to the sequence that is known for 
that particular clone. This sequence can be derived from the sequences provided herein, or from a 
combination of those sequences. The design of the oligonucleotide probe should preferably follow
these parameters:

(a) It should be designed to an area of the sequence which has the fewest ambiguous bases
("N's"), if any;

(b) Preferably, the probe is designed to have a Tm of approximately 80 degrees Celsius
(assuming 2 degrees for each A or T and 4 degrees for each G or C). However, probes having
melting temperatures between 40 degrees Celsius and 80 degrees Celsius may also be used provided
that specificity is not lost.
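
The Tm estimate in parameter (b) is a simple count of bases. The following sketch applies that rule (2 degrees for each A or T, 4 degrees for each G or C); the function name `estimate_tm` is hypothetical, and rejecting ambiguous bases outright is an assumption made for the example, in the spirit of parameter (a).

```python
def estimate_tm(probe):
    """Estimate probe Tm in degrees Celsius: 2 per A or T, 4 per G or C."""
    sequence = probe.upper()
    at_count = sequence.count("A") + sequence.count("T")
    gc_count = sequence.count("G") + sequence.count("C")
    if at_count + gc_count != len(sequence):
        # Ambiguous bases such as N cannot be scored by this rule.
        raise ValueError("probe contains bases other than A, C, G and T")
    return 2 * at_count + 4 * gc_count


print(estimate_tm("GCGCATAT"))           # 4 G/C and 4 A/T give 24
print(estimate_tm("G" * 10 + "C" * 10))  # 20 G/C give 80
```

Under this rule a probe reaches the preferred 80 degrees Celsius with, for example, 20 G or C bases, or with 10 G or C bases plus 20 A or T bases.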

The oligonucleotide should preferably be labeled with gamma-[32P]ATP (specific activity
6000 Ci/mmole) and T4 polynucleotide kinase using commonly employed techniques for labeling
oligonucleotides. Other labeling techniques can also be used. Unincorporated label should
preferably be removed by gel filtration chromatography or other established methods. The amount
of radioactivity incorporated into the probe should be quantified by measurement in a scintillation 
counter. Preferably, specific activity of the resulting probe should be approximately 4x10* 
dpm/pmole. 

The bacterial culture containing the pool of full-length clones should preferably be thawed
and 100 ul of the stock used to inoculate a sterile culture flask containing 25 ml of sterile L-broth
containing ampicillin at 100 ug/ml. The culture should preferably be grown to saturation at 37
degrees Celsius, and the saturated culture should preferably be diluted in fresh L-broth. Aliquots of
these dilutions should preferably be plated to determine the dilution and volume which will yield
approximately 5000 distinct and well-separated colonies on solid bacteriological media containing
L-broth containing ampicillin at 100 ug/ml and agar at 1.5% in a 150 mm petri dish when grown
overnight at 37 degrees Celsius. Other known methods of obtaining distinct, well-separated 
colonies can also be employed.

Standard colony hybridization procedures should then be used to transfer the colonies to
nitrocellulose filters and lyse, denature and bake them. 

The filter is then preferably incubated at 65 degrees Celsius for 1 hour with gentle agitation
in 6X SSC (20X stock is 175.3 g NaCl/liter, 88.2 g Na citrate/liter, adjusted to pH 7.0 with NaOH)
containing 0.5% SDS, 100 ug/ml of yeast RNA, and 10 mM EDTA (approximately 10 ml per 150
mm filter). Preferably, the probe is then added to the hybridization mix at a concentration greater
than or equal to 1x10* dpm/ml. The filter is then preferably incubated at 65 degrees Celsius with 
gentle agitation overnight. The filter is then preferably washed in 500 ml of 2X SSC/0.1% SDS at 
room temperature with gentle shaking for 15 minutes. A third wash with 0.1X SSC/0.5% SDS at 65
degrees Celsius for 30 minutes to 1 hour is optional. The filter is then preferably dried and
subjected to autoradiography for sufficient time to visualize the positives on the X-ray film. Other
known hybridization methods can also be employed. 

The positive colonies are picked, grown in culture, and plasmid DNA isolated using 
standard procedures. The clones can then be verified by restriction analysis, hybridization analysis, 
or DNA sequencing. The plasmid DNA obtained using these procedures may then be manipulated
using standard cloning techniques familiar to those skilled in the art. 

Alternatively, to recover cDNA inserts from the pool of bacteria, a PCR can be performed
on plasmid DNA isolated using standard procedures and primers designed at both ends of the cDNA
insertion, including primers designed in the multicloning site of the vector. If a specific cDNA of
interest is to be recovered, primers may be designed in order to be specific for the 5' end and the 3'
end of this cDNA using sequence information available from the appended sequence listing. The 
PCR product which corresponds to the cDNA of interest can then be manipulated using standard 
cloning techniques familiar to those skilled in the art. 

Therefore, an object of the invention is an isolated, purified, or recombinant polynucleotide 
comprising a nucleotide sequence selected from the group consisting of cDNA inserts of the
deposited clone pool. Moreover, preferred polynucleotides of the invention include purified, 
isolated, or recombinant GENSET cDNAs consisting of, consisting essentially of, or comprising a 
nucleotide sequence selected from the group consisting of cDNA inserts of the deposited clone 
pool. 

cDNA sequences of the invention

Another object of the invention is a purified, isolated, or recombinant polynucleotide
comprising a nucleotide sequence selected from the group consisting of sequences of SEQ ID
NOs: 1-169, 339-455, 561-784, complementary sequences thereto, and fragments thereof.
Moreover, preferred polynucleotides of the invention include purified, isolated, or recombinant
GENSET cDNAs consisting of, consisting essentially of, or comprising a sequence selected from
the group consisting of SEQ ID NOs: 1-169, 339-455, 561-784.

Accordingly, the coding sequence (CDS) or open reading frame (ORF) of each cDNA of
the invention refers to the nucleotide sequence beginning with the first nucleotide of the start codon
and ending with the last nucleotide of the stop codon. Similarly, the 5' untranslated region (or
5'UTR) of each cDNA of the invention refers to the nucleotide sequence starting at nucleotide 1
and ending at the nucleotide immediately 5' to the first nucleotide of the start codon. The 3'
untranslated region (or 3'UTR) of each cDNA of the invention refers to the nucleotide sequence
starting at the nucleotide immediately 3' to the last nucleotide of the stop codon and ending at the
last nucleotide of the cDNA.
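
These three definitions partition a cDNA into consecutive, non-overlapping regions. A minimal sketch follows, assuming the 1-based positions of the first nucleotide of the start codon and of the last nucleotide of the stop codon are already known (for example from the sequence listing annotation); the function name `split_cdna` and the example sequence are hypothetical.

```python
def split_cdna(cdna, cds_start, cds_end):
    """Split a cDNA into (5'UTR, CDS, 3'UTR).

    cds_start -- 1-based position of the first nucleotide of the start codon
    cds_end   -- 1-based position of the last nucleotide of the stop codon
    """
    if not 1 <= cds_start <= cds_end <= len(cdna):
        raise ValueError("CDS coordinates must lie within the cDNA")
    five_prime_utr = cdna[:cds_start - 1]   # nucleotide 1 up to just before the start codon
    cds = cdna[cds_start - 1:cds_end]       # start codon through stop codon, inclusive
    three_prime_utr = cdna[cds_end:]        # just after the stop codon to the last nucleotide
    return five_prime_utr, cds, three_prime_utr


# Made-up 13-nucleotide example: GG + ATGAAATAA + CC
print(split_cdna("GGATGAAATAACC", 3, 11))  # ('GG', 'ATGAAATAA', 'CC')
```

Concatenating the three returned strings always reproduces the input cDNA, which is a convenient sanity check on the coordinates.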

Untranslated regions 

In addition, the invention concerns a purified, isolated, and recombinant nucleic acid
comprising a nucleotide sequence selected from the group consisting of the 5'UTRs of sequences of
SEQ ID NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited clone pool,
sequences complementary thereto, and allelic variants thereof. The invention also concerns a
purified, isolated, and/or recombinant nucleic acid comprising a nucleotide sequence selected from
the group consisting of the 3'UTRs of sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and
sequences of clone inserts of the deposited clone pool, sequences complementary thereto, and allelic
variants thereof.

These polynucleotides may be used to detect the presence of GENSET mRNA species in a 
biological sample using either hybridization or RT-PCR techniques well known to those skilled in 
the art. 

In addition, these polynucleotides may be used as regulatory molecules able to affect the
processing and maturation of any polynucleotide including them (either a GENSET polynucleotide
or a heterologous polynucleotide), preferably the localization, stability and/or translation of said
polynucleotide including them (for a review on UTRs see Decker and Parker, 1995; Derrigo et al.,
2000). In particular, 3'UTRs may be used in order to control the stability of heterologous mRNAs
in recombinant vectors using any methods known to those skilled in the art, including those of Makrides
(1999, Protein Expr Purif 17(2):183-202) and US Patents 5,925,564; 5,807,707 and
5,756,264, which disclosures are hereby incorporated by reference in their entireties.

Coding sequences 

Another object of the invention is an isolated, purified or recombinant polynucleotide 
comprising the coding sequence of a sequence selected from the group consisting of sequences of 
SEQ ID NOs: 1-169, 339-455, 561-784, clone inserts of the deposited clone pool, and variants
thereof. 

A further object of the invention is an isolated, purified or recombinant polynucleotide
encoding a polypeptide comprising a sequence selected from the group consisting of sequences of
SEQ ID NOs: 170-338, 456-560, 785-918 and allelic variants thereof. Another object of the
invention is an isolated, purified or recombinant polynucleotide encoding a polypeptide comprising
a sequence selected from the group consisting of polypeptides encoded by cDNA inserts of the
deposited clone pool and allelic variants thereof.

It will be appreciated that should the extent of the coding sequence differ from that
indicated in the appended sequence listing as a result of a sequencing error, reverse transcription or
amplification error, mRNA splicing, post-translational modification of the encoded protein,
enzymatic cleavage of the encoded protein, or other biological factors, one skilled in the art would
be readily able to identify the extent of the coding sequences in the sequences of SEQ ID NOs: 1-169,
339-455, 561-784. Accordingly, the scope of any claims herein relating to nucleic acids
containing the coding sequence of one of SEQ ID NOs: 1-169, 339-455, 561-784 is not to be
construed as excluding any readily identifiable variations from or equivalents to the coding
sequences described in the appended sequence listing. Equivalents include any alteration in a
nucleotide coding sequence that does not result in an amino acid change, or that results in a
conservative amino acid substitution, as defined below, in the polypeptide encoded by the
nucleotide sequence. Similarly, should the extent of the polypeptides differ from those indicated in
the appended sequence listing as a result of any of the preceding factors, the scope of claims
relating to polypeptides comprising the amino acid sequence of the polypeptides of SEQ ID
NOs: 170-338, 456-560, 785-918 is not to be construed as excluding any readily identifiable
variations from or equivalents to the sequences described in the appended sequence listing.
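
The first kind of equivalent named above, a nucleotide alteration that does not change the encoded amino acid, can be tested by translating both coding sequences and comparing the results. The sketch below assumes the standard genetic code and coding sequences of equal reading frame; the names `translate` and `is_silent` are hypothetical, and conservative substitutions, which depend on definitions given elsewhere in the specification, are deliberately not covered.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence; stop codons are returned as '*'."""
    sequence = cds.upper()
    if len(sequence) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[sequence[i:i + 3]]
                   for i in range(0, len(sequence), 3))


def is_silent(reference_cds, altered_cds):
    """True if both coding sequences encode the same amino acid sequence."""
    return translate(reference_cds) == translate(altered_cds)


print(translate("ATGAAATAA"))               # MK*
print(is_silent("ATGAAATAA", "ATGAAGTAA"))  # True: AAA and AAG both encode K
print(is_silent("ATGAAATAA", "ATGGAATAA"))  # False: AAA (K) became GAA (E)
```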

The above-disclosed polynucleotides that contain the coding sequence of the GENSET 
genes may be expressed in a desired host cell or a desired host organism, when this polynucleotide 
is placed under the control of suitable expression signals. The expression signals may be either the
expression signals contained in the regulatory regions in the GENSET genes of the invention or, in
contrast, the signals may be exogenous regulatory nucleic sequences. Such a polynucleotide, when 
placed under the suitable expression signals, may also be inserted in a vector for its expression 
and/or amplification. 

Further included in the present invention are polynucleotides encoding the polypeptides of
the present invention that are fused in frame to the coding sequences for additional heterologous
amino acid sequences. Also included in the present invention are nucleic acids encoding 
polypeptides of the present invention together with additional, non-coding sequences, including, but 
not limited to, non-coding 5' and 3' sequences, vector sequence, sequences used for purification, 
probing, or priming. For example, heterologous sequences include transcribed, untranslated
sequences that may play a role in transcription and mRNA processing, such as ribosome binding
and stability of mRNA. The heterologous sequences may alternatively comprise additional coding 
sequences that provide additional functionalities. Thus, a nucleotide sequence encoding a 
polypeptide may be fused to a tag sequence, such as a sequence encoding a peptide that facilitates 
purification or detection of the fused polypeptide. In certain preferred embodiments of this aspect
of the invention, the tag amino acid sequence is a hexa-histidine peptide, such as the tag provided in
a pQE vector (QIAGEN), or in any of a number of additional, commercially available vectors. For
instance, hexa-histidine provides for the convenient purification of the fusion protein (see Gentz et
al., 1989, Proc Natl Acad Sci U S A 86(3):821-4, the disclosure of which is incorporated by
reference in its entirety). The "HA" tag is another peptide useful for purification which corresponds
to an epitope derived from the influenza hemagglutinin protein (see Wilson et al., 1984, Cell
37(3):767-78, the disclosure of which is incorporated by reference in its entirety). As discussed
below, other such fusion proteins include a GENSET polypeptide fused to Fc at the N- or 
C-terminus. 

Suitable recombinant vectors that contain a polynucleotide such as described herein are
disclosed elsewhere in the specification. Expression vectors encoding GENSET polypeptides or
fragments thereof are described in the section entitled "Preparation of the polypeptides". 

Regulatory sequences of the invention 

As mentioned, the genomic sequences of GENSET genes contain regulatory sequences in
the non-coding 5'-flanking region and possibly in the non-coding 3'-flanking region that border the
GENSET polypeptide coding regions containing the exons of these genes. 

Polynucleotides derived from GENSET polynucleotide 5' and 3' regulatory regions are 
useful in order to detect the presence of at least a copy of a genomic nucleotide sequence of the 
GENSET gene or a fragment thereof in a test sample. 

Preferred regulatory sequences

Polynucleotides carrying the regulatory elements located at the 5' end and at the 3' end of 
GENSET polypeptide coding regions may be advantageously used to control, e.g., the 
transcriptional and translational activity of a heterologous polynucleotide of interest. 

Thus, the present invention also concerns a purified or isolated nucleic acid comprising a
polynucleotide which is selected from the group consisting of the 5' and 3' GENSET
polynucleotide regulatory regions, sequences complementary thereto, regulatory active fragments
and variants thereof. The invention also pertains to a purified or isolated nucleic acid comprising a
polynucleotide having at least 95% nucleotide identity with a polynucleotide selected from the
group consisting of GENSET polynucleotide 5' and 3' regulatory regions, advantageously 99%
nucleotide identity, preferably 99.5% nucleotide identity and most preferably 99.8% nucleotide
identity with a polynucleotide selected from the group consisting of GENSET polynucleotide 5' and
3' regulatory regions, sequences complementary thereto, variants and regulatory active fragments
thereof.

Another object of the invention consists of purified, isolated or recombinant nucleic acids 
comprising a polynucleotide that hybridizes, under the stringent hybridization conditions defined
herein, with a polynucleotide selected from the group consisting of the nucleotide sequences of 
GENSET polynucleotide 5' and 3' regulatory regions, sequences complementary thereto, variants 
and regulatory active fragments thereof. 

Preferred fragments of 5' regulatory regions have a length of about 1500 or 1000 
nucleotides, preferably of about 500 nucleotides, more preferably about 400 nucleotides, even more
preferably 300 nucleotides and most preferably about 200 nucleotides.

Preferred fragments of 3' regulatory regions are at least 20, 50, 100, 150, 200, 300 or 400
bases in length. 

"Regulatory active" polynucleotide derivatives of the 5' or 3' regulatory region are 
polynucleotides comprising or alternatively consisting of a fragment of said polynucleotide which is
functional as a regulatory region for expressing a recombinant polypeptide or a recombinant
polynucleotide in a recombinant cell host. Such a fragment could act either as an enhancer or as a repressor. For
the purpose of the invention, a nucleic acid or polynucleotide is "functional" as a regulatory region
for expressing a recombinant polypeptide or a recombinant polynucleotide if said regulatory
polynucleotide contains nucleotide sequences which contain transcriptional and translational
regulatory information, and such sequences are "operably linked" to nucleotide sequences which
encode the desired polypeptide or the desired polynucleotide.

The regulatory polynucleotides of the invention may be prepared from the nucleotide
sequence of GENSET genomic or cDNA sequence, for example, by cleavage using suitable
restriction enzymes, or by PCR. The regulatory polynucleotides may also be prepared by digestion
of a GENSET gene-containing genomic clone by an exonuclease enzyme, such as Bal31 (Wabiko et
al., DNA 5(4):305-14 (1986), the disclosure of which is incorporated by reference in its entirety).
These regulatory polynucleotides can also be prepared by nucleic acid chemical synthesis, as 
described elsewhere in the specification. 

The regulatory polynucleotides according to the invention may be part of a recombinant
expression vector that may be used to express a coding sequence in a desired host cell or host
organism. The recombinant expression vectors according to the invention are described elsewhere 
in the specification. 

Preferred 5'-regulatory polynucleotides of the invention include 5'-UTRs of GENSET
cDNAs, or regulatory active fragments or variants thereof. More preferred 5'-regulatory
polynucleotides of the invention include sequences selected from the group consisting of 5'-UTRs
of sequences of SEQ ID NOs: 1-169, 339-455, 561-784, 5'-UTRs of clone inserts of the deposited
clone pool, regulatory active fragments and variants thereof. 

Preferred 3'-regulatory polynucleotides of the invention include 3'-UTRs of GENSET
cDNAs, or regulatory active fragments or variants thereof. More preferred 3'-regulatory
polynucleotides of the invention include sequences selected from the group consisting of 3'-UTRs
of sequences of SEQ ID NOs: 1-169, 339-455, 561-784, 3'-UTRs of clone inserts of the deposited
clone pool, regulatory active fragments and variants thereof. 

A further object of the invention consists of a purified or isolated nucleic acid comprising:

a) a polynucleotide comprising a 5' regulatory nucleotide sequence selected from the group
consisting of:

(i) a nucleotide sequence comprising a polynucleotide of a GENSET polynucleotide 5'
regulatory region or a complementary sequence thereto;

(ii) a nucleotide sequence comprising a polynucleotide having at least 95% of nucleotide
identity with the nucleotide sequence of a GENSET polynucleotide 5' regulatory region or a 
complementary sequence thereto; 

(iii) a nucleotide sequence comprising a polynucleotide that hybridizes under stringent
hybridization conditions with the nucleotide sequence of a GENSET polynucleotide 5' regulatory
region or a complementary sequence thereto; and

(iv) a regulatory active fragment or variant of the polynucleotides in (i), (ii) and (iii); 

b) a nucleic acid molecule encoding a desired polypeptide or a nucleic acid molecule of 
interest, wherein said nucleic acid molecule is operably linked to the polynucleotide defined in (a); 

10 and 

c) optionally, a polynucleotide comprising a 3'- regulatory polynucleotide, preferably a 3'- 
regulatory polynucleotide of a GENSET gene. 

In a specific embodiment, the nucleic acid defined above includes the 5'-UTR of a
GENSET cDNA, or a regulatory active fragment or variant thereof.

In a second specific embodiment, the nucleic acid defined above includes the 3'-UTR of a
GENSET cDNA, or a regulatory active fragment or variant thereof.

The regulatory polynucleotide of the 5' regulatory region, or its regulatory active fragments
or variants, is operably linked at the 5'-end of the nucleic acid molecule encoding the desired
polypeptide or nucleic acid molecule of interest.

The regulatory polynucleotide of the 3' regulatory region, or its regulatory active fragments
or variants, is advantageously operably linked at the 3'-end of the nucleic acid molecule encoding
the desired polypeptide or nucleic acid molecule of interest.

The desired polypeptide encoded by the above-described nucleic acid may be of various
nature or origin, encompassing proteins of prokaryotic, viral or eukaryotic origin. The
polypeptides that may be expressed under the control of a GENSET polynucleotide regulatory region
include bacterial, fungal or viral antigens. Also encompassed are eukaryotic proteins such as
intracellular proteins (for example "house-keeping" proteins), membrane-bound proteins (for
example mitochondrial membrane-bound proteins and cell surface receptors), and secreted proteins
(for example endogenous mediators such as cytokines). The desired polypeptide may be a
heterologous polypeptide or a GENSET polypeptide, especially a protein with an amino acid
sequence selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456-560,
785-918, fragments and variants thereof.

The desired nucleic acid encoded by the above-described polynucleotide, usually an RNA
molecule, may be complementary to a desired coding polynucleotide, for example to a GENSET
coding sequence, and thus useful as an antisense polynucleotide. Such a polynucleotide may be
included in a recombinant expression vector in order to express the desired polypeptide or the
desired nucleic acid in a host cell or in a host organism. Suitable recombinant vectors that contain a
polynucleotide such as described herein are disclosed elsewhere in the specification.

The invention also relates to variants of the polynucleotides described herein and fragments
thereof. "Variants" of polynucleotides, as the term is used herein, are polynucleotides that differ
from a reference polynucleotide. Generally, differences are limited so that the nucleotide sequences
of the reference and the variant are closely similar overall and, in many regions, identical. The
present invention encompasses both allelic variants and degenerate variants.

Examples of variant sequences of polynucleotides of the invention are given in the 
appended sequence listing. Specifically, Table I includes sequences for which a plurality of closely 
related sequences, e.g. variants, are provided. 

Allelic variants

A variant of a polynucleotide may be a naturally occurring variant such as a naturally
occurring allelic variant, or it may be a variant that is not known to occur naturally. By an "allelic
variant" is intended one of several alternate forms of a gene occupying a given locus on a
chromosome of an organism (see Lewin, 1990, the disclosure of which is incorporated by reference
in its entirety). Diploid organisms may be homozygous or heterozygous for an allelic form.
Non-naturally occurring variants of the polynucleotide may be made by art-known mutagenesis
techniques, including those applied to polynucleotides, cells or organisms. See, for example, Table
I, which includes sequences for which a plurality of closely related sequences, e.g. allelic variants of
a single gene, are provided.

Degenerate variants

In addition to the isolated polynucleotides of the present invention, and fragments thereof,
the invention further includes polynucleotides which comprise a sequence substantially different
from those described above but which, due to the degeneracy of the genetic code, still encode a
GENSET polypeptide of the present invention. These polynucleotide variants are referred to as
"degenerate variants" throughout the instant application. That is, all possible polynucleotide
sequences that encode the GENSET polypeptides of the present invention are contemplated. This
includes the genetic code and species-specific codon preferences known in the art. Thus, it would
be routine for one skilled in the art to generate the degenerate variants described above, for instance,
to optimize codon expression for a particular host (e.g., change codons in the human mRNA to
those preferred by other mammalian or bacterial host cells).
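
By way of illustration only, the number of degenerate variants that encode a given polypeptide can be obtained by multiplying, over its residues, the number of synonymous codons for each residue. The Python sketch below assumes the standard genetic code; the function names and the three-residue example peptide are hypothetical and are not taken from the sequence listing.

```python
from math import prod

# Standard genetic code, codons ordered with T, C, A, G at each position.
# '*' marks a stop codon. This table is an assumption of the sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def synonymous_codons(amino_acid):
    """Return every codon that encodes the given one-letter amino acid."""
    return sorted(codon for codon, aa in CODON_TABLE.items() if aa == amino_acid)


def count_degenerate_variants(peptide):
    """Count the distinct coding sequences that encode the peptide."""
    return prod(len(synonymous_codons(aa)) for aa in peptide)


def translate(dna):
    """Translate a DNA coding sequence, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # Hypothetical peptide Met-Lys-Leu: 1 x 2 x 6 codon choices.
    print(count_degenerate_variants("MKL"))
    # Two different coding sequences that give the same peptide.
    print(translate("ATGAAACTG"), translate("ATGAAGTTA"))
```

Choosing among the synonymous codons according to the preferences of a particular host is one way such a table might be used for codon optimization; host-specific codon usage frequencies are not included in this sketch.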

Nucleotide changes present in a variant polynucleotide may be silent, which means that
they do not alter the amino acids encoded by the polynucleotide. However, nucleotide changes may
also result in amino acid substitutions, additions, deletions, fusions and truncations in the
polypeptide encoded by the reference sequence. The substitutions, deletions or additions may
involve one or more nucleotides. The variants may be altered in coding or non-coding regions or
both. Alterations in the coding regions may produce conservative or non-conservative amino acid
substitutions, deletions or additions. In the context of the present invention, preferred embodiments
are those in which the polynucleotide variants encode polypeptides which retain substantially the
same biological properties or activities as the GENSET protein. More preferred polynucleotide
variants are those containing conservative substitutions.

Similar polynucleotides 

Other embodiments of the present invention provide a purified, isolated or recombinant
polynucleotide which is at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to a
polynucleotide selected from the group consisting of sequences of SEQ ID NOs: 1-169, 339-455,
561-784 and the clone inserts of the deposited clone pool. The above polynucleotides are included
regardless of whether they encode a polypeptide having a GENSET biological activity. This is
because even where a particular nucleic acid molecule does not encode a polypeptide having
activity, one of skill in the art would still know how to use the nucleic acid molecule, for instance,
as a hybridization probe or primer. Uses of the nucleic acid molecules of the present invention that
do not encode a polypeptide having GENSET activity include, inter alia, isolating a GENSET gene
or allelic variants thereof from a DNA library, and detecting GENSET mRNA expression in
biological samples suspected of containing GENSET mRNA or DNA, e.g., by Northern blot or
PCR analysis.

The present invention is further directed to polynucleotides having sequences with at least 50%,
60%, 70%, 80%, 90%, 95%, 96%, 97%, 98% or 99% identity to a polynucleotide selected from the
group consisting of sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and clone inserts of the
deposited clone pool, where said polynucleotides do, in fact, encode a polypeptide having a
GENSET biological activity. Of course, due to the degeneracy of the genetic code, one of ordinary
skill in the art will immediately recognize that a large number of the polynucleotides at least 50%,
60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to a polynucleotide selected from
the group consisting of sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and clone inserts of the
deposited clone pool will encode a polypeptide having biological activity. In fact, since degenerate
variants of these nucleotide sequences all encode the same polypeptide, this will be clear to the
skilled artisan even without performing the above described comparison assay. It will be further
recognized in the art that, for such nucleic acid molecules that are not degenerate variants, a
reasonable number will also encode a polypeptide having biological activity. This is because the
skilled artisan is fully aware of amino acid substitutions that are either less likely or not likely to
significantly affect protein function (e.g., replacing one aliphatic amino acid with a second aliphatic
amino acid), as further described below.

By a polynucleotide having a nucleotide sequence at least, for example, 95% "identical" to a
reference nucleotide sequence of the present invention, it is intended that the nucleotide sequence
of the polynucleotide is identical to the reference sequence except that the polynucleotide sequence
may include up to five point mutations per each 100 nucleotides of the reference nucleotide
sequence encoding the GENSET polypeptide. In other words, to obtain a polynucleotide having a
nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% of the
nucleotides in the reference sequence may be deleted, inserted, or substituted with another
nucleotide. The query sequence may be an entire sequence selected from the group consisting of
sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited
clone pool, or the ORF (open reading frame) of a polynucleotide sequence selected from said
group, or any fragment specified as described herein.
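
As a non-limiting illustration of the definition of percent identity given above, the following Python sketch computes how many point mutations a given identity threshold permits for a reference of a given length, and the percent identity of two sequences that are already aligned and of equal length. The function names are hypothetical. The sketch counts substitutions only; insertions and deletions would require a sequence alignment step that is not shown here.

```python
def max_point_mutations(reference_length, min_identity_percent):
    """Largest number of changed positions compatible with the identity threshold.

    For example, 95% identity over 100 nucleotides permits 5 changes.
    Integer arithmetic avoids floating-point rounding.
    """
    return reference_length * (100 - min_identity_percent) // 100


def percent_identity(reference, candidate):
    """Percent of matching positions between two pre-aligned, equal-length sequences."""
    if not reference or len(reference) != len(candidate):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(r == c for r, c in zip(reference, candidate))
    return 100.0 * matches / len(reference)


if __name__ == "__main__":
    print(max_point_mutations(100, 95))
    # Hypothetical 10-nucleotide sequences differing at one position.
    print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))
```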

Hybridizing Polynucleotides

In another aspect, the invention provides an isolated or purified nucleic acid molecule
comprising a polynucleotide which hybridizes under stringent hybridization conditions to any
polynucleotide of the present invention using any methods known to those skilled in the art
including those disclosed herein and in particular in the "To find similar sequences" section. Also
contemplated are nucleic acid molecules that hybridize to the polynucleotides of the present
invention at lower stringency hybridization conditions, preferably at moderate or low stringency
conditions as defined herein. Such hybridizing polynucleotides may be of at least 15, 18, 20, 23, 25,
28, 30, 35, 40, 50, 75, 100, 200, 300, 500 or 1000 nucleotides in length.

Of particular interest are polynucleotides hybridizing to any polynucleotide of the invention
and encoding GENSET polypeptides, particularly GENSET polypeptides exhibiting a GENSET
biological activity.

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as any 3'
terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 5' complementary stretch
of T (or U) residues, would not be included in the definition of "polynucleotide," since such a
polynucleotide would hybridize to any nucleic acid molecule containing a poly(A) stretch or the
complement thereof (e.g., practically any double-stranded cDNA clone generated using oligo dT as
a primer).

Complementary polynucleotides

The invention further provides isolated nucleic acid molecules having a nucleotide
sequence fully complementary to any polynucleotide of the invention. The present invention
encompasses a purified, isolated or recombinant polynucleotide having a nucleotide sequence
complementary to a sequence selected from the group consisting of sequences of SEQ ID NOs:
1-169, 339-455, 561-784, sequences of clone inserts of the deposited clone pool and fragments
thereof. Such isolated molecules, particularly DNA molecules, are useful as probes for gene
mapping and for identifying GENSET mRNA in a biological sample, for instance, by PCR or
Northern blot analysis.
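
For illustration, a fully complementary sequence, written 5' to 3', can be derived from a DNA sequence by complementing each base and reversing the result. The Python sketch below is a minimal example; the function name and the input sequences are hypothetical and the sketch handles only unambiguous A, C, G and T bases.

```python
# Translation table pairing each base with its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the fully complementary strand, written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGC"))
```

Applying the function twice returns the original sequence, which offers a simple consistency check.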

Polynucleotide fragments 

The present invention is further directed to polynucleotides encoding portions or fragments
of the nucleotide sequences described herein. Uses for the polynucleotide fragments of the present
invention include use as probes, primers and molecular weight markers, and for expressing the
polypeptide fragments of the present invention. Fragments include portions of polynucleotides
selected from the group consisting of a) the sequences of SEQ ID NOs: 1-169, 339-455, 561-784,
b) genomic GENSET sequences, c) the polynucleotides encoding a polypeptide selected from the
group consisting of the sequences of SEQ ID NOs: 170-338, 456-560, 785-918, d) the sequences of
clone inserts of the deposited clone pool, and e) the polynucleotides encoding the polypeptides
encoded by the clone inserts of the deposited clone pool. Particularly included in the present
invention is a purified or isolated polynucleotide comprising at least 8 consecutive bases of a
polynucleotide of the present invention. In one aspect of this embodiment, the polynucleotide
comprises at least 10, 12, 15, 18, 20, 25, 28, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, 500, 800,
1000, 1500, or 2000 consecutive nucleotides of a polynucleotide of the present invention.

In addition to the above preferred polynucleotide sizes, further preferred sub-genuses of
polynucleotides comprise at least 8 nucleotides, wherein "at least 8" is defined as any integer
between 8 and the integer representing the 3' most nucleotide position as set forth in the sequence
listing or elsewhere herein. Further included as preferred polynucleotides of the present invention
are polynucleotide fragments at least 8 nucleotides in length, as described above, that are further
specified in terms of their 5' and 3' position. The 5' and 3' positions are represented by the position
numbers set forth in the appended sequence listing. For allelic, degenerate and other variants,
position 1 is defined as the 5' most nucleotide of the ORF, i.e., the nucleotide "A" of the start codon
with the remaining nucleotides numbered consecutively. Therefore, every combination of a 5' and
3' nucleotide position that a polynucleotide fragment of the present invention, at least 8 contiguous
nucleotides in length, could occupy on a polynucleotide of the invention is included in the invention
as an individual species. The polynucleotide fragments specified by 5' and 3' positions can be
immediately envisaged and are therefore not individually listed solely for the purpose of not
unnecessarily lengthening the specification.

It is noted that the above species of polynucleotide fragments of the present invention may
alternatively be described by the formula "a to b"; where "a" equals the 5' most nucleotide position
and "b" equals the 3' most nucleotide position of the polynucleotide; and further where "a" equals
an integer between 1 and the number of nucleotides of the polynucleotide sequence of the present
invention minus 8, and where "b" equals an integer between 9 and the number of nucleotides of the
polynucleotide sequence of the present invention; and where "a" is an integer smaller than "b" by at
least 8.
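
As an illustration of the "a to b" formula, the Python sketch below enumerates and counts the position pairs it describes for a sequence of a given length. It assumes that the final constraint reads "a" smaller than "b" by at least 8; the function names are hypothetical and the sequence lengths used are arbitrary examples.

```python
def fragment_positions(sequence_length, min_gap=8):
    """Yield every (a, b) pair with 1 <= a <= length - min_gap and a + min_gap <= b <= length."""
    for a in range(1, sequence_length - min_gap + 1):
        for b in range(a + min_gap, sequence_length + 1):
            yield a, b


def count_fragments(sequence_length, min_gap=8):
    """Closed-form count of the pairs produced by fragment_positions."""
    span = sequence_length - min_gap
    return span * (span + 1) // 2 if span > 0 else 0


if __name__ == "__main__":
    # A hypothetical 10-nucleotide sequence has three such fragments.
    print(list(fragment_positions(10)))
    # The count grows quadratically with sequence length.
    print(count_fragments(1000))
```

The quadratic growth of the count suggests why the individual fragments are described by formula and not listed one by one.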

Therefore, the present invention encompasses isolated, purified, or recombinant
polynucleotides which consist of, consist essentially of, or comprise a contiguous span of at least 8,
10, 12, 15, 18, 20, 25, 28, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, 500, 1000 or 2000 nucleotides
of a sequence selected from the group consisting of the sequences of SEQ ID NOs: 1-169, 339-455,
561-784 and sequences fully complementary thereto.

Other preferred fragments of the invention are polynucleotides comprising polynucleotides
encoding domains of polypeptides. Such fragments may be used to obtain other polynucleotides
encoding polypeptides having similar domains using hybridization or RT-PCR techniques.
Alternatively, these fragments may be used to express a polypeptide domain which may have a
specific biological property. Thus, another object of the invention is an isolated, purified or
recombinant polynucleotide encoding a polypeptide consisting of, consisting essentially of, or
comprising a contiguous span of at least 5, 6, 8, 10, 12, 15, 20, 25, 30, 35, 40, 50, 60, 75, 100, 150
or 200 consecutive amino acids of a sequence selected from the group consisting of the sequences
of SEQ ID NOs: 170-338, 456-560, 785-918, to the extent that a contiguous span of these lengths is
consistent with the lengths of said selected sequence, where said contiguous span comprises at least
1, 2, 3, 5, or 10 of the amino acid positions of a domain of said selected sequence. The present
invention also encompasses isolated, purified or recombinant polynucleotides encoding a
polypeptide comprising a contiguous span of at least 5, 6, 8, 10, 12, 15, 20, 25, 30, 35, 40, 50, 60,
75, 100, 150 or 200 consecutive amino acids of a sequence selected from the group consisting of
sequences of SEQ ID NOs: 170-338, 456-560, 785-918, to the extent that a contiguous span of these
lengths is consistent with the lengths of said selected sequence, where said contiguous span is a
domain of said selected sequence. The present invention also encompasses isolated, purified or
recombinant polynucleotides encoding a polypeptide comprising a domain of a sequence selected
from the group consisting of the sequences of SEQ ID NOs: 170-338, 456-560, 785-918.

The present invention further encompasses any combination of the polynucleotide 
fragments listed in this section. 

Oligonucleotide primers and probes 

The present invention also encompasses fragments of GENSET polynucleotides for use as
primers and probes. Polynucleotides derived from the GENSET genomic and cDNA sequences are
useful in order to detect the presence of at least a copy of a GENSET polynucleotide or fragment,
complement, or variant thereof in a test sample.

Structural definition 

Any polynucleotide of the invention may be used as a primer or probe. Particularly
preferred probes and primers of the invention include isolated, purified, or recombinant
polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70,
80, 90, 100, 150, 200, 500, or 1000 nucleotides of a sequence selected from the group consisting of
the GENSET genomic sequences, the cDNA sequences and the sequences fully complementary
thereto. Another object of the invention is a purified, isolated, or recombinant polynucleotide
comprising the nucleotide sequence of a sequence selected from the group consisting of the
sequences of SEQ ID NOs: 1-169, 339-455, 561-784, sequences of clone inserts of the deposited
clone pool, sequences fully complementary thereto, allelic variants thereof, and fragments thereof.
Moreover, preferred probes and primers of the invention include purified, isolated, or recombinant
GENSET cDNAs consisting of, consisting essentially of, or comprising the sequences of SEQ ID
NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited clone pool.

WO 02/083898 PCT/IB01/00914

Particularly preferred probes and primers of the invention include isolated, purified, or recombinant
polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70,
80, 90, 100, 150, 200, 500, or 1000 nucleotides of a sequence selected from the group consisting of
the sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and the sequences fully complementary
thereto.

Design of primers and probes 

A probe or a primer according to the invention is between 8 and 1000 nucleotides in
length, or is specified to be at least 12, 15, 18, 20, 25, 35, 40, 50, 60, 70, 80, 100, 250, 500 or 1000
nucleotides in length. More particularly, the length of these probes and primers can range from 8,
10, 15, 20, or 30 to 100 nucleotides, preferably from 10 to 50, more preferably from 15 to 30
nucleotides. Shorter probes and primers tend to lack specificity for a target nucleic acid sequence
and generally require cooler temperatures to form sufficiently stable hybrid complexes with the
template. Longer probes and primers are expensive to produce and can sometimes self-hybridize to
form hairpin structures. The appropriate length for primers and probes under a particular set of
assay conditions may be empirically determined by one of skill in the art. The formation of stable
hybrids depends on the melting temperature (Tm) of the DNA. The Tm depends on the length of
the primer or probe, the ionic strength of the solution and the G+C content. The higher the G+C
content of the primer or probe, the higher the melting temperature, because G:C pairs are held by
three hydrogen bonds whereas A:T pairs have only two. The GC content in the probes of the
invention usually ranges between 10 and 75%, preferably between 35 and 60%, and more preferably
between 40 and 55%.
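
By way of example only, the dependence of Tm on G+C content can be sketched with the widely used Wallace rule of thumb, Tm = 2 x (A+T) + 4 x (G+C) in degrees Celsius. This rule is an assumption of the sketch and is not taken from the present specification; it is a rough estimate for short oligonucleotides and ignores ionic strength. The function names and the example oligonucleotide are hypothetical, and the default GC window reuses the "preferably between 35 and 60%" range stated above.

```python
def gc_percent(oligo):
    """Percentage of G and C bases in the oligonucleotide."""
    oligo = oligo.upper()
    if not oligo:
        raise ValueError("oligonucleotide must not be empty")
    return 100.0 * sum(base in "GC" for base in oligo) / len(oligo)


def wallace_tm(oligo):
    """Rough Tm estimate in degrees Celsius (Wallace rule, short oligos only)."""
    oligo = oligo.upper()
    at = oligo.count("A") + oligo.count("T")
    gc = oligo.count("G") + oligo.count("C")
    return 2 * at + 4 * gc


def gc_within(oligo, low=35.0, high=60.0):
    """True when the GC content falls inside the given percentage window."""
    return low <= gc_percent(oligo) <= high


if __name__ == "__main__":
    primer = "ATGCATGCATGCATGCATGC"  # hypothetical 20-mer, 50% GC
    print(gc_percent(primer), wallace_tm(primer), gc_within(primer))
```

Because each G or C contributes twice as much as each A or T in this estimate, two primers of equal length but different GC content will have different estimated Tm values, which is why primer pairs are usually matched on Tm and not on length alone.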

For amplification purposes, pairs of primers with approximately the same Tm are
preferable. Primers may be designed using the OSP software (Hillier and Green, 1991), the
disclosure of which is incorporated by reference in its entirety, based on GC content and melting
temperatures of oligonucleotides, or using PC-Rare
(http://bioinformatics.weizmann.ac.il/software/PC-Rare/doc/manuel.html) based on the octamer
frequency disparity method (Griffais et al., 1991), the disclosure of which is incorporated by
reference in its entirety. DNA amplification techniques are well known to those skilled in the art.
Amplification techniques that can be used in the context of the present invention include, but are
not limited to, the ligase chain reaction (LCR) described in EP-A-320 308, WO 9320227 and
EP-A-439 182, the polymerase chain reaction (PCR, RT-PCR) and techniques such as the nucleic
acid sequence based amplification (NASBA) described in Guatelli et al. (1990) and in Compton
(1991), Q-beta amplification as described in European Patent Application No 4544610, strand
displacement amplification as described in Walker et al. (1996) and EP A 684 315, and target
mediated amplification as described in PCT Publication WO 9322461, the disclosures of which are
incorporated by reference in their entireties.

LCR and Gap LCR are exponential amplification techniques, both depending on DNA
ligase to join adjacent primers annealed to a DNA molecule. In Ligase Chain Reaction (LCR),
probe pairs are used which include two primary (first and second) and two secondary (third and
fourth) probes, all of which are employed in molar excess to target. The first probe hybridizes to a
first segment of the target strand and the second probe hybridizes to a second segment of the target
strand, the first and second segments being contiguous so that the primary probes abut one another
in 5' phosphate-3' hydroxyl relationship, and so that a ligase can covalently fuse or ligate the two
probes into a fused product. In addition, a third (secondary) probe can hybridize to a portion of the
first probe and a fourth (secondary) probe can hybridize to a portion of the second probe in a similar
abutting fashion. Of course, if the target is initially double stranded, the secondary probes also will
hybridize to the target complement in the first instance. Once the ligated strand of primary probes
is separated from the target strand, it will hybridize with the third and fourth probes, which can be
ligated to form a complementary, secondary ligated product. It is important to realize that the
ligated products are functionally equivalent to either the target or its complement. By repeated
cycles of hybridization and ligation, amplification of the target sequence is achieved. A method for
multiplex LCR has also been described (WO 9320227), the disclosure of which is incorporated by
reference in its entirety. Gap LCR (GLCR) is a version of LCR where the probes are not adjacent
but are separated by 2 to 3 bases.

For amplification of mRNAs, it is within the scope of the present invention to reverse
transcribe mRNA into cDNA followed by polymerase chain reaction (RT-PCR); or, to use a single
enzyme for both steps as described in U.S. Patent No. 5,322,770; or, to use Asymmetric Gap LCR
(RT-AGLCR) as described by Marshall et al. (1994), the disclosures of which are incorporated by
reference in their entireties. AGLCR is a modification of GLCR that allows the amplification of
RNA.

PCR technology is the preferred amplification technique used in the present invention. A
variety of PCR techniques are familiar to those skilled in the art. For a review of PCR technology,
see White (1997), Erlich (1992) and the publication entitled "PCR Methods and Applications"
((1991) Cold Spring Harbor Laboratory Press), the disclosures of which are incorporated by
reference in their entireties. In each of these PCR procedures, PCR primers on either side of the
nucleic acid sequences to be amplified are added to a suitably prepared nucleic acid sample along
with dNTPs and a thermostable polymerase such as Taq polymerase, Pfu polymerase, Tth
polymerase or Vent polymerase. The nucleic acid in the sample is denatured and the PCR primers
are specifically hybridized to complementary nucleic acid sequences in the sample. The hybridized
primers are extended. Thereafter, another cycle of denaturation, hybridization, and extension is
initiated. The cycles are repeated multiple times to produce an amplified fragment containing the
nucleic acid sequence between the primer sites. PCR has further been described in several patents
including US Patent Nos. 4,683,195; 4,683,202; and 4,965,188, the disclosures of which are
incorporated herein by reference in their entireties.
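
The PCR procedure outlined above can be sketched computationally, for illustration only, as locating the two primer sites on one strand of a template and reporting the fragment they delimit. The Python sketch below assumes exact primer matches on a single template strand and idealized doubling of the product at every cycle; real reactions tolerate mismatches and amplify with less than perfect efficiency. All function names and sequences are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


def amplicon(template, forward_primer, reverse_primer):
    """Return the fragment delimited by the two primer sites, or None if either is absent.

    The forward primer matches the template strand as given; the reverse primer
    anneals to the opposite strand, so its site on the template is the primer's
    reverse complement.
    """
    start = template.find(forward_primer)
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse_primer)
    end = template.find(reverse_site, start + len(forward_primer))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]


def copies_after(cycles, starting_copies=1):
    """Idealized copy number, assuming perfect doubling at each cycle."""
    return starting_copies * 2 ** cycles


if __name__ == "__main__":
    template = "TTT" + "ACGTAC" + "GGGGGG" + "CCATTG" + "AAA"
    print(amplicon(template, "ACGTAC", "CAATGG"))
    print(copies_after(30))
```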

Preparation of primers and probes

Primers and probes can be prepared by any suitable method, including, for example,
cloning and restriction of appropriate sequences and direct chemical synthesis by a method such as
the phosphodiester method of Narang et al. (1979), the phosphodiester method of Brown et
al. (1979), the diethylphosphoramidite method of Beaucage et al. (1981) and the solid support
method described in EP 0 707 592, which disclosures are hereby incorporated by reference in their
entireties.

Detection probes are generally nucleic acid sequences or uncharged nucleic acid analogs
such as, for example, peptide nucleic acids, which are disclosed in International Patent Application
WO 92/20702, and morpholino analogs, which are described in U.S. Patent Nos. 5,185,444;
5,034,506 and 5,142,047, which disclosures are hereby incorporated by reference in their entireties.
The probe may have to be rendered "non-extendable" in that additional dNTPs cannot be added to
the probe. In and of themselves analogs usually are non-extendable and nucleic acid probes can be
rendered non-extendable by modifying the 3' end of the probe such that the hydroxyl group is no
longer capable of participating in elongation. For example, the 3' end of the probe can be
functionalized with the capture or detection label to thereby consume or otherwise block the
hydroxyl group. Alternatively, the 3' hydroxyl group simply can be cleaved, replaced or modified.
U.S. Patent Application Serial No. 07/049,061 filed April 19, 1993, which disclosure is hereby
incorporated by reference in its entirety, describes modifications which can be used to render a
probe non-extendable.

Labeling of probes 

Any of the polynucleotides of the present invention can be labeled, if desired, by
incorporating any label known in the art to be detectable by spectroscopic, photochemical,
biochemical, immunochemical, or chemical means. For example, useful labels include radioactive
substances (including 32P, 35S, 3H, 125I), fluorescent dyes (including 5-bromodesoxyuridin,
fluorescein, acetylaminofluorene, digoxigenin) or biotin. Preferably, polynucleotides are labeled at
their 3' and 5' ends. Examples of non-radioactive labeling of nucleic acid fragments are described
in the French patent No. FR-7810975 or by Urdea et al. (1988) or Sanchez-Pescador et al. (1988),
which disclosures are hereby incorporated by reference in their entireties. In addition, the probes
according to the present invention may have structural characteristics such that they allow signal
amplification, such structural characteristics being, for example, branched DNA probes such as
those described by Urdea et al. in 1991 or in the European patent No. EP 0 225 807 (Chiron), which
disclosures are hereby incorporated by reference in their entireties.

The detectable probe may be single stranded or double stranded and may be made using
techniques known in the art, including in vitro transcription, nick translation, or kinase reactions. A
nucleic acid sample containing a sequence capable of hybridizing to the labeled probe is contacted
with the labeled probe. If the nucleic acid in the sample is double stranded, it may be denatured
prior to contacting the probe. In some applications, the nucleic acid sample may be immobilized on
a surface such as a nitrocellulose or nylon membrane. The nucleic acid sample may comprise
nucleic acids obtained from a variety of sources, including genomic DNA, cDNA libraries, RNA, or
tissue samples.

Procedures used to detect the presence of nucleic acids capable of hybridizing to the
detectable probe include well known techniques such as Southern blotting, Northern blotting, dot
blotting, colony hybridization, and plaque hybridization. In some applications, the nucleic acid
capable of hybridizing to the labeled probe may be cloned into vectors such as expression vectors,
sequencing vectors, or in vitro transcription vectors to facilitate the characterization and expression
of the hybridizing nucleic acids in the sample. For example, such techniques may be used to isolate
and clone sequences in a genomic library or cDNA library which are capable of hybridizing to the
detectable probe as described herein.

Immobilization of probes 

A label can also be used to capture the primer, so as to facilitate the immobilization of
either the primer or a primer extension product, such as amplified DNA, on a solid support. A
capture label is attached to the primers or probes and can be a specific binding member which forms
a binding pair with the solid phase reagent's specific binding member (e.g. biotin and streptavidin).
Therefore, depending upon the type of label carried by a polynucleotide or a probe, it may be
employed to capture or to detect the target DNA. Further, it will be understood that the
polynucleotides, primers or probes provided herein may, themselves, serve as the capture label.
For example, in the case where a solid phase reagent's binding member is a nucleic acid sequence,
it may be selected such that it binds a complementary portion of a primer or probe to thereby
immobilize the primer or probe to the solid phase. In cases where a polynucleotide probe itself
serves as the binding member, those skilled in the art will recognize that the probe will contain a
sequence or "tail" that is not complementary to the target. In the case where a polynucleotide
primer itself serves as the capture label, at least a portion of the primer will be free to hybridize with
a nucleic acid on a solid phase. DNA labeling techniques are well known to the skilled technician.

The probes of the present invention are useful for a number of purposes. They can notably
be used in Southern hybridization to genomic DNA. The probes can also be used to detect PCR
amplification products. They may also be used to detect mismatches in the GENSET gene or
mRNA using other techniques. They may also be used for in situ hybridization.

Any of the polynucleotides, primers and probes of the present invention can be 
conveniently immobilized on a solid support. The solid support is not critical and can be selected
by one skilled in the art. Thus, latex particles, microparticles, magnetic beads, non-magnetic beads
(including polystyrene beads), membranes (including nitrocellulose strips), plastic tubes, walls of
microtiter wells, glass or silicon chips, sheep (or other suitable animal's) red blood cells and
duracytes are all suitable examples. Suitable methods for immobilizing nucleic acids on solid
phases include ionic, hydrophobic, covalent interactions and the like. A solid support, as used
herein, refers to any material which is insoluble, or can be made insoluble by a subsequent reaction.
The solid support can be chosen for its intrinsic ability to attract and immobilize the capture reagent. 
Alternatively, the solid phase can retain an additional receptor which has the ability to attract and 
immobilize the capture reagent. The additional receptor can include a charged substance that is 
oppositely charged with respect to the capture reagent itself or to a charged substance conjugated to
the capture reagent. As yet another alternative, the receptor molecule can be any specific binding 
member which is immobilized upon (attached to) the solid support and which has the ability to 
immobilize the capture reagent through a specific binding reaction. The receptor molecule enables 
the indirect binding of the capture reagent to a solid support material before the performance of the
assay or during the performance of the assay. The solid phase thus can be a plastic, derivatized 
plastic, magnetic or non-magnetic metal, glass or silicon surface of a test tube, microtiter well, 
sheet, bead, microparticle, chip, sheep (or other suitable animal's) red blood cells, duracytes® and 
other configurations known to those of ordinary skill in the art. The polynucleotides of the 
invention can be attached to or immobilized on a solid support individually or in groups of at least
2, 5, 8, 10, 12, 15, 20, or 25 distinct polynucleotides of the invention on a single solid support. In
addition, polynucleotides other than those of the invention may be attached to the same solid 
support as one or more polynucleotides of the invention. 

Oligonucleotide array

A substrate comprising a plurality of oligonucleotide primers or probes of the invention 
may be used either for detecting or amplifying targeted sequences in GENSET genes, may be used 
for detecting mutations in the coding or in the non-coding sequences of GENSET genes, and may 
also be used to determine GENSET gene expression in different contexts such as in different
tissues, at different stages of a process (embryo development, disease treatment), and in patients
versus healthy individuals as described elsewhere in the application. 

As used herein, the term "array" means a one dimensional, two dimensional, or 
multidimensional arrangement of nucleic acids of sufficient length to permit specific detection of 
gene expression. For example, the array may contain a plurality of nucleic acids derived from
genes whose expression levels are to be assessed. The array may include a GENSET genomic
DNA, a GENSET cDNA, sequences complementary thereto or fragments thereof. Preferably, the
fragments are at least 12, 15, 18, 20, 25, 30, 35, 40 or 50 nucleotides in length. More preferably, the
fragments are at least 100 nucleotides in length. Even more preferably, the fragments are more than
100 nucleotides in length. In some embodiments the fragments may be more than 500 nucleotides
in length.
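By way of illustration only, the fragment lengths recited above can be pictured as fixed-length windows taken along a longer sequence. The following minimal sketch assumes a hypothetical helper `tile_fragments` and a made-up 30-nucleotide example sequence; neither the helper, the step size, nor the sequence comes from the present disclosure.

```python
def tile_fragments(sequence: str, length: int, step: int = 1) -> list[str]:
    """Return every contiguous fragment of `length` nucleotides,
    advancing `step` bases between successive fragments."""
    if length <= 0 or step <= 0:
        raise ValueError("length and step must be positive")
    last_start = len(sequence) - length
    return [sequence[i:i + length] for i in range(0, last_start + 1, step)]


# Hypothetical 30-nt sequence used only for illustration (not a GENSET sequence).
example = "ATGGCCAAGCTTGGATCCGTACCTGAAGTC"

# 12-nt fragments, the shortest length recited above, offset by 6 nt.
fragments = tile_fragments(example, length=12, step=6)
```

A smaller step yields more heavily overlapping fragments; the choice of step is a design decision for the array and is not specified here.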

Any polynucleotide provided herein may be attached in overlapping areas or at random 
locations on the solid support. Alternatively the polynucleotides of the invention may be attached 
in an ordered array wherein each polynucleotide is attached to a distinct region of the solid support 
which does not overlap with the attachment site of any other polynucleotide. Preferably, such an 
ordered array of polynucleotides is designed to be "addressable" where the distinct locations are
recorded and can be accessed as part of an assay procedure. Addressable polynucleotide arrays 
typically comprise a plurality of different oligonucleotide probes that are coupled to a surface of a 
substrate in different known locations. Knowledge of the precise location of each
polynucleotide makes these "addressable" arrays particularly useful in hybridization
assays. Any addressable array technology known in the art can be employed with the
polynucleotides of the invention. One particular embodiment of these polynucleotide arrays is
known as the Genechips™, and has been generally described in US Patent No. 5,143,854 and PCT
publications WO 90/15070 and WO 92/10092, which disclosures are hereby incorporated by reference in
their entireties. These arrays may generally be produced using mechanical synthesis methods or
light directed synthesis methods which incorporate a combination of photolithographic methods and
solid phase oligonucleotide synthesis (Fodor et al., 1991), which disclosure is hereby incorporated
by reference in its entirety. The immobilization of arrays of oligonucleotides on solid supports has
been rendered possible by the development of a technology generally identified as "Very Large
Scale Immobilized Polymer Synthesis" (VLSIPS™) in which, typically, probes are immobilized in
a high density array on a solid surface of a chip. Examples of VLSIPS™ technologies are provided
in US Patents 5,143,854 and 5,412,087 and in PCT Publications WO 90/15070, WO 92/10092 and
WO 95/11995, which disclosures are hereby incorporated by reference in their entireties, and which
describe methods for forming oligonucleotide arrays through techniques such as light-directed
synthesis techniques. In designing strategies aimed at providing arrays of nucleotides immobilized
on solid supports, further presentation strategies were developed to order and display the
oligonucleotide arrays on the chips in an attempt to maximize hybridization patterns and sequence
information. Examples of such presentation strategies are disclosed in PCT Publications WO
94/12305, WO 94/11530, WO 97/29212 and WO 97/31256, the disclosures of which are
incorporated herein by reference in their entireties.
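The notion of an "addressable" array, in which each polynucleotide occupies a distinct, recorded and non-overlapping location, can be sketched as a simple lookup structure. The class and method names below (`Feature`, `AddressableArray`, `place`, `probes_with_signal`), the probe identifiers, the sequences and the signal values are all hypothetical and serve only to illustrate the bookkeeping; they do not describe any particular commercial array technology.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    """One spot on the support: its address and the probe attached there."""
    row: int
    col: int
    probe_id: str
    sequence: str


class AddressableArray:
    """Ordered array in which each probe has one distinct, recorded location."""

    def __init__(self, rows: int, cols: int) -> None:
        self.rows = rows
        self.cols = cols
        self._features: dict[tuple[int, int], Feature] = {}

    def place(self, row: int, col: int, probe_id: str, sequence: str) -> None:
        """Attach a probe at (row, col); attachment sites may not overlap."""
        if not (0 <= row < self.rows and 0 <= col < self.cols):
            raise IndexError("address lies outside the support")
        if (row, col) in self._features:
            raise ValueError("attachment sites must not overlap")
        self._features[(row, col)] = Feature(row, col, probe_id, sequence)

    def probe_at(self, row: int, col: int):
        """Look up the probe recorded at an address (None if the spot is empty)."""
        return self._features.get((row, col))

    def probes_with_signal(self, signal: dict, threshold: float) -> list[str]:
        """Map per-address hybridization signal back to probe identifiers."""
        return [feature.probe_id
                for address, feature in sorted(self._features.items())
                if signal.get(address, 0.0) >= threshold]


# Hypothetical probes and signal intensities, for illustration only.
array = AddressableArray(rows=2, cols=2)
array.place(0, 0, "probe_A", "ATGGCCAAGCTT")
array.place(0, 1, "probe_B", "GGATCCGTACCT")
hits = array.probes_with_signal({(0, 0): 0.1, (0, 1): 0.9}, threshold=0.5)
```

Because the address of each probe is recorded, a signal observed at a given location during an assay can be traced directly to the sequence attached there.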

Consequently, the invention concerns an array of nucleic acid molecules comprising at least 
one polynucleotide of the invention, particularly a probe or primer as described herein. Preferably, 
the invention concerns an array of nucleic acids comprising at least two polynucleotides of the
invention, particularly probes or primers as described herein. Preferably, the invention concerns an
array of nucleic acids comprising at least five polynucleotides of the invention, particularly probes 
or primers as described herein. 

A preferred embodiment of the present invention is an array of polynucleotides of at least 
12, 15, 18, 20, 25, 30, 35, 40, 50, 100 or 500 nucleotides in length which includes at least 1, 2, 5,
10, 15, 20, 35, 50 or 100 sequences selected from the group consisting of the sequences of SEQ ID
NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited clone pool, sequences
fully complementary thereto, and fragments thereof. 

Methods of making the polynucleotides of the invention 

The present invention also comprises methods of making the polynucleotides of the 
invention, including the polynucleotides of SEQ ID NOs: 1-169, 339-455, 561-784, genomic DNA
obtainable therefrom, or fragments thereof. These methods comprise sequentially linking together 
nucleotides to produce the nucleic acids having the preceding sequences. Polynucleotides of the 
invention may be synthesized either enzymatically using techniques well known to those skilled in
the art including amplification or hybridization-based methods as described herein, or chemically. 

A variety of chemical methods of synthesizing nucleic acids are known to those skilled in 
the art. In many of these methods, synthesis is conducted on a solid support. These include the 3'
phosphoramidite methods in which the 3' terminal base of the desired oligonucleotide is
immobilized on an insoluble carrier. The nucleotide base to be added is blocked at the 5' hydroxyl
and activated at the 3' hydroxyl so as to cause coupling with the immobilized nucleotide base.
Deblocking of the new immobilized nucleotide compound and repetition of the cycle will produce
the desired polynucleotide. Alternatively, polynucleotides may be prepared as described in U.S.
Patent No. 5,049,656, which disclosure is hereby incorporated by reference in its entirety. In some
embodiments, several polynucleotides prepared as described above are ligated together to generate 
longer polynucleotides having a desired sequence. 
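The direction of the solid-phase cycle described above can be made concrete with a short sketch: the 3' terminal base is immobilized first, and each subsequent nucleotide is coupled to the free 5' end of the growing chain. The function names `coupling_order` and `simulate_synthesis` are hypothetical, and the model ignores protecting-group chemistry and coupling efficiency entirely.

```python
def coupling_order(sequence_5_to_3: str) -> list[str]:
    """Order in which bases enter the synthesis: the 3' terminal base is
    immobilized first, then bases are added toward the 5' end."""
    return list(reversed(sequence_5_to_3))


def simulate_synthesis(sequence_5_to_3: str) -> str:
    """Rebuild the oligonucleotide by repeating the couple/deblock cycle."""
    if not sequence_5_to_3:
        raise ValueError("sequence must not be empty")
    order = coupling_order(sequence_5_to_3)
    chain = order[0]              # 3' terminal base bound to the carrier
    for base in order[1:]:
        chain = base + chain      # incoming base couples to the 5' end
    return chain


# Hypothetical 7-mer, written 5' to 3'.
product = simulate_synthesis("GATTACA")
```

The simulated product reads the same 5' to 3' as the requested sequence even though the bases were added in the opposite order, which is the point the paragraph above makes.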

Polypeptides of the invention

The term "GENSET polypeptides" is used herein to embrace all of the proteins and
polypeptides of the present invention. The present invention encompasses GENSET polypeptides,
including recombinant, isolated or purified GENSET polypeptides consisting of, consisting
essentially of, or comprising a sequence selected from the group consisting of SEQ ID NOs: 170-338,
456-560, 785-918 and the polypeptides encoded by human cDNAs contained in the deposited
clones. Other objects of the invention are polypeptides encoded by the polynucleotides of the
invention as well as fusion polypeptides comprising such polypeptides.

Polypeptide variants 

The present invention further provides for GENSET polypeptides encoded by allelic and 
splice variants, orthologs, and/or species homologues. Procedures known in the art can be used to 
obtain allelic variants, splice variants, orthologs, and/or species homologues of polynucleotides
encoding polypeptides of the group consisting of SEQ ID NOs: 170-338, 456-560, 785-918 and
polypeptides encoded by the clone inserts of the deposited clone pool, using information from the 
sequences disclosed herein or the clones deposited with the ATCC. 

The polypeptides of the present invention also include polypeptides having an amino acid 
sequence at least 50% identical, more preferably at least 60% identical, and still more preferably
70%, 80%, 90%, 95%, 96%, 97%, 98% or 99% identical to a polypeptide selected from the group
consisting of the sequences of SEQ ID NOs: 170-338, 456-560, 785-918 and those encoded by the 
clone inserts of the deposited clone pool. By a polypeptide having an amino acid sequence at least, 
for example, 95% "identical" to a query amino acid sequence of the present invention, it is intended 
that the amino acid sequence of the subject polypeptide is identical to the query sequence except 

that the subject polypeptide sequence may include up to five amino acid alterations per each 100
amino acids of the query amino acid sequence. In other words, to obtain a polypeptide having an
amino acid sequence at least 95% identical to a query amino acid sequence, up to 5% (5 of 100) of
the amino acid residues in the subject sequence may be inserted, deleted, (indels) or substituted with
another amino acid.
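The definition above, under which "95% identical" tolerates up to five alterations per 100 residues of the query, can be expressed as a small calculation on a pre-computed pairwise alignment. The sketch below is one possible reading, not a prescribed algorithm: it assumes that gaps are written as "-", that each insertion, deletion or substitution column counts as one alteration, and that the denominator is the number of query residues. The function names and the example sequences are hypothetical.

```python
def percent_identity(aligned_query: str, aligned_subject: str) -> float:
    """Percent identity relative to the query length.

    Both arguments are rows of one pairwise alignment (equal length, '-' for
    gaps). Every column where the rows differ counts as one alteration.
    """
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned sequences must have equal length")
    query_length = sum(1 for residue in aligned_query if residue != "-")
    if query_length == 0:
        raise ValueError("query contains no residues")
    alterations = sum(1 for q, s in zip(aligned_query, aligned_subject) if q != s)
    return max(0.0, 100.0 * (query_length - alterations) / query_length)


def is_at_least(aligned_query: str, aligned_subject: str, threshold: float) -> bool:
    """True when the subject meets the stated identity threshold."""
    return percent_identity(aligned_query, aligned_subject) >= threshold


# Hypothetical 20-residue query and a subject carrying one substitution.
query = "ACDEFGHIKLMNPQRSTVWY"
subject = "ACDEFGHIKLMNPQRSTVWA"
identity = percent_identity(query, subject)  # 19 of 20 positions match
```

In practice the alignment itself would be produced by an alignment program; this sketch only covers the counting step that follows.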

Further polypeptides of the present invention include polypeptides which have at least 90% 
similarity, more preferably at least 95% similarity, and still more preferably at least 96%, 97%, 98% 
or 99% similarity to those described above. By a polypeptide having an amino acid sequence at
least, for example, 95% "similar" to a query amino acid sequence of the present invention, it is 
intended that the amino acid sequence of the subject polypeptide is similar (i.e. contains identical or 
equivalent amino acid residues) to the query sequence except that the subject polypeptide sequence 
may include up to five amino acid alterations per each 100 amino acids of the query amino acid 
sequence. In other words, to obtain a polypeptide having an amino acid sequence at least 95%
similar to a query amino acid sequence, up to 5% (5 of 100) of the amino acid residues in the 
subject sequence may be inserted, deleted, (indels) or substituted with another non-equivalent 
amino acid. 
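Percent similarity differs from percent identity only in that a substitution by an "equivalent" residue is not counted as an alteration. The sketch below illustrates this under a stated assumption: the equivalence groups listed in `EQUIVALENCE_GROUPS` are a commonly used conservative grouping chosen here purely for illustration, and are not a definition taken from this passage. Gap columns are counted as alterations, and the denominator is the number of query residues.

```python
# Illustrative equivalence groups (an assumption, not defined in this passage):
# aliphatic, hydroxyl, acidic, amide, basic, aromatic.
EQUIVALENCE_GROUPS = [set("AVLI"), set("ST"), set("DE"),
                      set("NQ"), set("KR"), set("FY")]


def equivalent(a: str, b: str) -> bool:
    """True when two aligned residues are identical or share a group."""
    if "-" in (a, b):
        return False  # a gap is always an alteration
    return a == b or any(a in group and b in group for group in EQUIVALENCE_GROUPS)


def percent_similarity(aligned_query: str, aligned_subject: str) -> float:
    """Percent similarity relative to the query length for one alignment."""
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned sequences must have equal length")
    query_length = sum(1 for residue in aligned_query if residue != "-")
    if query_length == 0:
        raise ValueError("query contains no residues")
    alterations = sum(1 for q, s in zip(aligned_query, aligned_subject)
                      if not equivalent(q, s))
    return max(0.0, 100.0 * (query_length - alterations) / query_length)


# Hypothetical 4-residue example: every substitution stays within a group.
all_conservative = percent_similarity("AKDS", "VRES")
one_non_equivalent = percent_similarity("AKDS", "AKDW")
```

A different choice of equivalence groups, or a scoring matrix with a cut-off, would give different similarity values for the same alignment.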

These alterations of the reference sequence may occur at the amino or carboxy terminal 
positions of the reference amino acid sequence or anywhere between those terminal positions,
interspersed either individually among residues in the reference sequence or in one or more
contiguous groups within the reference sequence. The query sequence may be an entire amino acid
sequence selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456-560,
785-918 and those encoded by the clone inserts of the deposited clone pool or any fragment specified as
described herein.

The variant polypeptides described herein are included in the present invention regardless 
of whether they have their normal biological activity. This is because even where a particular 
polypeptide molecule does not have biological activity, one of skill in the art would still know how 
to use the polypeptide, for instance, as a vaccine or to generate antibodies. Other uses of the
polypeptides of the present invention that do not have GENSET biological activity include, inter
alia, as epitope tags, in epitope mapping, and as molecular weight markers on SDS-PAGE gels or 
on molecular sieve gel filtration columns using methods known to those of skill in the art. As 
described below, the polypeptides of the present invention can also be used to raise polyclonal and 
monoclonal antibodies, which are useful in assays for detecting GENSET protein expression or as
agonists and antagonists capable of enhancing or inhibiting GENSET protein function. Further,
such polypeptides can be used in the yeast two-hybrid system to "capture" GENSET protein binding
proteins, which are also candidate agonists and antagonists according to the present invention (see,
e.g., Fields et al., 1989, which disclosure is hereby incorporated by reference in its entirety).

Preparation of the polypeptides of the invention

The polypeptides of the present invention can be prepared in any suitable manner. Such
polypeptides include isolated naturally occurring polypeptides, recombinantly produced
polypeptides, synthetically produced polypeptides, or polypeptides produced by a combination of
these methods. The polypeptides of the present invention are preferably provided in an isolated
form, and may be partially or preferably substantially purified. 

Consequently, the present invention also comprises methods of making the polypeptides of 
the invention, particularly polypeptides encoded by the cDNAs of SEQ ID NOs: 1-169, 339-455, 
561-784 or by the clone inserts of the deposited clone pool, genomic DNA obtainable therefrom, or
fragments thereof and methods of making the polypeptides of SEQ ID NOs: 170-338, 456-560,
785-918 or fragments thereof. The methods comprise sequentially linking together amino acids to
produce the polypeptides having the preceding sequences. In some embodiments, the
polypeptides made by these methods are 150 amino acids or less in length. In other embodiments,
the polypeptides made by these methods are 120 amino acids or less in length.

Isolation 

From natural sources 

The GENSET proteins of the invention may be isolated from natural sources, including 
bodily fluids, tissues and cells, whether directly isolated or cultured cells, of humans or non-human
animals. Methods for extracting and purifying natural proteins are known in the art, and include the
use of detergents or chaotropic agents to disrupt particles followed by differential extraction and 
separation of the polypeptides by ion exchange chromatography, affinity chromatography, 
sedimentation according to density, and gel electrophoresis. See, for example, "Methods in
Enzymology", Academic Press, 1993, for a variety of methods for purifying proteins, which
disclosure is hereby incorporated by reference in its entirety. Polypeptides of the invention also can
be purified from natural sources using antibodies directed against the polypeptides of the invention, 
such as those described herein, in methods which are well known in the art of protein purification. 

From recombinant sources 

Preferably, the GENSET polypeptides of the invention are recombinantly produced using 
routine expression methods known in the art. The polynucleotide encoding the desired polypeptide
is operably linked to a promoter in an expression vector suitable for any convenient host. Both
eukaryotic and prokaryotic host systems are used in forming recombinant polypeptides. The 
polypeptide is then isolated from lysed cells or from the culture medium and purified to the extent 
needed for its intended use. 
Any GENSET polynucleotide, including those described in SEQ ID NOs: 1-169, 339-455,
561-784, those of clone inserts of the deposited clone pool, and allelic variants thereof may be used
to express GENSET polypeptides. The nucleic acid encoding the GENSET polypeptide to be 
expressed is operably linked to a promoter in an expression vector using conventional cloning 
technology. The GENSET insert in the expression vector may comprise the full coding sequence 
for the GENSET protein or a portion thereof. For example, the GENSET derived insert may encode
a polypeptide comprising at least 6, 8, 10, 12, 15, 20, 25, 30, 35, 40, 50, 60, 75, 100, 150 or 200 
consecutive amino acids of a GENSET protein selected from the group consisting of sequences of
SEQ ID NOs: 170-338, 456-560, 785-918 and polypeptides encoded by the clone inserts of the 
deposited clone pool. 

Consequently, a further embodiment of the present invention is a method of making a 
polypeptide comprising a protein selected from the group consisting of sequences of SEQ ID
NOs: 170-338, 456-560, 785-918 and polypeptides encoded by the clone inserts of the deposited 
clone pool, said method comprising the steps of: 

a) obtaining a cDNA comprising a sequence selected from the group consisting of i) the
sequences SEQ ID NOs: 1-169, 339-455, 561-784, ii) the sequences of clone inserts of the deposited
clone pool, iii) sequences encoding one of the polypeptides of SEQ ID NOs: 170-338, 456-560,
785-918, and iv) sequences of polynucleotides encoding a polypeptide which is encoded by one of
the clone inserts of the deposited clone pool;

b) inserting said cDNA in an expression vector such that the cDNA is operably linked to a 
promoter; and 

c) introducing said expression vector into a host cell whereby said host cell produces said
polypeptide.

In one aspect of this embodiment, the method further comprises the step of isolating the 
polypeptide. Another embodiment of the present invention is a polypeptide obtainable by the 
method described in the preceding paragraph. 

The expression vector is any of the mammalian, yeast, insect or bacterial expression
systems known in the art. Commercially available vectors and expression systems are available
from a variety of suppliers including Genetics Institute (Cambridge, MA), Stratagene (La Jolla,
California), Promega (Madison, Wisconsin), and Invitrogen (San Diego, California). If desired, to
enhance expression and facilitate proper protein folding, the codon context and codon pairing of the
sequence are optimized for the particular expression organism in which the expression vector is
introduced, as explained in U.S. Patent No. 5,082,767, which disclosure is hereby incorporated by 
reference in its entirety. 

In one embodiment, the entire coding sequence of a GENSET cDNA and the 3'UTR
through the poly A signal of the cDNA are operably linked to a promoter in the expression vector.
Alternatively, if the nucleic acid encoding a portion of the GENSET protein lacks a methionine to
serve as the initiation site, an initiating methionine can be introduced next to the first codon of the
nucleic acid using conventional techniques. Similarly, if the insert from the GENSET cDNA lacks
a poly A signal, this sequence can be added to the construct by, for example, splicing out the poly A
signal from pSG5 (Stratagene) using BglI and SalI restriction endonuclease enzymes and
incorporating it into the mammalian expression vector pXT1 (Stratagene). pXT1 contains the LTRs
and a portion of the gag gene from Moloney Murine Leukemia Virus. The position of the LTRs in
the construct allows efficient stable transfection. The vector includes the Herpes Simplex Thymidine
Kinase promoter and the selectable neomycin gene. The nucleic acid encoding the GENSET
protein or a portion thereof is obtained by PCR from a vector containing a GENSET cDNA selected
from the group consisting of the sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and the clone
inserts of the deposited clone pool using oligonucleotide primers complementary to the GENSET
cDNA or portion thereof and containing restriction endonuclease sequences for PstI incorporated
into the 5' primer and BglII at the 5' end of the corresponding cDNA 3' primer, taking care to ensure
that the sequence encoding the GENSET protein or a portion thereof is positioned properly with
respect to the poly A signal. The purified fragment obtained from the resulting PCR reaction is
digested with PstI, blunt ended with an exonuclease, digested with BglII, purified and ligated to
pXT1, now containing a poly A signal and digested with BglII.
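The primer layout described above, with a PstI site in the 5' primer and a BglII site at the 5' end of the 3' primer, can be sketched as simple string construction. The recognition sequences used are the standard ones for PstI (CTGCAG) and BglII (AGATCT). Everything else is an assumption made for illustration: the function name `design_primers`, the 18-nucleotide annealing length, the two-base 5' "clamp" added ahead of each site, and the example cDNA, which is a made-up sequence and not a GENSET sequence.

```python
PST_I_SITE = "CTGCAG"   # PstI recognition sequence
BGL_II_SITE = "AGATCT"  # BglII recognition sequence

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence: str) -> str:
    """Reverse complement of an uppercase DNA sequence."""
    return sequence.translate(_COMPLEMENT)[::-1]


def design_primers(cdna: str, anneal_length: int = 18, clamp: str = "GC"):
    """Return (forward, reverse) primers, each written 5' to 3'.

    forward: clamp + PstI site + first `anneal_length` bases of the cDNA
    reverse: clamp + BglII site + reverse complement of the last
             `anneal_length` bases of the cDNA
    """
    if len(cdna) < anneal_length:
        raise ValueError("cDNA is shorter than the annealing length")
    forward = clamp + PST_I_SITE + cdna[:anneal_length]
    reverse = clamp + BGL_II_SITE + reverse_complement(cdna[-anneal_length:])
    return forward, reverse


# Hypothetical coding fragment used only to exercise the function.
example_cdna = "ATGGCTAAGCTGTTCACCGGTGAACTGCTGAAAGCGTAAGC"
forward_primer, reverse_primer = design_primers(example_cdna)
```

A real design would additionally check melting temperatures, confirm that neither site occurs inside the amplified insert, and verify the reading frame relative to the poly A signal, none of which is attempted here.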

In another embodiment, it is often advantageous to add to the recombinant polynucleotide 
additional nucleotide sequence which codes for secretory or leader sequences, pro-sequences,
sequences which aid in purification, such as multiple histidine residues, or an additional sequence 
for stability during recombinant production. 

As a control, the expression vector lacking a cDNA insert is introduced into host cells or 
organisms. 

Transfection of a GENSET expression vector into mouse NIH 3T3 cells is but one
embodiment of introducing polynucleotides into host cells. Introduction of a polynucleotide
encoding a polypeptide into a host cell can be effected by calcium phosphate transfection,
DEAE-dextran mediated transfection, cationic lipid-mediated transfection, electroporation,
transduction, infection, or other methods. Such methods are described in many standard laboratory
manuals, such as Davis et al. (1986), which disclosure is hereby incorporated by reference in its
entirety. It is specifically contemplated that the polypeptides of the present invention may in fact be 
expressed by a host cell lacking a recombinant vector. 

Recombinant cell extracts, or proteins from the culture medium if the expressed polypeptide 
is secreted, are then prepared and proteins separated by gel electrophoresis. If desired, the proteins
may be ammonium sulfate precipitated or separated based on size or charge prior to electrophoresis.
The proteins present are detected using techniques such as Coomassie or silver staining or using 
antibodies against the protein encoded by the GENSET cDNA of interest. Coomassie and silver 
staining techniques are familiar to those skilled in the art. 

Proteins from the host cells or organisms containing an expression vector which contains
the GENSET cDNA or a fragment thereof are compared to those from the control cells or organism.
The presence of a band from the cells containing the expression vector which is absent in control 
cells indicates that the GENSET cDNA is expressed. Generally, the band corresponding to the 
protein encoded by the GENSET cDNA will have a mobility near that expected based on the 
number of amino acids in the open reading frame of the cDNA. However, the band may have a
mobility different than that expected as a result of modifications such as glycosylation,
ubiquitination, or enzymatic cleavage. 
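As a rough guide to where the expected band should run, the size of the unmodified polypeptide can be estimated from the length of the open reading frame. The sketch below relies on a common rule of thumb of about 110 Da per amino acid residue; that figure, the function name `expected_mass_kda`, and the example ORF length are assumptions introduced for illustration and do not appear in the text above.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average, an assumption


def expected_mass_kda(orf_length_nt: int, includes_stop_codon: bool = True) -> float:
    """Approximate mass in kDa of the unmodified polypeptide encoded by an ORF.

    The estimate ignores glycosylation, ubiquitination, proteolytic cleavage
    and signal-peptide removal, all of which can shift the observed band.
    """
    if orf_length_nt <= 0 or orf_length_nt % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_length_nt // 3
    residues = codons - 1 if includes_stop_codon else codons
    return residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


# Hypothetical ORF of 903 nt: 300 coding codons plus one stop codon.
estimate = expected_mass_kda(903)
```

A band far from the estimate does not by itself mean the cDNA is not expressed, for the reasons given in the paragraph above.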

Alternatively, the GENSET polypeptide to be expressed may also be a product of 
transgenic animals, i.e., as a component of the milk of transgenic cows, goats, pigs or sheep which 
are characterized by somatic or germ cells containing a nucleotide sequence encoding the protein of
interest. 

A polypeptide of this invention can be recovered and purified from recombinant cell 
cultures by well-known methods including differential extraction, ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose
chromatography, hydrophobic interaction chromatography, affinity chromatography, 
hydroxylapatite chromatography and lectin chromatography. See, for example, "Methods in 
Enzymology", supra for a variety of methods for purifying proteins. Most preferably, high 
performance liquid chromatography ("HPLC") is employed for purification. A recombinantly
produced version of a GENSET polypeptide can be substantially purified using techniques
described herein or otherwise known in the art, such as, for example, by the one-step method 
described in Smith and Johnson (1988), which disclosure is hereby incorporated by reference in its 
entirety. Polypeptides of the invention also can be purified from recombinant sources using 
antibodies directed against the polypeptides of the invention, such as those described herein, in
methods which are well known in the art of protein purification.

Preferably, the recombinantly expressed GENSET polypeptide is purified using standard 
immunochromatography techniques such as the one described in the section entitled 
"Immunoaffinity Chromatography". In such procedures, a solution containing the protein of 
interest, such as the culture medium or a cell extract, is applied to a column having antibodies
against the protein attached to the chromatography matrix. The recombinant protein is allowed to
bind the immunochromatography column. Thereafter, the column is washed to remove
non-specifically bound proteins. The specifically bound secreted protein is then released from the
column and recovered using standard techniques. 

If antibody production is not possible, the GENSET cDNA sequence or fragment thereof
may be incorporated into expression vectors designed for use in purification schemes employing
chimeric polypeptides. In such strategies the coding sequence of the GENSET cDNA or fragment
thereof is inserted in frame with the gene encoding the other half of the chimera. The other half of
the chimera may be beta-globin or a nickel binding polypeptide encoding sequence. A
chromatography matrix having antibody to beta-globin or nickel attached thereto is then used to
purify the chimeric protein. Protease cleavage sites may be engineered between the beta-globin
gene or the nickel binding polypeptide and the GENSET cDNA or fragment thereof. Thus, the two
polypeptides of the chimera may be separated from one another by protease digestion.

One useful expression vector for generating beta-globin chimeras is pSG5 (Stratagene),
which encodes rabbit beta-globin. Intron II of the rabbit beta-globin gene facilitates splicing of the
expressed transcript, and the polyadenylation signal incorporated into the construct increases the
level of expression. These techniques as described are well known to those skilled in the art of
molecular biology. Standard methods are published in methods texts such as Davis et al. (1986)
and many of the methods are available from Stratagene, Life Technologies, Inc., or Promega.
Polypeptide may additionally be produced from the construct using in vitro translation systems such
as the In vitro Express™ Translation Kit (Stratagene).

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. In addition, 
polypeptides of the invention may also include an initial modified methionine residue, in some
cases as a result of host-mediated processes. Thus, it is well known in the art that the N-terminal 
methionine encoded by the translation initiation codon generally is removed with high efficiency 
from any protein after translation in all eukaryotic cells. While the N-terminal methionine on most 
proteins also is efficiently removed in most prokaryotes, for some proteins, this prokaryotic removal 
process is inefficient, depending on the nature of the amino acid to which the N-terminal
methionine is covalently linked. 

From chemical synthesis 

In addition, polypeptides of the invention, especially short protein fragments, can be 
chemically synthesized using techniques known in the art (see, e.g., Creighton, 1983; and
Hunkapiller et al., 1984), which disclosures are hereby incorporated by reference in their entireties.
For example, a polypeptide corresponding to a fragment of a polypeptide sequence of the invention
can be synthesized by use of a peptide synthesizer. A variety of methods of making polypeptides
are known to those skilled in the art, including methods in which the carboxyl terminal amino acid
is bound to polyvinyl benzene or another suitable resin. The amino acid to be added possesses
blocking groups on its amino moiety and any side chain reactive groups so that only its carboxyl
moiety can react. The carboxyl group is activated with carbodiimide or another activating agent
and allowed to couple to the immobilized amino acid. After removal of the blocking group, the
cycle is repeated to generate a polypeptide having the desired sequence. Alternatively, the methods
described in U.S. Patent No. 5,049,656, which disclosure is hereby incorporated by reference in its
entirety, may be used.

Furthermore, if desired, nonclassical amino acids or chemical amino acid analogs can be 
introduced as a substitution or addition into the polypeptide sequence. Non-classical amino acids 
include, but are not limited to, the D-isomers of the common amino acids, 2,4-diaminobutyric
acid, α-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, γ-Abu, ε-Ahx,
6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine,
norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine,
t-butylalanine, phenylglycine, cyclohexylalanine, β-alanine, fluoroamino acids, designer amino acids
such as β-methyl amino acids, Cα-methyl amino acids, Nα-methyl amino acids, and amino acid
analogs in general. Furthermore, the amino acid can be D (dextrorotary) or L (levorotary).

Modifications

The invention encompasses polypeptides which are differentially modified during or after 
translation, e.g., by glycosylation, acetylation, phosphorylation, amidation, derivatization by known 
protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular
ligand, etc. Any of numerous chemical modifications may be carried out by known techniques,
including, but not limited to, specific chemical cleavage by cyanogen bromide, trypsin,
chymotrypsin, papain, V8 protease, NaBH4; acetylation, formylation, oxidation, reduction;
metabolic synthesis in the presence of tunicamycin; etc.

Additional post-translational modifications encompassed by the invention include, for
example, N-linked or O-linked carbohydrate chains, processing of N-terminal or C-terminal
ends, attachment of chemical moieties to the amino acid backbone, chemical modifications of
N-linked or O-linked carbohydrate chains, and addition or deletion of an N-terminal methionine
residue as a result of prokaryotic host cell expression. The polypeptides may also be modified with
a detectable label, such as an enzymatic, fluorescent, isotopic or affinity label to allow for detection
and isolation of the protein.

Also provided by the invention are chemically modified derivatives of the polypeptides of
the invention which may provide additional advantages such as increased solubility, stability and
circulating time of the polypeptide, or decreased immunogenicity. See U.S. Patent No. 4,179,337,
which disclosure is hereby incorporated by reference in its entirety. The chemical moieties for
derivatization may be selected from water soluble polymers such as polyethylene glycol, ethylene
glycol/propylene glycol copolymers, carboxymethylcellulose, dextran, polyvinyl alcohol and the
like. The polypeptides may be modified at random positions within the molecule, or at
predetermined positions within the molecule and may include one, two, three or more attached
chemical moieties.

The polymer may be of any molecular weight, and may be branched or unbranched. For
polyethylene glycol, the preferred molecular weight is between about 1 kDa and about 100 kDa (the
term "about" indicating that in preparations of polyethylene glycol, some molecules will weigh
more, some less, than the stated molecular weight) for ease in handling and manufacturing. Other
sizes may be used, depending on the desired therapeutic profile (e.g., the duration of sustained
release desired, the effects, if any, on biological activity, the ease in handling, the degree or lack of
antigenicity and other known effects of the polyethylene glycol on a therapeutic protein or analog).

The polyethylene glycol molecules (or other chemical moieties) should be attached to the
protein with consideration of effects on functional or antigenic domains of the protein. There are a
number of attachment methods available to those skilled in the art, e.g., EP 0 401 384 (coupling
PEG to G-CSF), and Malik et al. (1992) (reporting pegylation of GM-CSF using tresyl chloride),
which disclosures are hereby incorporated by reference in their entireties. For example,
polyethylene glycol may be covalently bound through amino acid residues via a reactive group,
such as a free amino or carboxyl group. Reactive groups are those to which an activated
polyethylene glycol molecule may be bound. The amino acid residues having a free amino group
may include lysine residues and the N-terminal amino acid residues; those having a free carboxyl
group may include aspartic acid residues, glutamic acid residues and the C-terminal amino acid
residue. Sulfhydryl groups may also be used as a reactive group for attaching the polyethylene
glycol molecules. Preferred for therapeutic purposes is attachment at an amino group, such as
attachment at the N-terminus or lysine group.

One may specifically desire proteins chemically modified at the N-terminus. Using
polyethylene glycol as an illustration of the present composition, one may select from a variety of
polyethylene glycol molecules (by molecular weight, branching, etc.), the proportion of
polyethylene glycol molecules to protein (polypeptide) molecules in the reaction mix, the type of
pegylation reaction to be performed, and the method of obtaining the selected N-terminally
pegylated protein. The method of obtaining the N-terminally pegylated preparation (i.e., separating
this moiety from other monopegylated moieties if necessary) may be by purification of the
N-terminally pegylated material from a population of pegylated protein molecules. Selective
chemical modification of proteins at the N-terminus may be accomplished by reductive
alkylation, which exploits differential reactivity of different types of primary amino groups (lysine
versus the N-terminal) available for derivatization in a particular protein. Under the appropriate
reaction conditions, substantially selective derivatization of the protein at the N-terminus with a
carbonyl group containing polymer is achieved.

Multimerization 

The polypeptides of the invention may be monomers or multimers (i.e., dimers, trimers,
tetramers and higher multimers). Accordingly, the present invention relates to monomers and
multimers of the polypeptides of the invention, their preparation, and compositions containing them.
In specific embodiments, the polypeptides of the invention are monomers, dimers, trimers or
tetramers. In additional embodiments, the multimers of the invention are at least dimers, at least
trimers, or at least tetramers.

Multimers encompassed by the invention may be homomers or heteromers. As used herein,
the term "homomer" refers to a multimer containing only polypeptides corresponding to the amino
acid sequences of SEQ ID NOs: 170-338, 456-560, 785-918 or encoded by the clone inserts of the
deposited clone pool (including fragments, variants, splice variants, and fusion proteins
corresponding to these polypeptides as described herein). These homomers may contain
polypeptides having identical or different amino acid sequences. In a specific embodiment, a
homomer of the invention is a multimer containing only polypeptides having an identical amino
acid sequence. In another specific embodiment, a homomer of the invention is a multimer
containing polypeptides having different amino acid sequences. In specific embodiments, the
multimer of the invention is a homodimer (e.g., containing polypeptides having identical or
different amino acid sequences) or a homotrimer (e.g., containing polypeptides having identical
and/or different amino acid sequences). In additional embodiments, the homomeric multimer of the
invention is at least a homodimer, at least a homotrimer, or at least a homotetramer.

As used herein, the term "heteromer" refers to a multimer containing one or more
heterologous polypeptides (i.e., polypeptides of different proteins) in addition to the polypeptides of
the invention. In a specific embodiment, the multimer of the invention is a heterodimer, a
heterotrimer, or a heterotetramer. In additional embodiments, the heteromeric multimer of the
invention is at least a heterodimer, at least a heterotrimer, or at least a heterotetramer.

Multimers of the invention may be the result of hydrophobic, hydrophilic, ionic and/or
covalent associations and/or may be indirectly linked by, for example, liposome formation. Thus,
in one embodiment, multimers of the invention, such as, for example, homodimers or homotrimers,
are formed when polypeptides of the invention contact one another in solution. In another
embodiment, heteromultimers of the invention, such as, for example, heterotrimers or
heterotetramers, are formed when polypeptides of the invention contact antibodies to the
polypeptides of the invention (including antibodies to the heterologous polypeptide sequence in a
fusion protein of the invention) in solution. In other embodiments, multimers of the invention are
formed by covalent associations with and/or between the polypeptides of the invention. Such
covalent associations may involve one or more amino acid residues contained in the polypeptide
sequence (e.g., that recited in the sequence listing, or contained in the polypeptide encoded by a
deposited clone). In one instance, the covalent associations are cross-linking between cysteine
residues located within the polypeptide sequences, which interact in the native (i.e., naturally
occurring) polypeptide. In another instance, the covalent associations are the consequence of
chemical or recombinant manipulation. Alternatively, such covalent associations may involve one
or more amino acid residues contained in the heterologous polypeptide sequence in a fusion protein
of the invention.

In one example, covalent associations are between the heterologous sequence contained in a
fusion protein of the invention (see, e.g., US Patent Number 5,478,925, which disclosure is hereby
incorporated by reference in its entirety). In a specific example, the covalent associations are
between the heterologous sequence contained in an Fc fusion protein of the invention (as described
herein). In another specific example, covalent associations of fusion proteins of the invention are
between heterologous polypeptide sequence from another protein that is capable of forming
covalently associated multimers, such as, for example, osteoprotegerin (see, e.g., International
Publication No. WO 98/49305, the contents of which are herein incorporated by reference in their
entirety). In another embodiment, two or more polypeptides of the invention are joined through
peptide linkers. Examples include those peptide linkers described in U.S. Pat. No. 5,073,627
(hereby incorporated by reference). Proteins comprising multiple polypeptides of the invention
separated by peptide linkers may be produced using conventional recombinant DNA technology.

Another method for preparing multimer polypeptides of the invention involves the use of
polypeptides of the invention fused to a leucine zipper or isoleucine zipper polypeptide sequence.
Leucine zipper and isoleucine zipper domains are polypeptides that promote multimerization of the
proteins in which they are found. Leucine zippers were originally identified in several
DNA-binding proteins, and have since been found in a variety of different proteins (Landschulz et
al., 1988). Among the known leucine zippers are naturally occurring peptides and derivatives
thereof that dimerize or trimerize. Examples of leucine zipper domains suitable for producing
soluble multimeric proteins of the invention are those described in PCT application WO 94/10308,
hereby incorporated by reference. Recombinant fusion proteins comprising a polypeptide of the
invention fused to a polypeptide sequence that dimerizes or trimerizes in solution are expressed in
suitable host cells, and the resulting soluble multimeric fusion protein is recovered from the culture
supernatant using techniques known in the art.

Trimeric polypeptides of the invention may offer the advantage of enhanced biological
activity. Preferred leucine zipper moieties and isoleucine moieties are those that preferentially form
trimers. One example is a leucine zipper derived from lung surfactant protein D (SPD), as
described in Hoppe et al. (1994) and in U.S. patent application Ser. No. 08/446,922, which
disclosure is hereby incorporated by reference in its entirety. Other peptides derived from naturally
occurring trimeric proteins may be employed in preparing trimeric polypeptides of the invention. In
another example, proteins of the invention are associated by interactions between Flag®
polypeptide sequence contained in fusion proteins of the invention containing Flag® polypeptide
sequence. In a further embodiment, proteins of the invention are associated by
interactions between heterologous polypeptide sequence contained in Flag® fusion proteins of the
invention and anti-Flag® antibody.

The multimers of the invention may be generated using chemical techniques known in the
art. For example, polypeptides desired to be contained in the multimers of the invention may be
chemically cross-linked using linker molecules and linker molecule length optimization techniques
known in the art (see, e.g., US Patent Number 5,478,925, which is herein incorporated by reference
in its entirety). Additionally, multimers of the invention may be generated using techniques known
in the art to form one or more inter-molecule cross-links between the cysteine residues located
within the sequence of the polypeptides desired to be contained in the multimer (see, e.g., US Patent
Number 5,478,925, which is herein incorporated by reference in its entirety). Further, polypeptides
of the invention may be routinely modified by the addition of cysteine or biotin to the C-terminus or
N-terminus of the polypeptide and techniques known in the art may be applied to generate
multimers containing one or more of these modified polypeptides (see, e.g., US Patent Number
5,478,925, which is herein incorporated by reference in its entirety). Additionally, other techniques
known in the art may be applied to generate liposomes containing the polypeptide components
desired to be contained in the multimer of the invention (see, e.g., US Patent Number 5,478,925,
which is herein incorporated by reference in its entirety).

Alternatively, multimers of the invention may be generated using genetic engineering
techniques known in the art. In one embodiment, polypeptides contained in multimers of the
invention are produced recombinantly using fusion protein technology described herein or
otherwise known in the art (see, e.g., US Patent Number 5,478,925, which is herein incorporated by
reference in its entirety). In a specific embodiment, polynucleotides coding for a homodimer of the
invention are generated by ligating a polynucleotide sequence encoding a polypeptide of the
invention to a sequence encoding a linker polypeptide and then further to a synthetic polynucleotide
encoding the translated product of the polypeptide in the reverse orientation from the original
C-terminus to the N-terminus (lacking the leader sequence) (see, e.g., US Patent Number 5,478,925,
which is herein incorporated by reference in its entirety). In another embodiment, recombinant
techniques described herein or otherwise known in the art are applied to generate recombinant
polypeptides of the invention which contain a transmembrane domain (or hydrophobic or signal
peptide) and which can be incorporated by membrane reconstitution techniques into liposomes (see,
e.g., US Patent Number 5,478,925, which is herein incorporated by reference in its entirety).

Mutated polypeptides 

To improve or alter the characteristics of GENSET polypeptides of the present invention,
protein engineering may be employed. Recombinant DNA technology known to those skilled in the
art can be used to create novel mutant proteins or muteins including single or multiple amino acid
substitutions, deletions, additions, or fusion proteins. Such modified polypeptides can show, e.g.,
increased/decreased biological activity or increased/decreased stability. In addition, they may be
purified in higher yields and show better solubility than the corresponding natural polypeptide, at
least under certain purification and storage conditions. Further, the polypeptides of the present
invention may be produced as multimers including dimers, trimers and tetramers. Multimerization
may be facilitated by linkers or recombinantly through heterologous polypeptides such as Fc regions.

N- and C-terminal deletions 

It is known in the art that one or more amino acids may be deleted from the N-terminus or
C-terminus without substantial loss of biological function. For instance, Ron et al. (1993) reported
modified KGF proteins that had heparin binding activity even if 3, 8, or 27 N-terminal amino acid
residues were missing. Accordingly, the present invention provides polypeptides having one or
more residues deleted from the amino terminus of the polypeptides of SEQ ID NOs: 170-338,
456-560, 785-918 or that encoded by the clone inserts of the deposited clone pool. Similarly, many
examples of biologically functional C-terminal deletion mutants are known. For instance,
Interferon gamma shows up to ten times higher activities by deleting 8-10 amino acid residues from
the C-terminus of the protein (See, e.g., Dobeli et al., 1988), which disclosure is hereby
incorporated by reference in its entirety. Accordingly, the present invention provides polypeptides
having one or more residues deleted from the carboxy terminus of the polypeptides of SEQ
ID NOs: 170-338, 456-560, 785-918 or encoded by the clone inserts of the deposited clone pool.
The invention also provides polypeptides having one or more amino acids deleted from both the
amino and the carboxyl termini as described below.
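The family of N-terminal, C-terminal, and combined deletion mutants described above can be enumerated mechanically. The following is a minimal sketch under stated assumptions: the function name `terminal_deletions` and the `min_length` parameter are hypothetical, the input is a one-letter amino acid string, and no claim is made about which truncations retain activity.

```python
from typing import Iterator, Tuple


def terminal_deletions(sequence: str, min_length: int = 1) -> Iterator[Tuple[int, int, str]]:
    """Yield (n_deleted, c_deleted, truncated_sequence) for each deletion mutant.

    At least one residue is removed in total (from the amino terminus, the
    carboxyl terminus, or both) and at least `min_length` residues are kept.
    """
    total = len(sequence)
    for n_del in range(total):
        for c_del in range(total - n_del):
            if n_del + c_del == 0:
                continue  # skip the undeleted, full-length polypeptide
            kept = sequence[n_del:total - c_del]
            if len(kept) >= min_length:
                yield n_del, c_del, kept
```

Calling `list(terminal_deletions("MKTAY", min_length=3))` in this sketch gives the five truncations of a five-residue sequence that keep at least three residues.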

Other mutations 

Other mutants in addition to N- and C-terminal deletion forms of the protein discussed
above are included in the present invention. It also will be recognized by one of ordinary skill in
the art that some amino acid sequences of the GENSET polypeptides of the present invention can be
varied without significant effect on the structure or function of the protein. If such differences in
sequence are contemplated, it should be remembered that there will be critical areas on the protein
which determine activity. Thus, the invention further includes variations of the GENSET
polypeptides which show substantial GENSET polypeptide activity. Such mutants include
deletions, insertions, inversions, repeats, and substitutions selected according to general rules
known in the art so as to have little effect on activity. For example, guidance concerning how to
make phenotypically silent amino acid substitutions is provided.

There are two main approaches for studying the tolerance of an amino acid sequence to
change (see, Bowie et al. 1994, which disclosure is hereby incorporated by reference in its entirety).
The first method relies on the process of evolution, in which mutations are either accepted or
rejected by natural selection.

The second approach uses genetic engineering to introduce amino acid changes at specific
positions of a cloned gene and selections or screens to identify sequences that maintain
functionality. These studies have revealed that proteins are surprisingly tolerant of amino acid
substitutions. The studies indicate which amino acid changes are likely to be permissive at a certain
position of the protein. For example, most buried amino acid residues require nonpolar side chains,
whereas few features of surface side chains are generally conserved. Other such phenotypically
silent substitutions are described by Bowie et al. (supra) and the references cited therein.

Typically seen as conservative substitutions are the replacements, one for another, among
the aliphatic amino acids Ala, Val, Leu and Ile; interchange of the hydroxyl residues Ser and Thr;
exchange of the acidic residues Asp and Glu; substitution between the amide residues Asn and Gln;
exchange of the basic residues Lys and Arg; and replacements among the aromatic residues Phe and
Tyr. Thus, the fragment, derivative, analog, or homologue of the polypeptide of the present
invention may be, for example: (i) one in which one or more of the amino acid residues are
substituted with a conserved or non-conserved amino acid residue (preferably a conserved amino
acid residue) and such substituted amino acid residue may or may not be one encoded by the genetic
code; or (ii) one in which one or more of the amino acid residues includes a substituent group; or
(iii) one in which the GENSET polypeptide is fused with another compound, such as a compound to
increase the half-life of the polypeptide (for example, polyethylene glycol); or (iv) one in which the
additional amino acids are fused to the above form of the polypeptide, such as an IgG Fc fusion
region peptide or leader or secretory sequence or a sequence which is employed for purification of
the above form of the polypeptide or a pro-protein sequence. Such fragments, derivatives and
analogs are deemed to be within the scope of those skilled in the art from the teachings herein.

Thus, the GENSET polypeptides of the present invention may include one or more amino
acid substitutions, deletions, or additions, either from natural mutations or human manipulation. As
indicated, changes are preferably of a minor nature, such as conservative amino acid substitutions
that do not significantly affect the folding or activity of the protein. The following groups of amino
acids generally represent equivalent changes: (1) Ala, Pro, Gly, Glu, Asp, Gln, Asn, Ser, Thr; (2)
Cys, Ser, Tyr, Thr; (3) Val, Ile, Leu, Met, Ala, Phe; (4) Lys, Arg, His; (5) Phe, Tyr, Trp, His.
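The five groups of equivalent changes listed above lend themselves to a simple lookup. The sketch below is illustrative only: it assumes the standard one-letter amino acid codes, reads the group members as the standard residues Gln and Ile where the three-letter codes are involved, and uses hypothetical names (`EQUIVALENCE_GROUPS`, `is_equivalent_change`, `count_substitutions`) that do not come from the specification.

```python
from typing import Tuple

# Groups (1) to (5) from the text, written with one-letter codes.
EQUIVALENCE_GROUPS = [
    set("APGEDQNST"),  # (1) Ala, Pro, Gly, Glu, Asp, Gln, Asn, Ser, Thr
    set("CSYT"),       # (2) Cys, Ser, Tyr, Thr
    set("VILMAF"),     # (3) Val, Ile, Leu, Met, Ala, Phe
    set("KRH"),        # (4) Lys, Arg, His
    set("FYWH"),       # (5) Phe, Tyr, Trp, His
]


def is_equivalent_change(original: str, replacement: str) -> bool:
    """True if two different residues share at least one of the listed groups."""
    if original == replacement:
        return False
    return any(original in g and replacement in g for g in EQUIVALENCE_GROUPS)


def count_substitutions(reference: str, variant: str) -> Tuple[int, int]:
    """Return (equivalent, other) substitution counts for two aligned sequences."""
    if len(reference) != len(variant):
        raise ValueError("sequences must be aligned and of equal length")
    equivalent = other = 0
    for ref_res, var_res in zip(reference, variant):
        if ref_res == var_res:
            continue
        if is_equivalent_change(ref_res, var_res):
            equivalent += 1
        else:
            other += 1
    return equivalent, other
```

A count of this kind could be compared against an upper bound on the number of conservative substitutions; the sketch handles substitutions only and does not align sequences or handle insertions and deletions.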

A specific embodiment of a modified GENSET peptide molecule of interest according to
the present invention includes, but is not limited to, a peptide molecule which is resistant to
proteolysis, namely a peptide in which the -CONH- peptide bond is modified and replaced by a
(CH2NH) reduced bond, a (NHCO) retro inverse bond, a (CH2-O) methylene-oxy bond, a (CH2-S)
thiomethylene bond, a (CH2CH2) carba bond, a (CO-CH2) ketomethylene bond, a (CHOH-CH2)
hydroxyethylene bond, a (N-N) bond, an E-alkene bond or also a -CH=CH- bond. The invention
also encompasses a human GENSET polypeptide or a fragment or a variant thereof in which at least
one peptide bond has been modified as described above.

Amino acids in the GENSET proteins of the present invention that are essential for function
can be identified by methods known in the art, such as site-directed mutagenesis or
alanine-scanning mutagenesis (see, e.g., Cunningham et al. 1989, which disclosure is hereby
incorporated by reference in its entirety). The latter procedure introduces single alanine mutations
at every residue in the molecule. The resulting mutant molecules are then tested for biological
activity using assays appropriate for measuring the function of the particular protein. Of special
interest are substitutions of charged amino acids with other charged or neutral amino acids which
may produce proteins with highly desirable improved characteristics, such as less aggregation.
Aggregation may not only reduce activity but also be problematic when preparing pharmaceutical
formulations, because aggregates can be immunogenic (see, e.g., Pinckard et al., 1967; Robbins et
al., 1987; and Cleland et al., 1993).
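The design step of an alanine scan, generating one single-substitution mutant per position, can be sketched as follows. This is only an illustration of how the mutant set is laid out: the function name `alanine_scan` is hypothetical, mutants are named in the common "original residue, position, new residue" style, and positions that already hold alanine are skipped, which is an assumption of this sketch and is not stated in the text. The activity assays themselves are outside its scope.

```python
from typing import List, Tuple


def alanine_scan(sequence: str) -> List[Tuple[str, str]]:
    """Return (mutant_name, mutant_sequence) pairs, one per non-alanine position."""
    mutants = []
    for index, residue in enumerate(sequence):
        if residue == "A":
            continue  # assumption: nothing to substitute at an existing alanine
        name = f"{residue}{index + 1}A"  # 1-based position, e.g. K2A
        mutant = sequence[:index] + "A" + sequence[index + 1:]
        mutants.append((name, mutant))
    return mutants
```

Each returned sequence differs from the input at exactly one position, so a loss of activity in a given mutant can be attributed to that residue.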

A further embodiment of the invention relates to a polypeptide which comprises the amino
acid sequence of a GENSET polypeptide having an amino acid sequence which contains at least one
conservative amino acid substitution, but not more than 50 conservative amino acid substitutions,
not more than 40 conservative amino acid substitutions, not more than 30 conservative amino acid
substitutions, and not more than 20 conservative amino acid substitutions. Also provided are
polypeptides which comprise the amino acid sequence of a GENSET polypeptide, having at least
one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino acid substitutions.

Polypeptide fragments 

Structural definition 

The present invention is further directed to fragments of the amino acid sequences
described herein such as the polypeptides of SEQ ID NOs: 170-338, 456-560, 785-918 or those
encoded by the clone inserts of the deposited clone pool. More specifically, the present invention
embodies purified, isolated, and recombinant polypeptides comprising at least 6, preferably at least
8 to 10, more preferably 12, 15, 20, 25, 30, 35, 40, 50, 60, 75, 100, 125, 150, 175, 200, 225, 250,
275, or 300 consecutive amino acids of a polypeptide selected from the group consisting of the
sequences of SEQ ID NOs: 170-338, 456-560, 785-918, the polypeptides encoded by the clone
inserts of the deposited clone pool, and other polypeptides of the present invention.

In addition to the above polypeptide fragments, further preferred sub-genuses of
polypeptides comprise at least 6 amino acids, wherein "at least 6" is defined as any integer between
6 and the integer representing the C-terminal amino acid of the polypeptide of the present invention
including the polypeptide sequences of the sequence listing below. Further included are species of
polypeptide fragments at least 6 amino acids in length, as described above, that are further specified
in terms of their N-terminal and C-terminal positions. However, included in the present invention
as individual species are all polypeptide fragments, at least 6 amino acids in length, as described
above, and may be particularly specified by an N-terminal and C-terminal position. That is, every
combination of an N-terminal and C-terminal position that a fragment at least 6 contiguous amino
acid residues in length could occupy, on any given amino acid sequence of the sequence listing or
of the present invention, is included in the present invention.
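The set of fragment species defined by N-terminal and C-terminal positions can be enumerated, and counted in closed form, for a sequence of any given length. The sketch below is a minimal illustration assuming 1-based, inclusive positions; the names `fragment_positions` and `count_fragments` are hypothetical, and the enumeration includes the span covering the whole sequence.

```python
from typing import Iterator, Tuple


def fragment_positions(length: int, min_size: int = 6) -> Iterator[Tuple[int, int]]:
    """Yield (n_terminal, c_terminal) for every span of at least `min_size` residues.

    Positions are 1-based and inclusive, so a span (a, b) has b - a + 1 residues.
    """
    for start in range(1, length - min_size + 2):
        for end in range(start + min_size - 1, length + 1):
            yield start, end


def count_fragments(length: int, min_size: int = 6) -> int:
    """Closed-form count of the spans produced by `fragment_positions`."""
    k = length - min_size + 1  # number of possible start positions
    return k * (k + 1) // 2 if k > 0 else 0
```

For a sequence of N residues the count is (N - 5)(N - 4)/2 when the minimum size is 6, which grows quadratically with N and is consistent with not listing each species individually.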

The present invention also provides for the exclusion of any fragment species specified by
N-terminal and C-terminal positions or of any fragment sub-genus specified by size in amino acid
residues as described above. Any number of fragments specified by N-terminal and C-terminal
positions or by size in amino acid residues as described above may be excluded as individual
species.

The above polypeptide fragments of the present invention can be immediately envisaged
using the above description and are therefore not individually listed solely for the purpose of not
unnecessarily lengthening the specification. Moreover, the above fragments need not have a
GENSET biological activity, although polypeptides having these activities are preferred
embodiments of the invention, since they would be useful, for example, in immunoassays, in
epitope mapping, epitope tagging, as vaccines, and as molecular weight markers. The above
fragments may also be used to generate antibodies to a particular portion of the polypeptide. These
antibodies can then be used in immunoassays well known in the art to distinguish between human
and non-human cells and tissues or to determine whether cells or tissues in a biological sample are
or are not of the same type which express the polypeptides of the present invention.

It is noted that the above species of polypeptide fragments of the present invention may
alternatively be described by the formula "a to b"; where "a" equals the N-terminal most amino acid
position and "b" equals the C-terminal most amino acid position of the polypeptide; and further
where "a" equals an integer between 1 and the number of amino acids of the polypeptide sequence
of the present invention minus 6, and where "b" equals an integer between 7 and the number of
amino acids of the polypeptide sequence of the present invention; and where "a" is an integer
smaller than "b" by at least 6.

The present invention also provides for the exclusion of any species of polypeptide
fragments of the present invention specified by 5' and 3' positions or sub-genuses of polypeptides
specified by size in amino acids as described above. Any number of fragments specified by 5' and
3' positions or by size in amino acids, as described above, may be excluded.

Domains

Preferred polypeptide fragments of the invention are domains of polypeptides of the
invention. Such domains may eventually comprise linear or structural motifs and signatures
including, but not limited to, leucine zippers, helix-turn-helix motifs, post-translational modification
sites such as glycosylation sites, ubiquitination sites, alpha helices, and beta sheets, signal sequences
encoding signal peptides which direct the secretion of the encoded proteins, sequences implicated in
transcription regulation such as homeoboxes, acidic stretches, enzymatic active sites, substrate
binding sites, and enzymatic cleavage sites. Such domains may present a particular biological
activity such as DNA or RNA-binding, secretion of proteins, transcription regulation, enzymatic
activity, substrate binding activity, etc.

A domain generally has a size of between 3 and 1000 amino acids. In a preferred
embodiment, domains comprise a number of amino acids that is any integer between 6 and 200.
Domains may be synthesized using any methods known to those skilled in the art, including those
disclosed herein, particularly in the section entitled "Preparation of the polypeptides of the
invention". Methods for determining the amino acids which make up a domain with a particular
biological activity include mutagenesis studies and assays to determine the biological activity to be
tested.

Alternatively, the polypeptides of the invention may be scanned for motifs, domains and/or
signatures in databases using any computer method known to those skilled in the art. Searchable
databases include Prosite (Hofmann et al., 1999; Bucher and Bairoch, 1994), Pfam (Sonnhammer et
al., 1997; Henikoff et al., 2000; Bateman et al., 2000), Blocks (Henikoff et al., 2000), Print
(Attwood et al., 1996), Prodom (Sonnhammer and Kahn, 1994; Corpet et al., 2000), Sbase (Pongor
et al., 1993; Murvai et al., 2000), Smart (Schultz et al., 1998), Dali/FSSP (Holm and Sander, 1996,
1997 and 1999), HSSP (Sander and Schneider, 1991), CATH (Orengo et al., 1997; Pearl et al.,
2000), SCOP (Murzin et al., 1995; Lo Conte et al., 2000), COG (Tatusov et al., 1997 and 2000),
specific family databases and derivatives thereof (Nevill-Manning et al., 1998; Yona et al., 1999;
Attwood et al., 2000), each of which disclosures is hereby incorporated by reference in its
entirety. For a review on available databases, see issue 1 of volume 28 of Nucleic Acids Research
(2000), which disclosure is hereby incorporated by reference in its entirety.
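As an illustration of what a simple motif scan involves, the sketch below searches a sequence for a short consensus pattern using a regular expression. The pattern shown, N followed by any residue except P, then S or T, then any residue except P, is a widely cited consensus for N-glycosylation sites and is an assumption introduced here for illustration; it is not taken from this specification. The names `N_GLYCOSYLATION` and `find_motif_sites` are hypothetical, and real searches would normally rely on the curated databases and tools named above.

```python
import re
from typing import List, Pattern, Tuple

# Consensus N-{P}-[ST]-{P}, wrapped in a lookahead so overlapping sites are reported.
N_GLYCOSYLATION = re.compile(r"(?=(N[^P][ST][^P]))")


def find_motif_sites(sequence: str, pattern: Pattern[str] = N_GLYCOSYLATION) -> List[Tuple[int, str]]:
    """Return (1-based start position, matched residues) for every motif hit."""
    return [(match.start() + 1, match.group(1))
            for match in pattern.finditer(sequence.upper())]
```

A hit from a pattern this short only marks a candidate site; whether the site is actually used has to be established experimentally.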

The domains of the present invention preferably comprise 6 to 200 amino acids (i.e. any
integer between 6 and 200, inclusive) of a polypeptide of the present invention. Also included in
the present invention are domain fragments between the integers of 6 and the full length GENSET
sequence of the sequence listing. All combinations of sequences between the integers of 6 and the
full-length sequence of a GENSET polypeptide are included. The domain fragments may be
specified by either the number of contiguous amino acid residues (as a sub-genus) or by specific
N-terminal and C-terminal positions (as species) as described above for the polypeptide fragments of
the present invention. Any number of domain fragments of the present invention may also be
excluded in the same manner.

Epitopes and Antibody Fusions: 

A preferred embodiment of the present invention is directed to epitope-bearing polypeptides
and epitope-bearing polypeptide fragments. These epitopes may be "antigenic epitopes" or both an
"antigenic epitope" and an "immunogenic epitope". An "immunogenic epitope" is defined as a part
of a protein that elicits an antibody response in vivo when the polypeptide is the immunogen. On
the other hand, a region of polypeptide to which an antibody binds is defined as an "antigenic
determinant" or "antigenic epitope." The number of immunogenic epitopes of a protein generally is
less than the number of antigenic epitopes (See, e.g., Geysen et al., 1984), which disclosure is
hereby incorporated by reference in its entirety. It is particularly noted that although a particular
epitope may not be immunogenic, it is nonetheless useful since antibodies can be made to both
immunogenic and antigenic epitopes.

An epitope can comprise as few as 3 amino acids in a spatial conformation that is unique
to the epitope. Generally an epitope consists of at least 6 such amino acids, and more often at least
8-10 such amino acids. In a preferred embodiment, antigenic epitopes comprise a number of amino
acids that is any integer between 3 and 50. Fragments which function as epitopes may be produced
by any conventional means (See, e.g., Houghten, 1985), also further described in U.S. Patent No.
4,631,211, which disclosures are hereby incorporated by reference in their entireties. Methods for
determining the amino acids which make up an epitope include x-ray crystallography,
2-dimensional nuclear magnetic resonance, and epitope mapping, e.g., the Pepscan method described
by Geysen et al. (1984); PCT Publication No. WO 84/03564; and PCT Publication No. WO
84/03506, which disclosures are hereby incorporated by reference in their entireties. Another
example is the algorithm of Jameson and Wolf (1988) (said reference incorporated by reference in
its entirety). The Jameson-Wolf antigenic analysis, for example, may be performed using the
computer program PROTEAN, using default parameters (Version 4.0 Windows, DNASTAR, Inc.,
1228 South Park Street, Madison, WI).

The epitope-bearing fragments of the present invention preferably comprise 6 to 50 amino
acids (i.e., any integer between 6 and 50, inclusive) of a polypeptide of the present invention. Also
included in the present invention are antigenic fragments between the integers of 6 and the
full-length GENSET sequence of the sequence listing. All combinations of sequences between the
integers of 6 and the full-length sequence of a GENSET polypeptide are included. The
epitope-bearing fragments may be specified by either the number of contiguous amino acid residues (as a
sub-genus) or by specific N-terminal and C-terminal positions (as species) as described above for
the polypeptide fragments of the present invention. Any number of epitope-bearing fragments of
the present invention may also be excluded in the same manner.
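
By way of illustration only, the two ways of specifying fragments described above (by length, as a sub-genus, or by N-terminal and C-terminal positions, as species) can be sketched in Python. The function names and the 11-residue sequence are hypothetical placeholders and are not taken from the sequence listing.

```python
def fragments_of_length(seq, n):
    """Return (n_term, c_term, fragment) for every contiguous n-residue fragment.

    Positions are 1-based and inclusive, mirroring N-/C-terminal numbering.
    """
    if n < 1 or n > len(seq):
        return []
    return [(i + 1, i + n, seq[i:i + n]) for i in range(len(seq) - n + 1)]


def all_fragments(seq, min_len=6, max_len=None):
    """Yield every fragment whose length is an integer from min_len to max_len.

    max_len defaults to the full length of the sequence.
    """
    max_len = len(seq) if max_len is None else min(max_len, len(seq))
    for n in range(min_len, max_len + 1):
        yield from fragments_of_length(seq, n)


# Hypothetical sequence used only to show the enumeration.
example = "MKTLLVAGSWQ"
six_mers = fragments_of_length(example, 6)   # the "sub-genus" of 6-residue fragments
species = list(all_fragments(example, 6))    # every individual fragment of length >= 6
```

Each tuple identifies one species by its terminal positions, so excluding a fragment "in the same manner" amounts to removing the corresponding tuple from the enumeration.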

Antigenic epitopes are useful, for example, to raise antibodies, including monoclonal
antibodies that specifically bind the epitope (see, Wilson et al., 1984; and Sutcliffe et al., 1983,
which disclosures are hereby incorporated by reference in their entireties). The antibodies are then
used in various techniques such as diagnostic and tissue/cell identification techniques, as described
herein, and in purification methods such as immunoaffinity chromatography.

Similarly, immunogenic epitopes can be used to induce antibodies according to methods
well known in the art (See, Sutcliffe et al., supra; Wilson et al., supra; Chow et al. (1985); and
Bittle et al. (1985), which disclosures are hereby incorporated by reference in their entireties). A
preferred immunogenic epitope includes the natural GENSET protein. The immunogenic epitopes
may be presented together with a carrier protein, such as an albumin, to an animal system (such as
rabbit or mouse) or, if it is long enough (at least about 25 amino acids), without a carrier. However,
immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to be sufficient
to raise antibodies capable of binding to, at the very least, linear epitopes in a denatured polypeptide
(e.g., in Western blotting).

Epitope-bearing polypeptides of the present invention are used to induce antibodies
according to methods well known in the art including, but not limited to, in vivo immunization, in
vitro immunization, and phage display methods (See, e.g., Sutcliffe et al., supra; Wilson et al.,
supra; and Bittle et al., supra). If in vivo immunization is used, animals may be immunized with
free peptide; however, anti-peptide antibody titer may be boosted by coupling of the peptide to a
macromolecular carrier, such as keyhole limpet hemocyanin (KLH) or tetanus toxoid. For instance,
peptides containing cysteine residues may be coupled to a carrier using a linker such as
m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may be coupled to
carriers using a more general linking agent such as glutaraldehyde. Animals such as rabbits, rats
and mice are immunized with either free or carrier-coupled peptides, for instance, by intraperitoneal
and/or intradermal injection of emulsions containing about 100 µg of peptide or carrier protein and
Freund's adjuvant. Several booster injections may be needed, for instance, at intervals of about two
weeks, to provide a useful titer of anti-peptide antibody, which can be detected, for example, by
ELISA assay using free peptide adsorbed to a solid surface. The titer of anti-peptide antibodies in
serum from an immunized animal may be increased by selection of anti-peptide antibodies, for
instance, by adsorption to the peptide on a solid support and elution of the selected antibodies
according to methods well known in the art.

As one of skill in the art will appreciate, and as discussed above, the polypeptides of the
present invention comprising an immunogenic or antigenic epitope can be fused to heterologous
polypeptide sequences. For example, the polypeptides of the present invention may be fused with
the constant domain of immunoglobulins (IgA, IgE, IgG, IgM), or portions thereof (CH1, CH2,
CH3, any combination thereof including both entire domains and portions thereof) resulting in
chimeric polypeptides. These fusion proteins facilitate purification, and show an increased half-life
in vivo. This has been shown, e.g., for chimeric proteins consisting of the first two domains of the
human CD4-polypeptide and various domains of the constant regions of the heavy or light chains of
mammalian immunoglobulins (See, e.g., EPA 0,394,827; and Traunecker et al., 1988, which
disclosures are hereby incorporated by reference in their entireties). Fusion proteins that have a
disulfide-linked dimeric structure due to the IgG portion can also be more efficient in binding and
neutralizing other molecules than monomeric polypeptides or fragments thereof alone (See, e.g.,
Fountoulakis et al., 1995, which disclosure is hereby incorporated by reference in its entirety).
Nucleic acids encoding the above epitopes can also be recombined with a gene of interest as an
epitope tag to aid in detection and purification of the expressed polypeptide.

Additional fusion proteins of the invention may be generated through the techniques of
gene-shuffling, motif-shuffling, exon-shuffling, or codon-shuffling (collectively referred to as
"DNA shuffling"). DNA shuffling may be employed to modulate the activities of polypeptides of
the present invention, thereby effectively generating agonists and antagonists of the polypeptides.
See, for example, U.S. Patent Nos. 5,605,793; 5,811,238; 5,834,252; 5,837,458; and Patten et al.
(1997); Harayama (1998); Hansson et al. (1999); and Lorenzo and Blasco (1998). (Each of these
documents is hereby incorporated by reference.) In one embodiment, one or more components,
motifs, sections, parts, domains, fragments, etc., of coding polynucleotides of the invention, or the
polypeptides encoded thereby, may be recombined with one or more components, motifs, sections,
parts, domains, fragments, etc. of one or more heterologous molecules.

The present invention further encompasses any combination of the polypeptide fragments 
listed in this section. 

Antibodies 

Definitions



The present invention further relates to antibodies and T-cell antigen receptors (TCR),
which specifically bind the polypeptides, and more specifically, the epitopes of the polypeptides of
the present invention. The antibodies of the present invention include IgG (including IgG1, IgG2,
IgG3, and IgG4), IgA (including IgA1 and IgA2), IgD, IgE, IgM, and IgY. The term "antibody"
(Ab) refers to a polypeptide or group of polypeptides comprising at least one binding
domain, where a binding domain is formed from the folding of variable domains of an antibody
molecule to form three-dimensional binding spaces with an internal surface shape and charge
distribution complementary to the features of an antigenic determinant of an antigen, which allows
an immunological reaction with the antigen. As used herein, the term "antibody" is meant to
include whole antibodies, including single-chain whole antibodies, and antigen-binding fragments
thereof. In a preferred embodiment, the antibodies are human antigen-binding antibody fragments of
the present invention, which include, but are not limited to, Fab, Fab', F(ab)2 and F(ab')2, Fd,
single-chain Fvs (scFv), single-chain antibodies, disulfide-linked Fvs (sdFv) and fragments comprising
either a VL or VH domain. The antibodies may be from any animal origin including birds and mammals.
Preferably, the antibodies are human, murine, rabbit, goat, guinea pig, camel, horse, or chicken.

Antigen-binding antibody fragments, including single-chain antibodies, may comprise the
variable region(s) alone or in combination with all or part of the following: hinge region,
CH1, CH2, and CH3 domains. Also included in the invention are any combinations of variable
region(s) and hinge region, CH1, CH2, and CH3 domains. The present invention further includes
chimeric, humanized, and human monoclonal and polyclonal antibodies, which specifically bind the
polypeptides of the present invention. The present invention further includes antibodies that are
anti-idiotypic to the antibodies of the present invention.

The antibodies of the present invention may be monospecific, bispecific, or trispecific, or may
have greater multispecificity. Multispecific antibodies may be specific for different epitopes of a
polypeptide of the present invention or may be specific for both a polypeptide of the present
invention and heterologous compositions, such as a heterologous polypeptide or solid
support material. See, e.g., WO 93/17715; WO 92/08802; WO 91/00360; WO 92/05793; Tutt et al.
(1991); US Patents 5,573,920, 4,474,893, 5,601,819, 4,714,681, 4,925,648; Kostelny et al. (1992),
which disclosures are hereby incorporated by reference in their entireties.

Antibodies of the present invention may be described or specified in terms of the epitope(s)
or epitope-bearing portion(s) of a polypeptide of the present invention, which are recognized or
specifically bound by the antibody. The antibodies may specifically bind a complete protein encoded
by a nucleic acid of the present invention, or a fragment thereof. Therefore, the epitope(s) or
epitope-bearing polypeptide portion(s) may be specified as described herein, e.g., by N-terminal and
C-terminal positions, by size in contiguous amino acid residues, or otherwise described herein
(including the sequence listing). Antibodies which specifically bind any epitope or polypeptide of
the present invention may also be excluded as individual species. Therefore, the present invention
includes antibodies that specifically bind specified polypeptides of the present invention, and allows
for the exclusion of the same.

Thus, another embodiment of the present invention is a purified or isolated antibody
capable of specifically binding to a polypeptide comprising a sequence selected from the group
consisting of the sequences of SEQ ID NOs: 170-338, 456-560, 785-918 and the sequences of the
clone inserts of the deposited clone pool. In one aspect of this embodiment, the antibody is capable
of binding to an epitope-containing polypeptide comprising at least 6 consecutive amino acids,
preferably at least 8 to 10 consecutive amino acids, more preferably at least 12, 15, 20, 25, 30, 40,
50, or 100 consecutive amino acids of a sequence selected from the group consisting of SEQ ID
NOs: 170-338, 456-560, 785-918 and sequences of the clone inserts of the deposited clone pool.

Antibodies of the present invention may also be described or specified in terms of their
cross-reactivity. Antibodies that do not specifically bind any other analog, ortholog, or homologue
of the polypeptides of the present invention are included. Antibodies that do not bind polypeptides
with less than 95%, less than 90%, less than 85%, less than 80%, less than 75%, less than 70%, less
than 65%, less than 60%, less than 55%, and less than 50% identity (as calculated using methods
known in the art and described herein, e.g., using FASTDB and the parameters set forth herein) to a
polypeptide of the present invention are also included in the present invention. Further included in
the present invention are antibodies that bind only polypeptides encoded by polynucleotides
that hybridize to a polynucleotide of the present invention under stringent hybridization
conditions (as described herein). Antibodies of the present invention may also be described or
specified in terms of their binding affinity. Preferred binding affinities include those with a
dissociation constant or Kd less than 5×10⁻⁶ M, 10⁻⁶ M, 5×10⁻⁷ M, 10⁻⁷ M, 5×10⁻⁸ M, 10⁻⁸ M,
5×10⁻⁹ M, 10⁻⁹ M, 5×10⁻¹⁰ M, 10⁻¹⁰ M, 5×10⁻¹¹ M, 10⁻¹¹ M, 5×10⁻¹² M, 10⁻¹² M, 5×10⁻¹³ M,
10⁻¹³ M, 5×10⁻¹⁴ M, 10⁻¹⁴ M, 5×10⁻¹⁵ M, and 10⁻¹⁵ M.
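
As a purely illustrative sketch, the series of preferred dissociation-constant ceilings can be generated and compared against a measured value. The sketch assumes the series alternates between 5×10⁻ⁿ M and 10⁻ⁿ M for n = 6 to 15; the function names and the example Kd are hypothetical.

```python
from fractions import Fraction


def kd_thresholds(min_exp=6, max_exp=15):
    """Return the ceilings 5x10^-n M and 10^-n M for n = min_exp..max_exp.

    Exact fractions are used so that comparisons are free of rounding error.
    """
    thresholds = []
    for n in range(min_exp, max_exp + 1):
        thresholds.append(Fraction(5, 10 ** n))
        thresholds.append(Fraction(1, 10 ** n))
    return thresholds


def thresholds_met(kd_molar):
    """Return every ceiling that a measured Kd (in mol/L) falls strictly below.

    A lower Kd means tighter binding, so tighter binders satisfy more ceilings.
    """
    kd = Fraction(str(kd_molar))
    return [t for t in kd_thresholds() if kd < t]


# Hypothetical antibody with a measured Kd of 2 nM.
met = thresholds_met(2e-9)
```

For the hypothetical 2 nM binder, the ceilings from 5×10⁻⁶ M down to 5×10⁻⁹ M are satisfied, while 10⁻⁹ M and below are not.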

The invention also concerns a purified or isolated antibody capable of specifically binding 
to a mutated GENSET protein or to a fragment or variant thereof comprising an epitope of the
mutated GENSET protein. 



Preparation of antibodies

The antibodies of the present invention may be prepared by any suitable method known in
the art. Some of these methods are described in more detail in the example entitled "Preparation of
Antibody Compositions to the GENSET protein". For example, a polypeptide of the present
invention or an antigenic fragment thereof can be administered to an animal in order to induce the
production of sera containing "polyclonal antibodies". As used herein, the term "monoclonal
antibody" is not limited to antibodies produced through hybridoma technology; rather, it refers to
an antibody that is derived from a single clone, including a eukaryotic, prokaryotic, or phage clone,
and not to the method by which it is produced. Monoclonal antibodies can be prepared using a wide
variety of techniques known in the art including the use of hybridoma, recombinant, and phage
display technology.

Hybridoma techniques include those known in the art (See, e.g., Harlow et al., 1988;
Hammerling et al., 1981). (Said references incorporated by reference in their entireties.) Fab and
F(ab')2 fragments may be produced, for example, from hybridoma-produced antibodies by
proteolytic cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to
produce F(ab')2 fragments).

Alternatively, antibodies of the present invention can be produced through the application
of recombinant DNA technology or through synthetic chemistry using methods known in the art.
For example, the antibodies of the present invention can be prepared using various phage display
methods known in the art. In phage display methods, functional antibody domains are displayed on
the surface of a phage particle, which carries polynucleotide sequences encoding them. Phage with
a desired binding property are selected from a repertoire or combinatorial antibody library (e.g.,
human or murine) by selecting directly with antigen, typically antigen bound or captured to a solid
surface or bead. Phage used in these methods are typically filamentous phage including fd and M13
with Fab, Fv or disulfide stabilized Fv antibody domains recombinantly fused to either the phage
gene III or gene VIII protein. Examples of phage display methods that can be used to make the
antibodies of the present invention include those disclosed in Brinkman et al. (1995); Ames et al.
(1995); Kettleborough et al. (1994); Persic et al. (1997); Burton et al. (1994); PCT/GB91/01134;
WO 90/02809; WO 91/10737; WO 92/01047; WO 92/18619; WO 93/11236; WO 95/15982; WO
95/20401; and US Patents 5,698,426, 5,223,409, 5,403,484, 5,580,717, 5,427,908, 5,750,753,
5,821,047, 5,571,698, 5,427,908, 5,516,637, 5,780,225, 5,658,727 and 5,733,743 (said references
incorporated by reference in their entireties).

As described in the above references, after phage selection, the antibody coding regions
from the phage can be isolated and used to generate whole antibodies, including human antibodies,
or any other desired antigen binding fragment, and expressed in any desired host including
mammalian cells, insect cells, plant cells, yeast, and bacteria. For example, techniques to
recombinantly produce Fab, Fab', F(ab)2 and F(ab')2 fragments can also be employed using
methods known in the art such as those disclosed in WO 92/22324; Mullinax et al. (1992);
Sawai et al. (1995); and Better et al. (1988) (said references incorporated by reference in their
entireties).

Examples of techniques which can be used to produce single-chain Fvs and antibodies
include those described in U.S. Patents 4,946,778 and 5,258,498; Huston et al. (1991); Shu et al.
(1993); and Skerra et al. (1988), which disclosures are hereby incorporated by reference in their
entireties. For some uses, including in vivo use of antibodies in humans and in vitro detection
assays, it may be preferable to use chimeric, humanized, or human antibodies. Methods for
producing chimeric antibodies are known in the art. See, e.g., Morrison (1985); Oi et al. (1986);
Gillies et al. (1989); and US Patent 5,807,715, which disclosures are hereby incorporated by
reference in their entireties. Antibodies can be humanized using a variety of techniques including
CDR-grafting (EP 0 239 400; WO 91/09967; US Patents 5,530,101 and 5,585,089), veneering or
resurfacing (EP 0 592 106; EP 0 519 596; Padlan, 1991; Studnicka et al., 1994; Roguska et al.,
1994), and chain shuffling (US Patent 5,565,332), which disclosures are hereby incorporated by
reference in their entireties. Human antibodies can be made by a variety of methods known in the
art including phage display methods described above. See also, US Patents 4,444,887, 4,716,111,
5,545,806, and 5,814,318; WO 98/46645; WO 98/50433; WO 98/24893; WO 96/34096; WO
96/33735; and WO 91/10741 (said references incorporated by reference in their entireties).

Further included in the present invention are antibodies recombinantly fused or chemically
conjugated (including both covalent and non-covalent conjugations) to a polypeptide of the
present invention. The antibodies may be specific for antigens other than polypeptides of the
present invention. For example, antibodies of the present invention may be recombinantly fused or
conjugated to molecules useful as labels in detection assays and effector molecules such as
heterologous polypeptides, drugs, or toxins. See, e.g., WO 92/08495; WO 91/14438; WO 89/12624;
US Patent 5,314,995; and EP 0 396 387, which disclosures are hereby incorporated by reference in
their entireties. Fused antibodies may also be used to target the polypeptides of the present
invention to particular cell types, either in vitro or in vivo, by fusing or conjugating the polypeptides
of the present invention to antibodies specific for particular cell surface receptors. Antibodies fused
or conjugated to the polypeptides of the present invention may also be used in in vitro immunoassays
and purification methods using methods known in the art (See, e.g., Harlow et al., supra; WO
93/21232; EP 0 439 095; Naramura, M. et al., 1994; US Patent 5,474,981; Gillies et al., 1992; Fell et
al., 1991) (said references incorporated by reference in their entireties).

The present invention further includes compositions comprising the polypeptides of the
present invention fused or conjugated to antibody domains other than the variable regions. For
example, the polypeptides of the present invention may be fused or conjugated to an antibody Fc
region, or portion thereof. The antibody portion fused to a polypeptide of the present invention may
comprise the hinge region, CH1 domain, CH2 domain, and CH3 domain or any combination of
whole domains or portions thereof. The polypeptides of the present invention may be fused or
conjugated to the above antibody portions to increase the in vivo half-life of the polypeptides or for
use in immunoassays using methods known in the art. The polypeptides may also be fused or
conjugated to the above antibody portions to form multimers. For example, Fc portions fused to the
polypeptides of the present invention can form dimers through disulfide bonding between the Fc
portions. Higher multimeric forms can be made by fusing the polypeptides to portions of IgA and
IgM. Methods for fusing or conjugating the polypeptides of the present invention to antibody
portions are known in the art. See, e.g., US Patents 5,336,603, 5,622,929, 5,359,046, 5,349,053,
5,447,851, 5,112,946; EP 0 307 434, EP 0 367 166; WO 96/04388, WO 91/06570; Ashkenazi et al.
(1991); Zheng et al. (1995); and Vil et al. (1992) (said references incorporated by reference in their
entireties).

Non-human animals or mammals, whether wild-type or transgenic, which express a
different species of GENSET than the one to which antibody binding is desired, and animals which
do not express GENSET (i.e., a GENSET knock out animal as described herein) are particularly
useful for preparing antibodies. GENSET knock out animals will recognize all or most of the
exposed regions of a GENSET protein as foreign antigens, and therefore produce antibodies with a
wider array of GENSET epitopes. Moreover, smaller polypeptides with only 10 to 30 amino acids
may be useful in obtaining specific binding to any one of the GENSET proteins. In addition, the
humoral immune system of animals which produce a species of GENSET that resembles the
antigenic sequence will preferentially recognize the differences between the animal's native
GENSET species and the antigen sequence, and produce antibodies to these unique sites in the
antigen sequence. Such a technique will be particularly useful in obtaining antibodies that
specifically bind to any one of the GENSET proteins.

The antibodies of the invention may be labeled by, e.g., any one of the radioactive, 
fluorescent or enzymatic labels known in the art. 

Uses of polynucleotides

Uses of polynucleotides as reagents

The polynucleotides of the present invention, particularly those described in the
"Oligonucleotide primers and probes" section, may be used as reagents in isolation procedures,
diagnostic assays, and forensic procedures. For example, sequences from the GENSET
polynucleotides of the invention may be detectably labeled and used as probes to isolate other
sequences capable of hybridizing to them. In addition, sequences from the GENSET
polynucleotides of the invention may be used to design PCR primers to be used in isolation,
diagnostic, or forensic procedures.

In forensic analyses 

PCR primers may be used in forensic analyses, such as the DNA fingerprinting techniques
described below. Such analyses may utilize detectable probes or primers based on the sequences of
the polynucleotides of the invention. Consequently, the present invention encompasses methods of
identification of an individual using the polynucleotides of the invention in forensic analyses,
wherein said method includes the steps of:

a) obtaining a biological sample containing nucleic acid material from an individual;

b) obtaining an identification pattern for this individual using the polynucleotides of the
invention, particularly using GENSET primers and probes;

c) comparing said identification pattern with a reference identification pattern; and

d) determining whether said identification pattern is identical to said reference identification
pattern.

In one embodiment of this method, the identification pattern consists of sequences of
amplicons obtained using GENSET primers as explained in the sections entitled "Forensic
Matching by DNA Sequencing" and "Positive Identification by DNA Sequencing".

In another embodiment, the identification pattern consists of unique band or dot patterns
obtained using any method described in the sections entitled "Southern Blot Forensic
Identification", "Dot Blot Identification Procedure" and "Alternative "Fingerprint" Identification
Technique".

Table I provides sets of related cDNAs of the invention, e.g., sequences that represent allelic
variants of a single sequence. Such variants are especially useful for the herein-described forensic
analyses, and are also useful as polymorphic markers to examine, e.g., associations between the
herein-discussed GENSET genes and various diseases or conditions.

Forensic Matching by DNA Sequencing 

In one exemplary method, DNA samples are isolated from forensic specimens of, for
example, hair, semen, blood or skin cells by conventional methods. A panel of PCR primers
designed from different polynucleotides of the invention, using any technique known to those skilled
in the art including those described herein, is then utilized to amplify DNA of approximately
100-200 bases in length from the forensic specimen. Corresponding sequences are obtained from a test
subject. Each of these identification DNAs is then sequenced using standard techniques, and a
simple database comparison determines the differences, if any, between the sequences from the
subject and those from the sample. Statistically significant differences between the subject's DNA
sequences and those from the sample conclusively prove a lack of identity. This lack of identity can
be proven, for example, with only one sequence. Identity, on the other hand, should be
demonstrated with a large number of sequences, all matching. Preferably, a minimum of 50
statistically identical sequences of 100 bases in length are used to prove identity between the
subject and the sample.
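
The comparison logic described above can be sketched as follows. This is a simplified illustration, not a validated forensic procedure: it treats any sequence difference at a shared locus as excluding identity (ignoring sequencing error and the statistical assessment the text calls for), and the locus names, function name and default of 50 loci are assumptions made for the example.

```python
def compare_panels(sample, subject, min_loci=50):
    """Compare two panels of amplified sequences keyed by locus name.

    Returns a (verdict, mismatched_loci) tuple:
      "excluded"           - at least one shared locus differs
      "identity supported" - all shared loci match and there are >= min_loci
      "inconclusive"       - all shared loci match but too few were compared
    """
    shared = sorted(set(sample) & set(subject))
    mismatched = [locus for locus in shared
                  if sample[locus].upper() != subject[locus].upper()]
    if mismatched:
        # A single differing sequence is enough to show lack of identity.
        return "excluded", mismatched
    if len(shared) >= min_loci:
        return "identity supported", []
    return "inconclusive", []


# Hypothetical panel: 50 loci of 100 bases each.
panel = {f"locus{i}": "ACGT" * 25 for i in range(50)}
verdict, differing = compare_panels(panel, dict(panel))
```

The asymmetry in the text is reflected directly: one mismatch returns "excluded", whereas a positive result requires the full panel to match.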

Positive Identification by DNA Sequencing 

The "Forensic Matching by DNA Sequencing" technique described herein may also be used
on a larger scale to provide a unique fingerprint-type identification of any individual. In this
technique, primers are prepared from a large number of polynucleotides of the invention.
Preferably, 20 to 50 different primers are used. These primers are used to obtain a corresponding
number of PCR-generated DNA segments from the individual in question. Each of these DNA
segments is sequenced. The database of sequences generated through this procedure uniquely
identifies the individual from whom the sequences were obtained. The same panel of primers may
then be used at any later time to absolutely correlate tissue or other biological specimen with that
individual.

Southern Blot Forensic Identification 

The "Positive Identification by DNA Sequencing" procedure described herein is repeated to
obtain a panel of at least 10 amplified sequences from an individual and a specimen. Preferably, the
panel contains at least 50 amplified sequences. More preferably, the panel contains 100 amplified
sequences. In some embodiments, the panel contains 200 amplified sequences. This
PCR-generated DNA is then digested with one or a combination of restriction enzymes, preferably
four-base-specific restriction enzymes. Such enzymes are commercially available and known to those of skill in the
art. After digestion, the resultant gene fragments are size separated in multiple duplicate wells on
an agarose gel and transferred to nitrocellulose using Southern blotting techniques well known to
those with skill in the art. For a review of Southern blotting see Davis et al. (1986), which
disclosure is hereby incorporated by reference in its entirety.

A panel of probes based on the sequences of the polynucleotides of the invention, or
fragments thereof of at least 10 bases, is radioactively or colorimetrically labeled using methods
known in the art, such as nick translation or end labeling, and hybridized to the Southern blot using
techniques known in the art. Preferably, the probe comprises at least 12, 15, or 17 consecutive
nucleotides from the polynucleotide of the invention. More preferably, the probe comprises at least
20-30 consecutive nucleotides from the polynucleotide of the invention. In some embodiments, the
probe comprises more than 30 nucleotides from the polynucleotide of the invention. In other
embodiments, the probe comprises at least 40, at least 50, at least 75, at least 100, at least 150, or at
least 200 consecutive nucleotides from the polynucleotide of the invention.

Preferably, at least 5 to 10 of these labeled probes are used, and more preferably at least
about 20 or 30 are used to provide a unique pattern. The resultant bands appearing from the
hybridization of a large sample of polynucleotides of the invention will be a unique identifier. Since
the restriction enzyme cleavage will be different for every individual, the band pattern on the
Southern blot will also be unique. Increasing the number of cDNA probes will provide a
statistically higher level of confidence in the identification since there will be an increased number
of sets of bands used for identification.
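
How a sequence difference within a restriction site alters the band pattern can be illustrated with an in silico digest. The sketch below is a simplified model under stated assumptions: it digests a single linear strand, uses a small table of well-known four-base cutters chosen for the example (the text does not name specific enzymes), and reports fragment lengths rather than simulating gel migration or probe hybridization.

```python
import re

# Recognition site and top-strand cut offset for a few four-base cutters.
# AluI AG^CT, HaeIII GG^CC, MboI ^GATC, TaqI T^CGA.
ENZYMES = {
    "AluI": ("AGCT", 2),
    "HaeIII": ("GGCC", 2),
    "MboI": ("GATC", 0),
    "TaqI": ("TCGA", 1),
}


def digest(seq, enzyme_names):
    """Cut a linear sequence at every site of the named enzymes."""
    seq = seq.upper()
    cuts = set()
    for name in enzyme_names:
        site, offset = ENZYMES[name]
        # A lookahead pattern finds overlapping occurrences of the site.
        for match in re.finditer("(?=" + site + ")", seq):
            cuts.add(match.start() + offset)
    bounds = [0] + sorted(c for c in cuts if 0 < c < len(seq)) + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


def band_pattern(seq, enzyme_names):
    """Fragment lengths, largest first, as they would order on a gel."""
    return sorted((len(f) for f in digest(seq, enzyme_names)), reverse=True)


# Hypothetical alleles differing by one base inside the AluI site.
allele_a = "AAAAGCTTTTGGCCAA"
allele_b = "AAAAGTTTTTGGCCAA"
pattern_a = band_pattern(allele_a, ["AluI", "HaeIII"])
pattern_b = band_pattern(allele_b, ["AluI", "HaeIII"])
```

Here the single-base change removes the AluI site in the second allele, so two of the three fragments merge into one longer band.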

Dot Blot Identification Procedure 

Another technique for identifying individuals using the polynucleotide sequences disclosed 
herein utilizes a dot blot hybridization technique. 

Genomic DNA is isolated from nuclei of the subject to be identified. Oligonucleotide probes of
approximately 30 bp in length are synthesized that correspond to at least 10, preferably 50,
sequences from the polynucleotide of the invention. The probes are used to hybridize to the
genomic DNA under conditions known to those in the art. The oligonucleotides are end labeled
with ³²P using polynucleotide kinase (Pharmacia). Dot blots are created by spotting the genomic
DNA onto nitrocellulose or the like using a vacuum dot blot manifold (BioRad, Richmond,
California). The genomic DNA is baked or UV cross-linked to the nitrocellulose filter, which is then
prehybridized and hybridized with labeled probe using techniques known in the art (Davis et
al., 1986). The ³²P-labeled DNA fragments are sequentially hybridized under successively more stringent
conditions to detect minimal differences between the 30 bp sequence and the DNA.
Tetramethylammonium chloride is useful for identifying clones containing small numbers of
nucleotide mismatches (Wood et al., 1985). A unique pattern of dots distinguishes one individual
from another individual.
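
The idea of detecting small numbers of mismatches at increasing stringency can be sketched in code. This is only an analogy: hybridization stringency is modeled as a maximum number of tolerated mismatches, only one strand is searched, and the sequences and function names are hypothetical.

```python
def best_match_mismatches(probe, target):
    """Fewest mismatches between the probe and any same-length window of target."""
    probe, target = probe.upper(), target.upper()
    n = len(probe)
    if n > len(target):
        raise ValueError("probe is longer than target")
    return min(
        sum(a != b for a, b in zip(probe, target[i:i + n]))
        for i in range(len(target) - n + 1)
    )


def dot_pattern(probes, genomic, max_mismatches=0):
    """Return one True/False 'dot' per probe at the given stringency.

    max_mismatches=0 stands in for the most stringent wash; larger values
    stand in for more permissive conditions.
    """
    return [best_match_mismatches(p, genomic) <= max_mismatches for p in probes]


# Hypothetical short sequences (real probes would be about 30 bases long).
genomic = "ACGTACGTTTGACCA"
probes = ["GTTTGA", "GTTAGA"]
stringent = dot_pattern(probes, genomic, max_mismatches=0)
relaxed = dot_pattern(probes, genomic, max_mismatches=1)
```

A probe that gives a dot only under the relaxed setting marks a position where the subject's DNA differs slightly from the probe sequence.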

Alternative "Fingerprint" Identification Technique 

In a representative alternative fingerprinting procedure, the probes are derived from
cDNAs. Preferably, a plurality of probes having sequences from different genes are used as
follows. Polynucleotides containing at least 10 consecutive bases from these sequences can be used
as probes. Preferably, the probe comprises at least 12, 15, or 17 consecutive nucleotides from the
polynucleotide of the invention. More preferably, the probe comprises at least 20-30 consecutive
nucleotides from the polynucleotide of the invention. In some embodiments, the probe comprises
more than 30 nucleotides from the polynucleotide of the invention. In other embodiments, the
probe comprises at least 40, at least 50, at least 75, at least 100, at least 150, or at least 200
consecutive nucleotides from the polynucleotide of the invention.
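
As an illustration of what "at least 10 consecutive bases" means in practice, the following minimal Python sketch enumerates every window of a chosen length from a sequence. The function name `consecutive_probes` and the example sequence are hypothetical and are not taken from the sequence listing.

```python
def consecutive_probes(sequence, length=10):
    """Return every run of `length` consecutive bases found in `sequence`.

    The lower bound of 10 follows the minimum probe length stated in the text;
    the function itself is only an illustrative helper.
    """
    if length < 10:
        raise ValueError("the text contemplates probes of at least 10 consecutive bases")
    seq = sequence.upper()
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]


if __name__ == "__main__":
    # Hypothetical 12-base sequence, giving three overlapping 10-base windows.
    for probe in consecutive_probes("ACGTACGTACGT", 10):
        print(probe)
```

Longer probes (12, 15, 17, 20-30 bases, and so on) are obtained by changing the `length` argument.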

Oligonucleotides, generally 20-mers, are prepared from a large number, e.g. 50, 100, or
200, of polynucleotides of the invention using commercially available oligonucleotide services such
as Genset (Paris, France). Cell samples from the test subject are processed for DNA using
techniques well known to those with skill in the art. The nucleic acid is digested with restriction
enzymes such as EcoRI and XbaI. Following digestion, samples are applied to wells for
electrophoresis. The procedure, as known in the art, may be modified to accommodate
polyacrylamide electrophoresis; however, in this example, samples containing 5 µg of DNA are
loaded into wells and separated on 0.8% agarose gels. The separated DNA is transferred from the
gels onto nitrocellulose using standard Southern blotting techniques.
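
The reason a digest yields an individual-specific band pattern can be sketched in a few lines of Python. This is a simplified in-silico model that assumes a linear molecule, considers only the top strand, and uses the well-known recognition sites of EcoRI (G^AATTC) and XbaI (T^CTAGA). The two example sequences are invented for illustration.

```python
# Recognition site and the offset within the site after which the top strand is cut.
ENZYMES = {"EcoRI": ("GAATTC", 1), "XbaI": ("TCTAGA", 1)}


def digest(sequence, enzymes=("EcoRI", "XbaI")):
    """Return fragment lengths (largest first) for a linear molecule."""
    seq = sequence.upper()
    cuts = set()
    for name in enzymes:
        site, offset = ENZYMES[name]
        start = seq.find(site)
        while start != -1:
            cuts.add(start + offset)
            start = seq.find(site, start + 1)
    bounds = [0] + sorted(cuts) + [len(seq)]
    return sorted((b - a for a, b in zip(bounds, bounds[1:])), reverse=True)


if __name__ == "__main__":
    individual_a = "AAAAGAATTCCCCCTCTAGAGGG"  # carries both sites
    individual_b = "AAAAGAGTTCCCCCTCTAGAGGG"  # single-base change destroys the EcoRI site
    print(digest(individual_a))
    print(digest(individual_b))
```

A single-base difference that removes a recognition site merges two fragments into one, which is the kind of difference the Southern blot pattern reveals.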

10 ng of each of the oligonucleotides are pooled and end-labeled with 32P. The
nitrocellulose is prehybridized with blocking solution and hybridized with the labeled probes.
Following hybridization and washing, the nitrocellulose filter is exposed to X-Omat AR X-ray film.
The resulting hybridization pattern will be unique for each individual.

It is additionally contemplated within this example that the number of probe sequences used 
can be varied for additional accuracy or clarity. 

To find corresponding genomic DNA sequences

The GENSET cDNAs of the invention may also be used to clone sequences located 
upstream of the cDNAs of the invention on the corresponding genomic DNA. Such upstream 
sequences may be capable of regulating gene expression, including promoter sequences, enhancer 
sequences, and other upstream sequences which influence transcription or translation levels. Once 
identified and cloned, these upstream regulatory sequences may be used in expression vectors
designed to direct the expression of an inserted gene in a desired spatial, temporal, developmental, 
or quantitative fashion. 

Use of cDNAs or Fragments thereof to Clone Upstream Sequences from Genomic DNA 

Sequences derived from polynucleotides of the invention may be used to isolate the
promoters of the corresponding genes using chromosome walking techniques. In one chromosome
walking technique, which utilizes the GenomeWalker™ kit available from Clontech, five complete
genomic DNA samples are each digested with a different restriction enzyme which has a 6 base
recognition site and leaves a blunt end. Following digestion, oligonucleotide adapters are ligated to
each end of the resulting genomic DNA fragments.

For each of the five genomic DNA libraries, a first PCR reaction is performed according to
the manufacturer's instructions (which are incorporated herein by reference) using an outer adaptor
primer provided in the kit and an outer gene specific primer. The gene specific primer should be
selected to be specific for the polynucleotide of the invention of interest and should have a melting
temperature, length, and location in the polynucleotide of the invention which is consistent with its
use in PCR reactions. Each first PCR reaction contains 5 ng of genomic DNA, 5 µl of 10X Tth
reaction buffer, 0.2 mM of each dNTP, 0.2 µM each of outer adaptor primer and outer gene specific
primer, 1.1 mM of Mg(OAc)2, and 1 µl of the Tth polymerase 50X mix in a total volume of 50 µl.
The reaction cycle for the first PCR reaction is as follows: 1 min at 94 degrees Celsius / 2 sec at 94
degrees Celsius, 3 min at 72 degrees Celsius (7 cycles) / 2 sec at 94 degrees Celsius, 3 min at 67
degrees Celsius (32 cycles) / 5 min at 67 degrees Celsius.

The product of the first PCR reaction is diluted and used as a template for a second PCR
reaction according to the manufacturer's instructions using a pair of nested primers which are
located internally on the amplicon resulting from the first PCR reaction. For example, 5 µl of the
reaction product of the first PCR reaction mixture may be diluted 180 times. Reactions are made in
a 50 µl volume having a composition identical to that of the first PCR reaction except the nested
primers are used. The first nested primer is specific for the adaptor, and is provided with the
GenomeWalker™ kit. The second nested primer is specific for the particular polynucleotide of the
invention for which the promoter is to be cloned and should have a melting temperature, length, and
location in the polynucleotide of the invention which is consistent with its use in PCR reactions.
The reaction parameters of the second PCR reaction are as follows: 1 min at 94 degrees Celsius / 2
sec at 94 degrees Celsius, 3 min at 72 degrees Celsius (6 cycles) / 2 sec at 94 degrees Celsius, 3 min
at 67 degrees Celsius (25 cycles) / 5 min at 67 degrees Celsius.
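
The two cycling programs given above can be written down as plain data, which makes it easy to check them or to add up the programmed hold time. The sketch below is only a bookkeeping aid: the names `FIRST_PCR`, `SECOND_PCR` and `hold_seconds` are hypothetical, and instrument ramp times between temperatures are ignored.

```python
# Each stage is (number_of_cycles, [(seconds, temperature_in_Celsius), ...]),
# transcribed from the first and second PCR reactions described in the text.
FIRST_PCR = [
    (1, [(60, 94)]),
    (7, [(2, 94), (180, 72)]),
    (32, [(2, 94), (180, 67)]),
    (1, [(300, 67)]),
]
SECOND_PCR = [
    (1, [(60, 94)]),
    (6, [(2, 94), (180, 72)]),
    (25, [(2, 94), (180, 67)]),
    (1, [(300, 67)]),
]


def hold_seconds(program):
    """Total programmed hold time in seconds, ignoring ramping between steps."""
    return sum(cycles * sum(sec for sec, _ in steps) for cycles, steps in program)


if __name__ == "__main__":
    for name, program in (("first", FIRST_PCR), ("second", SECOND_PCR)):
        print(f"{name} PCR: {hold_seconds(program) / 60:.1f} min of programmed holds")
```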

The product of the second PCR reaction is purified, cloned, and sequenced using standard
techniques. Alternatively, two or more human genomic DNA libraries can be constructed by using
two or more restriction enzymes. The digested genomic DNA is cloned into vectors which can be
converted into single stranded, circular, or linear DNA. A biotinylated oligonucleotide comprising
at least 15 nucleotides from the polynucleotide of the invention sequence is hybridized to the single
stranded DNA. Hybrids between the biotinylated oligonucleotide and the single stranded DNA
containing the polynucleotide of the invention sequence are isolated as described herein.
Thereafter, the single stranded DNA containing the polynucleotide of the invention sequence is
released from the beads and converted into double stranded DNA using a primer specific for the
polynucleotide of the invention sequence or a primer corresponding to a sequence included in the
cloning vector. The resulting double stranded DNA is transformed into bacteria. DNAs containing
the GENSET polynucleotide sequences are identified by colony PCR or colony hybridization.

Identification of Promoters in Cloned Upstream Sequences 

Once the upstream genomic sequences have been cloned and sequenced as described above,
prospective promoters and transcription start sites within the upstream sequences may be identified
by comparing the sequences upstream of the polynucleotides of the invention with databases
containing known transcription start sites, transcription factor binding sites, or promoter sequences.

In addition, promoters in the upstream sequences may be identified using promoter reporter
vectors as follows. The expression of the reporter gene will be detected when placed under the
control of regulatory active polynucleotide fragments or variants of the GENSET promoter region
located upstream of the first exon of the GENSET gene. Suitable promoter reporter vectors, into
which the GENSET promoter sequences may be cloned, include pSEAP-Basic, pSEAP-Enhancer,
pβgal-Basic, pβgal-Enhancer, or pEGFP-1 Promoter Reporter vectors available from Clontech, or
pGL2-basic or pGL3-basic promoterless luciferase reporter gene vector from Promega. Briefly,
each of these promoter reporter vectors includes multiple cloning sites positioned upstream of a
reporter gene encoding a readily assayable protein such as secreted alkaline phosphatase, luciferase,
beta-galactosidase, or green fluorescent protein. The sequences upstream of the GENSET coding
region are inserted into the cloning sites upstream of the reporter gene in both orientations and
introduced into an appropriate host cell. The level of reporter protein is assayed and compared to
the level obtained from a vector which lacks an insert in the cloning site. The presence of an
elevated expression level in the vector containing the insert with respect to the control vector
indicates the presence of a promoter in the insert. If necessary, the upstream sequences can be
cloned into vectors which contain an enhancer for increasing transcription levels from weak
promoter sequences. A significant level of expression above that observed with the vector lacking
an insert indicates that a promoter sequence is present in the inserted upstream sequence.
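
The comparison against the insert-less control vector amounts to a fold-over-control calculation. The Python sketch below shows one way to tabulate it for the two insert orientations. The function names, the example signal values, and the `min_fold` cut-off of 2.0 are all hypothetical. The text only requires "a significant level of expression" above the control and does not fix a numerical threshold.

```python
def fold_over_control(insert_signal, empty_vector_signal):
    """Reporter signal of the insert-bearing vector relative to the empty vector."""
    if empty_vector_signal <= 0:
        raise ValueError("control signal must be positive")
    return insert_signal / empty_vector_signal


def orientations_with_activity(signals, empty_vector_signal, min_fold=2.0):
    """Return the orientations whose signal reaches `min_fold` times the control.

    `signals` maps an orientation label to its measured reporter signal.
    `min_fold` is an illustrative cut-off, not a value taken from the text.
    """
    return [
        orientation
        for orientation, signal in signals.items()
        if fold_over_control(signal, empty_vector_signal) >= min_fold
    ]


if __name__ == "__main__":
    measured = {"forward": 900.0, "reverse": 110.0}  # invented example readings
    print(orientations_with_activity(measured, empty_vector_signal=100.0))
```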

Promoter sequences within the upstream genomic DNA may be further defined by site
directed mutagenesis, linker scanning analysis, or other techniques familiar to those skilled in the
art. For example, the boundaries of promoters may be further investigated by constructing nested 5'
and/or 3' deletions in the upstream DNA using conventional techniques such as Exonuclease III or
appropriate restriction endonuclease digestion. The resulting deletion fragments can be inserted
into the promoter reporter vector to determine whether the deletion has increased, reduced or
eliminated promoter activity, such as described, for example, by Coles et al. (1998), the disclosure
of which is incorporated herein by reference in its entirety. In this way, the boundaries of the
promoters may be defined. If desired, potential individual regulatory sites within the promoter may
be identified using site directed mutagenesis or linker scanning to obliterate potential transcription
factor binding sites within the promoter individually or in combination. The effects of these
mutations on transcription levels may be determined by inserting the mutations into cloning sites in
promoter reporter vectors. This type of assay is well known to those skilled in the art and is
described in WO 97/17359; US Patent No. 5,374,544; EP 582 796; US Patent No. 5,698,389; US
Patent No. 5,643,746; US Patent No. 5,502,176; and US Patent No. 5,266,488; the disclosures of
which are incorporated by reference herein in their entirety.

The strength and the specificity of the promoter of each GENSET gene can be assessed
through the expression levels of a detectable polynucleotide operably linked to the GENSET
promoter in different types of cells and tissues. The detectable polynucleotide may be either a
polynucleotide that specifically hybridizes with a predefined oligonucleotide probe, or a
polynucleotide encoding a detectable protein, including a GENSET polypeptide or a fragment or a
variant thereof. This type of assay is well known to those skilled in the art and is described in US
Patent No. 5,502,176; and US Patent No. 5,266,488; the disclosures of which are incorporated by
reference herein in their entirety. Some of the methods are discussed in more detail elsewhere in
the application.

The promoters and other regulatory sequences located upstream of the polynucleotides of
the invention may be used to design expression vectors capable of directing the expression of an
inserted gene in a desired spatial, temporal, developmental, or quantitative manner. A promoter
capable of directing the desired spatial, temporal, developmental, and quantitative patterns may be
selected using the results of the expression analysis described herein. For example, if a promoter
which confers a high level of expression in muscle is desired, the promoter sequence upstream of a
polynucleotide of the invention derived from an mRNA which is expressed at a high level in muscle
may be used in the expression vector. Such vectors are described in more detail elsewhere in the
application.

Preferably, the desired promoter is placed near multiple restriction sites to facilitate the
cloning of the desired insert downstream of the promoter, such that the promoter is able to drive
expression of the inserted gene. The promoter may be inserted in conventional nucleic acid
backbones designed for extrachromosomal replication, integration into the host chromosomes or
transient expression. Suitable backbones for the present expression vectors include retroviral
backbones, backbones from eukaryotic episomes such as SV40 or Bovine Papilloma Virus,
backbones from bacterial episomes, or artificial chromosomes.

Preferably, the expression vectors also include a polyA signal downstream of the multiple
restriction sites for directing the polyadenylation of mRNA transcribed from the gene inserted into
the expression vector.

To find similar sequences 

Polynucleotides of the invention may be used to isolate and/or purify nucleic acids similar
thereto using any methods well known to those skilled in the art including the techniques based on
hybridization or on amplification described in this section. These methods may be used to obtain the
genomic DNAs which encode the mRNAs from which the GENSET cDNAs are derived, mRNAs
corresponding to GENSET cDNAs, or nucleic acids which are homologous to GENSET cDNAs or
fragments thereof, such as variants, species homologues or orthologs. Thus, a plurality of cDNAs
similar to GENSET polynucleotides may be provided as cDNA libraries for subsequent evaluation
of the encoded proteins or used in diagnostic assays as described herein. cDNAs prepared by any
method described therein may be subsequently engineered to obtain nucleic acids which include
desired fragments of the cDNA using conventional techniques such as subcloning, PCR, or in vitro
oligonucleotide synthesis. For example, nucleic acids which include only the coding sequences
may be obtained using techniques known to those skilled in the art. Similarly, nucleic acids
containing any other desired fragment of the coding sequences for the encoded protein may be
obtained.

Indeed, cDNAs of the present invention or fragments thereof may be used to isolate nucleic
acids similar to cDNAs from a cDNA library or a genomic DNA library. Such cDNA libraries or
genomic DNA libraries may be obtained from a commercial source or made using techniques
familiar to those skilled in the art such as those described in PCT publication WO 00/37491, which
disclosure is hereby incorporated by reference in its entirety. Examples of methods for obtaining
nucleic acids similar to GENSET polynucleotides are described below.

Hybridization-based methods 

Techniques for identifying cDNA clones in a cDNA library which hybridize to a given
probe sequence are disclosed in Sambrook et al. (1989) and in Hames and Higgins (1985), the
disclosures of which are incorporated herein by reference in their entireties. The same techniques
may be used to isolate genomic DNAs.

Briefly, cDNA or genomic DNA clones which hybridize to the detectable probe are
identified and isolated for further manipulation as follows. Any polynucleotide fragment of the
invention may be used as a probe, in particular those defined in the "Oligonucleotide primers and
probes" section. A probe comprising at least 10 consecutive nucleotides from a GENSET cDNA or
fragment thereof is labeled with a detectable label such as a radioisotope or a fluorescent molecule.
Preferably, the probe comprises at least 12, 15, or 17 consecutive nucleotides from the cDNA or
fragment thereof. More preferably, the probe comprises 20 to 30 consecutive nucleotides from the
cDNA or fragment thereof. In some embodiments, the probe comprises more than 30 nucleotides
from the cDNA or fragment thereof.

Techniques for labeling the probe are well known and include phosphorylation with
polynucleotide kinase, nick translation, in vitro transcription, and non radioactive techniques. The
cDNAs or genomic DNAs in the library are transferred to a nitrocellulose or nylon filter and
denatured. After blocking of non specific sites, the filter is incubated with the labeled probe for an
amount of time sufficient to allow binding of the probe to cDNAs or genomic DNAs containing a
sequence capable of hybridizing thereto.

By varying the stringency of the hybridization conditions used to identify cDNAs or
genomic DNAs which hybridize to the detectable probe, cDNAs or genomic DNAs having different
levels of identity to the probe can be identified and isolated as described below.



Stringent conditions

"Stringent hybridization conditions" are defined as conditions in which only nucleic acids
having a high level of identity to the probe are able to hybridize to said probe. These conditions may
be calculated as follows.

For probes between 14 and 70 nucleotides in length the melting temperature (Tm) is
calculated using the formula: Tm = 81.5 + 16.6(log [Na+]) + 0.41(fraction G+C) - (600/N), where N is
the length of the probe.

If the hybridization is carried out in a solution containing formamide, the melting
temperature may be calculated using the equation: Tm = 81.5 + 16.6(log [Na+]) + 0.41(fraction G+C)
- (0.63% formamide) - (600/N), where N is the length of the probe.
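
The two Tm equations above can be evaluated with a short Python sketch. Several points are assumptions rather than statements of the text: the logarithm is taken as base 10, the sodium concentration is expressed in mol/L, and the G+C term is entered as a percentage (0 to 100), as in the commonly published form of this equation, even though the text writes "fraction G+C". The function name `melting_temperature` and the example probe are hypothetical.

```python
import math


def melting_temperature(probe, na_molar, formamide_percent=0.0):
    """Estimate Tm in degrees Celsius for a probe of 14 to 70 nucleotides.

    Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%G+C) - 0.63*(% formamide) - 600/N
    With formamide_percent=0 this reduces to the formamide-free formula.
    """
    seq = probe.upper()
    n = len(seq)
    if not 14 <= n <= 70:
        raise ValueError("the formula is stated for probes of 14 to 70 nucleotides")
    if na_molar <= 0:
        raise ValueError("sodium concentration must be positive")
    gc_percent = 100.0 * (seq.count("G") + seq.count("C")) / n  # assumption: percent, not fraction
    return (
        81.5
        + 16.6 * math.log10(na_molar)
        + 0.41 * gc_percent
        - 0.63 * formamide_percent
        - 600.0 / n
    )


if __name__ == "__main__":
    probe = "ATGC" * 5  # invented 20-mer with 50% G+C
    print(melting_temperature(probe, na_molar=1.0))
    print(melting_temperature(probe, na_molar=1.0, formamide_percent=50.0))
```

A hybridization temperature can then be chosen relative to the computed Tm, as described in the paragraphs that follow.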

Prehybridization may be carried out in 6X SSC, 5X Denhardt's reagent, 0.5% SDS, 100 µg
denatured fragmented salmon sperm DNA or 6X SSC, 5X Denhardt's reagent, 0.5% SDS, 100 µg
denatured fragmented salmon sperm DNA, 50% formamide. The formulas for SSC and Denhardt's
solutions are listed in Sambrook et al., 1986.

Hybridization is conducted by adding the detectable probe to the prehybridization solutions
listed above. Where the probe comprises double stranded DNA, it is denatured before addition to
the hybridization solution. The filter is contacted with the hybridization solution for a sufficient
period of time to allow the probe to hybridize to nucleic acids containing sequences complementary
thereto or homologous thereto. For probes over 200 nucleotides in length, the hybridization may be
carried out at 15-25°C below the Tm. For shorter probes, such as oligonucleotide probes, the
hybridization may be conducted at 15-25°C below the Tm. Preferably, for hybridizations in 6X
SSC, the hybridization is conducted at approximately 68°C. Preferably, for hybridizations in 50%
formamide containing solutions, the hybridization is conducted at approximately 42°C.

Following hybridization, the filter is washed in 2X SSC, 0.1% SDS at room temperature for
15 minutes. The filter is then washed with 0.1X SSC, 0.5% SDS at room temperature for 30
minutes to 1 hour. Thereafter, the filter is washed at the hybridization temperature in 0.1X SSC,
0.5% SDS. A final wash is conducted in 0.1X SSC at room temperature.

Nucleic acids which have hybridized to the probe are identified by autoradiography or other 
conventional techniques. 

Low and moderate conditions 

Changes in the stringency of hybridization and signal detection are primarily accomplished
through the manipulation of formamide concentration (lower percentages of formamide result in
lowered stringency), salt conditions, or temperature. The above procedure may thus be modified to
identify nucleic acids having decreasing levels of identity to the probe sequence. For example, the
hybridization temperature may be decreased in increments of 5°C from 68°C to 42°C in a
hybridization buffer having a sodium concentration of approximately 1 M. Following hybridization,
the filter may be washed with 2X SSC, 0.5% SDS at the temperature of hybridization. These
conditions are considered to be "moderate" conditions above 50°C and "low" conditions below
50°C. Alternatively, the hybridization may be carried out in buffers, such as 6X SSC, containing
formamide at a temperature of 42°C. In this case, the concentration of formamide in the
hybridization buffer may be reduced in 5% increments from 50% to 0% to identify clones having
decreasing levels of identity to the probe. Following hybridization, the filter may be washed with
6X SSC, 0.5% SDS at 50°C. These conditions are considered to be "moderate" conditions above
25% formamide and "low" conditions below 25% formamide. cDNAs or genomic DNAs which
have hybridized to the probe are identified by autoradiography or other conventional techniques.

Note that variations in the above conditions may be accomplished through the inclusion
and/or substitution of alternate blocking reagents used to suppress background in hybridization
experiments. Typical blocking reagents include Denhardt's reagent, BLOTTO, heparin, denatured
salmon sperm DNA, and commercially available proprietary formulations. The inclusion of specific
blocking reagents may require modification of the hybridization conditions described above, due to
problems with compatibility.
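
The stepwise reduction in stringency described above can be laid out as two simple series. The following Python sketch lists the hybridization temperatures (5°C steps downward from 68°C) and the formamide concentrations (5% steps from 50% to 0%) together with the "moderate" or "low" label the text assigns. The function names are hypothetical. A value that falls exactly on a boundary (50°C or 25% formamide) is reported as "unspecified", because the text classifies only values above or below those boundaries.

```python
def label(value, threshold):
    """Classify a condition relative to the boundary given in the text."""
    if value > threshold:
        return "moderate"
    if value < threshold:
        return "low"
    return "unspecified"  # the text does not classify the boundary value itself


def temperature_series(start=68, stop=42, step=5, threshold=50):
    """Hybridization temperatures (degrees C) in a buffer of about 1 M sodium."""
    return [(t, label(t, threshold)) for t in range(start, stop - 1, -step)]


def formamide_series(start=50, stop=0, step=5, threshold=25):
    """Formamide percentages for hybridization at 42 degrees C."""
    return [(p, label(p, threshold)) for p in range(start, stop - 1, -step)]


if __name__ == "__main__":
    for temp, kind in temperature_series():
        print(f"{temp} C: {kind}")
    for pct, kind in formamide_series():
        print(f"{pct}% formamide: {kind}")
```

With 5°C steps starting at 68°C, the last temperature reached within the stated range is 43°C rather than 42°C.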

Consequently, the present invention encompasses methods of isolating nucleic acids similar
to the polynucleotides of the invention, comprising the steps of:

a) contacting a collection of cDNA or genomic DNA molecules with a detectable probe
comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40 or 50 consecutive nucleotides of a sequence
selected from the group consisting of the sequences of SEQ ID NOs: 1-169, 339-455, 561-784, the
sequences of clone inserts of the deposited clone pool and sequences complementary thereto under
stringent, moderate or low conditions which permit said probe to hybridize to at least a cDNA or
genomic DNA molecule in said collection;

b) identifying said cDNA or genomic DNA molecule which hybridizes to said detectable
probe; and

c) isolating said cDNA or genomic DNA molecule which hybridized to said probe.

PCR-based methods 

In addition to the above described methods, other protocols are available to obtain
homologous cDNAs using GENSET cDNA of the present invention or fragment thereof as outlined
in the following paragraphs.

cDNAs may be prepared by obtaining mRNA from the tissue, cell, or organism of interest
using mRNA preparation procedures utilizing polyA selection procedures or other techniques
known to those skilled in the art. A first primer capable of hybridizing to the polyA tail of the
mRNA is hybridized to the mRNA and a reverse transcription reaction is performed to generate a
first cDNA strand.

The term "capable of hybridizing to the polyA tail of said mRNA" refers to and embraces
all primers containing stretches of thymidine residues, so-called oligo(dT) primers, that hybridize to
the 3' end of eukaryotic poly(A)+ mRNAs to prime the synthesis of a first cDNA strand.
Techniques for generating said oligo(dT) primers and hybridizing them to mRNA to subsequently
prime the reverse transcription of said hybridized mRNA to generate a first cDNA strand are well
known to those skilled in the art and are described in Current Protocols in Molecular Biology, John
Wiley and Sons, Inc. 1997 and Sambrook et al., 1989. Preferably, said oligo(dT) primers are
present in a large excess in order to allow the hybridization of all mRNA 3' ends to at least one
oligo(dT) molecule. The priming and reverse transcription steps are preferably performed between
37°C and 55°C depending on the type of reverse transcriptase used. Preferred oligo(dT) primers for
priming reverse transcription of mRNAs are oligonucleotides containing a stretch of thymidine
residues of sufficient length to hybridize specifically to the polyA tail of mRNAs, preferably of 12
to 18 thymidine residues in length. More preferably, such oligo(dT) primers comprise an additional
sequence upstream of the poly(dT) stretch in order to allow the addition of a given sequence to the
5' end of all first cDNA strands which may then be used to facilitate subsequent manipulation of the
cDNA. Preferably, this added sequence is 8 to 60 residues in length. For instance, the addition of a
restriction site at the 5' end of cDNAs facilitates subcloning of the obtained cDNA. Alternatively,
such an added 5' end may also be used to design PCR primers to specifically amplify cDNA clones
of interest.
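
The primer layout described above, an optional 5' added sequence followed by a stretch of thymidine residues, can be sketched as a small Python helper. The function name `oligo_dt_primer` is hypothetical, and the NotI recognition site GCGGCCGC is used purely as an example of an added restriction site. The helper enforces the preferred ranges from the text (12 to 18 thymidines, added sequence of 8 to 60 residues) only for illustration, since the text presents them as preferences rather than strict limits.

```python
def oligo_dt_primer(five_prime_tag="", dt_length=18):
    """Build an oligo(dT) primer, written 5' to 3', with an optional 5' added sequence."""
    tag = five_prime_tag.upper()
    if not 12 <= dt_length <= 18:
        raise ValueError("preferred thymidine stretch is 12 to 18 residues")
    if tag and not 8 <= len(tag) <= 60:
        raise ValueError("preferred added sequence is 8 to 60 residues")
    if set(tag) - set("ACGT"):
        raise ValueError("added sequence must contain only A, C, G or T")
    return tag + "T" * dt_length


if __name__ == "__main__":
    # Example: NotI site (GCGGCCGC) added upstream of a 15-residue dT stretch.
    print(oligo_dt_primer("GCGGCCGC", dt_length=15))
```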

The first cDNA strand is then hybridized to a second primer. Any polynucleotide fragment
of the invention may be used, and in particular those described in the "Oligonucleotide primers and
probes" section. This second primer contains at least 10 consecutive nucleotides of a polynucleotide
of the invention. Preferably, the primer comprises at least 10, 12, 15, 17, 18, 20, 23, 25, or 28
consecutive nucleotides of a polynucleotide of the invention. In some embodiments, the primer
comprises more than 30 nucleotides of a polynucleotide of the invention. If it is desired to obtain
cDNAs containing the full protein coding sequence, including the authentic translation initiation
site, the second primer used contains sequences located upstream of the translation initiation site.
The second primer is extended to generate a second cDNA strand complementary to the first cDNA
strand. Alternatively, RT-PCR may be performed as described above using primers from both ends
of the cDNA to be obtained.

The double stranded cDNAs made using the methods described above are isolated and
cloned. The cDNAs may be cloned into vectors such as plasmids or viral vectors capable of
replicating in an appropriate host cell. For example, the host cell may be a bacterial, mammalian,
avian, or insect cell.

Techniques for isolating mRNA, reverse transcribing a primer hybridized to mRNA to
generate a first cDNA strand, extending a primer to make a second cDNA strand complementary to
the first cDNA strand, isolating the double stranded cDNA and cloning the double stranded cDNA
are well known to those skilled in the art and are described in Current Protocols in Molecular
Biology, John Wiley & Sons, Inc. 1997 and Sambrook et al., 1989.

Consequently, the present invention encompasses methods of making cDNAs. In a first
embodiment, the method of making a cDNA comprises the steps of:

a) contacting a collection of mRNA molecules from human cells with a primer comprising
at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence selected
from the group consisting of the sequences complementary to SEQ ID NOs: 1-169, 339-455,
561-784 and sequences complementary to a clone insert of the deposited clone pool;

b) hybridizing said primer to an mRNA in said collection;

c) reverse transcribing said hybridized primer to make a first cDNA strand from said
mRNA;

d) making a second cDNA strand complementary to said first cDNA strand; and

e) isolating the resulting cDNA comprising said first cDNA strand and said second cDNA
strand.

Another embodiment of the present invention is a purified cDNA obtainable by the method
of the preceding paragraph. In one aspect of this embodiment, the cDNA encodes at least a portion
of a human polypeptide.

In a second embodiment, the method of making a cDNA comprises the steps of:

a) contacting a collection of mRNA molecules from human cells with a first primer capable
of hybridizing to the polyA tail of said mRNA;

b) hybridizing said first primer to said polyA tail;

c) reverse transcribing said mRNA to make a first cDNA strand;

d) making a second cDNA strand complementary to said first cDNA strand using at least
one primer comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of
a sequence selected from the group consisting of SEQ ID NOs: 1-169, 339-455, 561-784 and
sequences of clone inserts of the deposited clone pool; and

e) isolating the resulting cDNA comprising said first cDNA strand and said second cDNA
strand.

In another aspect of this method the second cDNA strand is made by:

a) contacting said first cDNA strand with a second primer comprising at least 12, 15, 18, 20,
23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence selected from the group consisting
of SEQ ID NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited clone
pool, and a third primer which sequence is fully included within the sequence of said first primer;

b) performing a first polymerase chain reaction with said second and third primers to
generate a first PCR product;

c) contacting said first PCR product with a fourth primer, comprising at least 12, 15, 18, 20,
23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of said sequence selected from the group
consisting of SEQ ID NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited
clone pool, and a fifth primer, which sequence is fully included within the sequence of said third
primer, wherein said fourth and fifth primers hybridize to sequences within said first PCR product;
and

d) performing a second polymerase chain reaction, thereby generating a second PCR
product.

Alternatively, the second cDNA strand may be made by contacting said first cDNA strand
with a second primer comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive
nucleotides of a sequence selected from the group consisting of SEQ ID NOs: 1-169, 339-455,
561-784 and sequences of clone inserts of the deposited clone pool, and a third primer which
sequence is fully included within the sequence of said first primer and performing a polymerase
chain reaction with said second and third primers to generate said second cDNA strand.

Alternatively, the second cDNA strand may be made by:

a) contacting said first cDNA strand with a second primer comprising at least 12, 15, 18, 20,
23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence selected from the group consisting
of SEQ ID NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited clone
pool;

b) hybridizing said second primer to said first strand cDNA; and

c) extending said hybridized second primer to generate said second cDNA strand.

Another embodiment of the present invention is a purified cDNA obtainable by a method of
making a cDNA of the invention. In one aspect of this embodiment, said cDNA encodes at least a
portion of a human polypeptide.

Other protocols

Alternatively, other procedures may be used for obtaining homologous cDNAs. In one
approach, cDNAs are prepared from mRNA and cloned into double stranded phagemids as follows.
The cDNA library in the double stranded phagemids is then rendered single stranded by treatment
with an endonuclease, such as the Gene II product of the phage F1, and an exonuclease (Chang et
al., 1993, which disclosure is hereby incorporated by reference in its entirety). A biotinylated
oligonucleotide comprising the sequence of a fragment of a known GENSET cDNA, genomic DNA
or fragment thereof is hybridized to the single stranded phagemids. Preferably, the fragment
comprises at least 10, 12, 15, 17, 18, 20, 23, 25, or 28 consecutive nucleotides of a sequence
selected from the group consisting of the sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and
sequences of clone inserts of the deposited clone pool.

Hybrids between the biotinylated oligonucleotide and phagemids are isolated by incubating
the hybrids with streptavidin coated paramagnetic beads and retrieving the beads with a magnet
(Fry et al., 1992, which disclosure is hereby incorporated by reference in its entirety). Thereafter,
the resulting phagemids are released from the beads and converted into double stranded DNA using
a primer specific for the GENSET cDNA or fragment used to design the biotinylated
oligonucleotide. Alternatively, protocols such as the Gene Trapper kit (Gibco BRL), which
disclosure is hereby incorporated by reference in its entirety, may be used. The
resulting double stranded DNA is transformed into bacteria. cDNAs homologous to the GENSET
cDNA or fragment thereof are identified by colony PCR or colony hybridization.
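
As a minimal illustration of the fragment-length preference stated for the biotinylated
oligonucleotide, the following Python sketch checks whether a candidate oligonucleotide shares
at least a chosen number of consecutive nucleotides with a reference cDNA sequence. The function
names, the default threshold and any sequences supplied to it are hypothetical and are not part
of the disclosed protocol.

```python
def longest_shared_run(oligo: str, reference: str) -> int:
    """Length of the longest stretch of consecutive nucleotides common to both sequences."""
    oligo, reference = oligo.upper(), reference.upper()
    best = 0
    # prev[j] holds the length of the common run ending at oligo[i-2] and reference[j-1].
    prev = [0] * (len(reference) + 1)
    for i in range(1, len(oligo) + 1):
        curr = [0] * (len(reference) + 1)
        for j in range(1, len(reference) + 1):
            if oligo[i - 1] == reference[j - 1]:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def is_suitable_probe(oligo: str, reference: str, min_consecutive: int = 15) -> bool:
    """True if the oligo contains at least min_consecutive nucleotides of the reference."""
    return longest_shared_run(oligo, reference) >= min_consecutive
```

The threshold argument can be set to any of the preferred lengths listed above (10, 12, 15, and
so on); the sketch only compares one strand and does not model hybridization stringency.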

As a chromosome marker 

The chromosomal localizations of the cDNAs of the present invention were determined using
information from public and proprietary databases. Table II lists the putative chromosomal location
of the polynucleotides of the present invention. Column one lists the sequence identification number
with the corresponding chromosomal location listed in column two. Thus, the present invention
also relates to methods and compositions using the chromosomal location of the polynucleotides of
the invention to construct a human high resolution map or to identify a given chromosome in a
sample using any techniques known to those skilled in the art including those disclosed below.

GENSET polynucleotides may also be mapped to their chromosomal locations using any
methods or techniques known to those skilled in the art including radiation hybrid (RH) mapping,
PCR-based mapping and fluorescence in situ hybridization (FISH) mapping described below.

Radiation hybrid mapping 

Radiation hybrid (RH) mapping is a somatic cell genetic approach that can be used for high
resolution mapping of the human genome. In this approach, cell lines containing one or more
human chromosomes are lethally irradiated, breaking each chromosome into fragments whose size
depends on the radiation dose. These fragments are rescued by fusion with cultured rodent cells,
yielding subclones containing different fragments of the human genome. This technique is
described by Benham et al. (1989) and Cox et al. (1990), which disclosures are hereby
incorporated by reference in their entireties. The random and independent nature of the subclones
permits efficient mapping of any human genome marker. Human DNA isolated from a panel of
80-100 cell lines provides a mapping reagent for ordering GENSET cDNAs or genomic DNAs. In this
approach, the frequency of breakage between markers is used to measure distance, allowing
construction of fine resolution maps as has been done using conventional ESTs (Schuler et al.,
1996), which disclosure is hereby incorporated by reference in its entirety.
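
The use of breakage frequency as a distance measure can be sketched as follows. This Python
example assumes a simple two-point estimator of the kind used in the radiation hybrid
literature, in which the number of hybrids retaining only one of two markers is normalized by the
marker retention frequencies, and the breakage frequency theta is converted to a map distance of
-ln(1 - theta) Rays. The function names and the 0/1 retention scoring are illustrative
assumptions, not a description of the analysis actually performed for the present sequences.

```python
import math


def breakage_frequency(marker_a, marker_b):
    """Two-point estimate of the breakage frequency (theta) between two markers.

    marker_a, marker_b: sequences of 0/1 retention scores, one entry per hybrid cell line.
    """
    if len(marker_a) != len(marker_b) or len(marker_a) == 0:
        raise ValueError("markers must be scored on the same non-empty panel")
    n = len(marker_a)
    # Hybrids retaining exactly one of the two markers indicate a break between them.
    discordant = sum(1 for a, b in zip(marker_a, marker_b) if a != b)
    r_a = sum(marker_a) / n  # retention frequency of marker A
    r_b = sum(marker_b) / n  # retention frequency of marker B
    denominator = n * (r_a + r_b - 2 * r_a * r_b)
    if denominator == 0:
        raise ValueError("retention frequencies carry no linkage information")
    return min(discordant / denominator, 1.0)


def centirays(theta):
    """Convert a breakage frequency into a map distance in centiRays."""
    if theta >= 1.0:
        return math.inf
    return -100.0 * math.log(1.0 - theta)
```

Markers with identical retention patterns give a distance of zero, while markers that are
retained independently give a breakage frequency near 1 and cannot be ordered relative to one
another with this panel.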

RH mapping has been used to generate a high-resolution whole genome radiation hybrid
map of human chromosome 17q22-q25.3 across the genes for growth hormone (GH) and thymidine
kinase (TK) (Foster et al., 1996), the region surrounding the Gorlin syndrome gene (Obermayr et
al., 1996), 60 loci covering the entire short arm of chromosome 12 (Raeymaekers et al., 1995), the
region of human chromosome 22 containing the neurofibromatosis type 2 locus (Frazer et al., 1992)
and 13 loci on the long arm of chromosome 5 (Warrington et al., 1991), which disclosures are
hereby incorporated by reference in their entireties.

Mapping of cDNAs to Human Chromosomes using PCR techniques 

GENSET cDNAs and genomic DNAs may be assigned to human chromosomes using PCR
based methodologies. In such approaches, oligonucleotide primer pairs are designed from the
cDNA sequence to minimize the chance of amplifying through an intron. Preferably, the
oligonucleotide primers are 18-23 bp in length and are designed for PCR amplification. The
creation of PCR primers from known sequences is well known to those with skill in the art. For a
review of PCR technology see Erlich (1992), which disclosure is hereby incorporated by reference
in its entirety.
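
The selection of a primer pair within the preferred 18-23 bp range can be sketched in Python
as below. The function names, the default primer length and the coordinates passed in are
hypothetical. The sketch simply takes the two ends of a chosen cDNA region; it does not attempt to
keep both primers within a single exon, which would require genomic sequence information, and it
ignores melting temperature and secondary structure.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def primer_pair(cdna: str, start: int, end: int, length: int = 20):
    """Return (forward, reverse) primers flanking the region cdna[start:end]."""
    if not 18 <= length <= 23:
        raise ValueError("primer length outside the preferred 18-23 bp range")
    region = cdna.upper()[start:end]
    if len(region) < 2 * length:
        raise ValueError("region too short for two non-overlapping primers")
    forward = region[:length]                       # same strand as the cDNA
    reverse = reverse_complement(region[-length:])  # anneals to the sense strand
    return forward, reverse
```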

The primers are used in polymerase chain reactions (PCR) to amplify templates from total
human genomic DNA. PCR conditions are as follows: 60 ng of genomic DNA is used as a template
for PCR with 80 ng of each oligonucleotide primer, 0.6 unit of Taq polymerase, and 1 uCi of a
32P-labeled deoxycytidine triphosphate. The PCR is performed in a microplate thermocycler (Techne)
under the following conditions: 30 cycles of 94 degrees Celsius, 1.4 min; 55 degrees Celsius, 2 min;
and 72 degrees Celsius, 2 min; with a final extension at 72 degrees Celsius for 10 min. The
amplified products are analyzed on a 6% polyacrylamide sequencing gel and visualized by
autoradiography. If the length of the resulting PCR product is identical to the distance between the
ends of the primer sequences in the cDNA from which the primers are derived, then the PCR
reaction is repeated with DNA templates from two panels of human-rodent somatic cell hybrids,
BIOS PCRable DNA (BIOS Corporation) and NIGMS Human-Rodent Somatic Cell Hybrid
Mapping Panel Number 1 (NIGMS, Camden, NJ).
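
The length comparison described above can be illustrated with a short Python sketch that
computes the distance between the outer ends of the two primer sequences in the cDNA and compares
it with the product length read from the gel. The function names and the optional tolerance are
assumptions introduced for illustration only.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def expected_product_length(cdna: str, forward: str, reverse: str) -> int:
    """Distance in bp between the outer ends of the two primer sites in the cDNA."""
    cdna = cdna.upper()
    fwd_start = cdna.find(forward.upper())
    if fwd_start == -1:
        raise ValueError("forward primer not found in cDNA")
    # The reverse primer anneals to the sense strand, so its site reads as its reverse complement.
    rev_site = reverse_complement(reverse)
    rev_start = cdna.find(rev_site, fwd_start + len(forward))
    if rev_start == -1:
        raise ValueError("reverse primer site not found downstream of forward primer")
    return rev_start + len(rev_site) - fwd_start


def product_matches_cdna(observed_bp: int, expected_bp: int, tolerance_bp: int = 0) -> bool:
    """True if the observed product length agrees with the cDNA-derived expectation."""
    return abs(observed_bp - expected_bp) <= tolerance_bp
```

A product longer than expected would be consistent with amplification across an intron in the
genomic template, which is the situation the primer design is intended to avoid.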

PCR is used to screen a series of somatic cell hybrid cell lines containing defined sets of
human chromosomes for the presence of a given cDNA or genomic DNA. DNA is isolated from
the somatic hybrids and used as starting templates for PCR reactions using the primer pairs from the
GENSET cDNAs or genomic DNAs. Only those somatic cell hybrids with chromosomes
containing the human gene corresponding to the GENSET cDNA or genomic DNA will yield an
amplified fragment. The GENSET cDNAs or genomic DNAs are assigned to a chromosome by
analysis of the segregation pattern of PCR products from the somatic hybrid DNA templates. The
single human chromosome present in all cell hybrids that give rise to an amplified fragment is the
chromosome containing that GENSET cDNA or genomic DNA. For a review of techniques and
analysis of results from somatic cell gene mapping experiments, see Ledbetter et al. (1990), which
disclosure is hereby incorporated by reference in its entirety.
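
The segregation analysis can be sketched in Python as a set intersection over the hybrid
panel. The data structures and names below are hypothetical. In addition to the criterion stated
above (the chromosome is present in every PCR-positive hybrid), the sketch also excludes
chromosomes found in PCR-negative hybrids; that second filter is an assumption added here to
resolve ties and is not taken from the text.

```python
def assign_chromosome(hybrid_chromosomes, pcr_positive):
    """Return the candidate human chromosome(s) for a marker.

    hybrid_chromosomes: dict mapping hybrid name -> set of human chromosomes it retains.
    pcr_positive: dict mapping hybrid name -> True if the marker amplified from that hybrid.
    """
    positives = [set(hybrid_chromosomes[h]) for h, hit in pcr_positive.items() if hit]
    negatives = [set(hybrid_chromosomes[h]) for h, hit in pcr_positive.items() if not hit]
    if not positives:
        return []
    # Chromosomes present in every hybrid that yields an amplified fragment.
    candidates = set.intersection(*positives)
    # Assumed extra filter: drop chromosomes also present in hybrids that fail to amplify.
    for chromosomes in negatives:
        candidates -= chromosomes
    return sorted(candidates)
```

A single returned chromosome corresponds to an unambiguous assignment; an empty or multi-element
result suggests that more hybrids are needed or that the panel scoring should be rechecked.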

Mapping of cDNAs to Chromosomes Using Fluorescence in situ Hybridization 

Fluorescence in situ hybridization (FISH) allows the GENSET cDNA or genomic DNA to
be mapped to a particular location on a given chromosome. The chromosomes to be used for
fluorescence in situ hybridization techniques may be obtained from a variety of sources including
cell cultures, tissues, or whole blood.

In a preferred embodiment, chromosomal localization of a GENSET cDNA or genomic
DNA is obtained by FISH as described by Cherif et al. (1990), which disclosure is hereby
incorporated by reference in its entirety. Metaphase chromosomes are prepared from
phytohemagglutinin (PHA)-stimulated blood cell donors. PHA-stimulated lymphocytes from
healthy males are cultured for 72 h in RPMI-1640 medium. For synchronization, methotrexate (10
uM) is added for 17 h, followed by addition of 5-bromodeoxyuridine (5-BudR, 0.1 mM) for 6 h.
Colcemid (1 ug/ml) is added for the last 15 min before harvesting the cells. Cells are collected,
washed in RPMI, incubated with a hypotonic solution of KCl (75 mM) at 37 degrees Celsius for 15
min and fixed in three changes of methanol:acetic acid (3:1). The cell suspension is dropped onto a
glass slide and air dried. The GENSET cDNA or genomic DNA is labeled with biotin-16 dUTP by
nick translation according to the manufacturer's instructions (Bethesda Research Laboratories,
Bethesda, MD), purified using a Sephadex G-50 column (Pharmacia, Uppsala, Sweden) and
precipitated. Just prior to hybridization, the DNA pellet is dissolved in hybridization buffer (50%
formamide, 2 X SSC, 10% dextran sulfate, 1 mg/ml sonicated salmon sperm DNA, pH 7) and the
probe is denatured at 70 degrees Celsius for 5-10 min.

Slides kept at -20 degrees Celsius are treated for 1 h at 37 degrees Celsius with RNase A
(100 ug/ml), rinsed three times in 2 X SSC and dehydrated in an ethanol series. Chromosome
preparations are denatured in 70% formamide, 2 X SSC for 2 min at 70 degrees Celsius, then
dehydrated at 4 degrees Celsius. The slides are treated with proteinase K (10 ug/100 ml in 20 mM
Tris-HCl, 2 mM CaCl2) at 37 degrees Celsius for 8 min and dehydrated. The hybridization mixture
containing the probe is placed on the slide, covered with a coverslip, sealed with rubber cement and
incubated overnight in a humid chamber at 37 degrees Celsius. After hybridization and
post-hybridization washes, the biotinylated probe is detected by avidin-FITC and amplified with
additional layers of biotinylated goat anti-avidin and avidin-FITC. For chromosomal localization,
fluorescent R-bands are obtained as previously described (Cherif et al., 1990). The slides are
observed under a LEICA fluorescence microscope (DMRXA). Chromosomes are counterstained
with propidium iodide and the fluorescent signal of the probe appears as two symmetrical
yellow-green spots on both chromatids of the fluorescent R-band chromosome (red). Thus, a particular
GENSET cDNA or genomic DNA may be localized to a particular cytogenetic R-band on a given
chromosome.

Use of cDNAs to Construct or Expand Chromosome Maps

Once the GENSET cDNAs or genomic DNAs have been assigned to particular
chromosomes using any technique known to those skilled in the art,
particularly those described herein, they may be utilized to construct a high resolution map of the
chromosomes on which they are located or to identify the chromosomes in a sample.

Chromosome mapping involves assigning a given unique sequence to a particular
chromosome as described above. Once the unique sequence has been mapped to a given
chromosome, it is ordered relative to other unique sequences located on the same chromosome.
One approach to chromosome mapping utilizes a series of yeast artificial chromosomes (YACs)
bearing several thousand long inserts derived from the chromosomes of the organism from which
the GENSET cDNAs or genomic DNAs are obtained. This approach is described in Nagaraja et al.
(1991), which disclosure is hereby incorporated by reference in its entirety. Briefly, in this
approach each chromosome is broken into overlapping pieces which are inserted into the YAC
vector. The YAC inserts are screened using PCR or other methods to determine whether they
include the GENSET cDNA or genomic DNA whose position is to be determined. Once an insert
has been found which includes the GENSET cDNA or genomic DNA, the insert can be analyzed by
PCR or other methods to determine whether the insert also contains other sequences known to be on
the chromosome or in the region from which the GENSET cDNA or genomic DNA was derived.
This process can be repeated for each insert in the YAC library to determine the location of each of
the GENSET cDNAs or genomic DNAs relative to one another and to other known chromosomal
markers. In this way, a high resolution map of the distribution of numerous unique markers along
each of the organism's chromosomes may be obtained.
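
The YAC screening logic can be sketched in Python as a tally of which known markers share
inserts with the sequence being positioned. The dictionary layout and all names are hypothetical,
and the sketch stops at counting shared inserts; it does not attempt the full marker ordering
that a mapping project would perform.

```python
from collections import defaultdict


def co_located_markers(yac_content, query):
    """Count, for each other marker, the YAC inserts it shares with the query marker.

    yac_content: dict mapping YAC clone name -> set of markers detected in its insert.
    query: name of the cDNA or genomic DNA marker whose position is sought.
    """
    shared = defaultdict(int)
    for markers in yac_content.values():
        if query in markers:
            for marker in markers:
                if marker != query:
                    shared[marker] += 1
    return dict(shared)
```

Markers that share many inserts with the query are likely to lie close to it, since overlapping
inserts cover neighboring stretches of the chromosome.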

Identification of genes associated with hereditary diseases or drug response

This example illustrates an approach useful for the association of GENSET cDNAs or
genomic DNAs with particular phenotypic characteristics. In this example, a particular GENSET
cDNA or genomic DNA is used as a test probe to associate that GENSET cDNA or genomic DNA
with a particular phenotypic characteristic.

GENSET cDNAs or genomic DNAs are mapped to a particular location on a human
chromosome using techniques such as those described herein or other techniques known in the art.
A search of Mendelian Inheritance in Man (V. McKusick, Mendelian Inheritance in Man; available
on line through Johns Hopkins University Welch Medical Library) reveals the region of the human
chromosome which contains the GENSET cDNA or genomic DNA to be a very gene rich region
containing several known genes and several diseases or phenotypes for which genes have not been
identified. The gene corresponding to this GENSET cDNA or genomic DNA thus becomes an
immediate candidate for each of these genetic diseases.



WO 02/083898 PCT/IB01/00914

Cells from patients with these diseases or phenotypes are isolated and expanded in culture.
PCR primers from the GENSET cDNA or genomic DNA are used to screen genomic DNA, mRNA
or cDNA obtained from the patients. GENSET cDNAs or genomic DNAs that are not amplified in
the patients can be positively associated with a particular disease by further analysis. Alternatively,
the PCR analysis may yield fragments of different lengths when the sample is derived from an
individual having the phenotype associated with the disease than when the sample is derived from a
healthy individual, indicating that the gene containing the cDNA may be responsible for the genetic
disease.
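
The two outcomes described above (no amplification in the patient sample, or a fragment of
different length) can be expressed as a small Python classification helper. The function name,
the use of None for "no product" and the tolerance parameter are assumptions made for
illustration; any flagged result would still require the further analysis mentioned in the text.

```python
def compare_amplification(patient_bp, control_bp, tolerance_bp=0):
    """Classify a patient PCR result against a healthy control.

    patient_bp, control_bp: observed fragment length in bp, or None if no product was seen.
    """
    if control_bp is None:
        raise ValueError("the control sample must amplify for the comparison to be meaningful")
    if patient_bp is None:
        return "not amplified in patient"
    if abs(patient_bp - control_bp) > tolerance_bp:
        return "fragment length differs"
    return "no difference detected"
```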

Uses of polynucleotides in recombinant vectors 

The present invention also relates to recombinant vectors including the isolated
polynucleotides of the present invention, and to host cells recombinant for a polynucleotide of the
invention, such as the above vectors, as well as to methods of making such vectors and host cells
and for using them for production of GENSET polypeptides by recombinant techniques.

Recombinant Vectors 

The term "vector" is used herein to designate either a circular or a linear DNA or RNA
molecule, which is either double-stranded or single-stranded, and which comprises at least one
polynucleotide of interest that is sought to be transferred in a cell host or in a unicellular or
multicellular host organism. The present invention encompasses a family of recombinant vectors
that comprise a regulatory polynucleotide and/or a coding polynucleotide derived from either the
GENSET genomic sequence or the cDNA sequence. Generally, a recombinant vector of the
invention may comprise any of the polynucleotides described herein, including regulatory
sequences, coding sequences and polynucleotide constructs, as well as any GENSET primer or
probe as defined herein.

In a first preferred embodiment, a recombinant vector of the invention is used to amplify
the inserted polynucleotide derived from a GENSET genomic sequence or a GENSET cDNA, for
example any cDNA selected from the group consisting of sequences of SEQ ID NOs:1-169,
339-455, 561-784, sequences of clone inserts of the deposited clone pool, variants and fragments
thereof, in a suitable cell host, this polynucleotide being amplified every time that the recombinant
vector replicates.

A second preferred embodiment of the recombinant vectors according to the invention
comprises expression vectors comprising either a regulatory polynucleotide or a coding nucleic acid
of the invention, or both. Within certain embodiments, expression vectors are employed to express
a GENSET polypeptide which can be then purified and, for example, be used in ligand screening
assays or as an immunogen in order to raise specific antibodies directed against the GENSET
protein. In other embodiments, the expression vectors are used for constructing transgenic animals
and also for gene therapy. Expression requires that appropriate signals are provided in the vectors,
said signals including various regulatory elements, such as enhancers/promoters from both viral and
mammalian sources that drive expression of the genes of interest in host cells. Dominant drug
selection markers for establishing permanent, stable cell clones expressing the products are
generally included in the expression vectors of the invention, as they are elements that link
expression of the drug selection markers to expression of the polypeptide.

More particularly, the present invention relates to expression vectors which include nucleic
acids encoding a GENSET protein, preferably a GENSET protein with an amino acid sequence
selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785-918,
sequences of polypeptides encoded by the clone inserts of the deposited clone pool, variants and
fragments thereof. The polynucleotides of the present invention may be used to express an encoded
protein in a host organism to produce a beneficial effect. In such procedures, the encoded protein
may be transiently expressed in the host organism or stably expressed in the host organism. The
encoded protein may have any of the activities described herein. The encoded protein may be a
protein which the host organism lacks or, alternatively, the encoded protein may augment the
existing levels of the protein in the host organism.

Some of the elements which can be found in the vectors of the present invention are
described in further detail in the following sections.

General features of the expression vectors of the invention 

A recombinant vector according to the invention comprises, but is not limited to, a YAC
(Yeast Artificial Chromosome), a BAC (Bacterial Artificial Chromosome), a phage, a phagemid, a
cosmid, a plasmid or even a linear DNA molecule which may comprise a chromosomal,
non-chromosomal, semi-synthetic and synthetic DNA. Such a recombinant vector can comprise a
transcriptional unit comprising an assembly of:

(1) a genetic element or elements having a regulatory role in gene expression, for example
promoters or enhancers. Enhancers are cis-acting elements of DNA, usually from about 10 to 300
bp in length, that act on the promoter to increase the transcription.

(2) a structural or coding sequence which is transcribed into mRNA and eventually
translated into a polypeptide, said structural or coding sequence being operably linked to the
regulatory elements described in (1); and

(3) appropriate transcription initiation and termination sequences. Structural units intended
for use in yeast or eukaryotic expression systems preferably include a leader sequence enabling
extracellular secretion of translated protein by a host cell. Alternatively, when a recombinant
protein is expressed without a leader or transport sequence, it may include an N-terminal residue.
This residue may or may not be subsequently cleaved from the expressed recombinant protein to
provide a final product.

Generally, recombinant expression vectors will include origins of replication, selectable
markers permitting transformation of the host cell, and a promoter derived from a highly expressed
gene to direct transcription of a downstream structural sequence. The heterologous structural
sequence is assembled in appropriate phase with translation initiation and termination sequences,
and preferably a leader sequence capable of directing secretion of the translated protein into the
periplasmic space or the extracellular medium. In a specific embodiment wherein the vector is
adapted for transfecting and expressing desired sequences in mammalian host cells, preferred
vectors will comprise an origin of replication in the desired host, a suitable promoter and enhancer,
and also any necessary ribosome binding sites, polyadenylation signals, splice donor and acceptor
sites, transcriptional termination sequences, and 5'-flanking non-transcribed sequences. DNA
sequences derived from the SV40 viral genome, for example SV40 origin, early promoter,
enhancer, splice and polyadenylation signals may be used to provide the required non-transcribed
genetic elements.

The in vivo expression of a GENSET polypeptide of the present invention may be useful in
order to correct a genetic defect related to the expression of the native gene in a host organism, for
the treatment or prevention of any disease or condition that can be treated or prevented by
increasing the level of GENSET polypeptide expression, or to the production of a biologically
inactive GENSET protein. Consequently, the present invention also comprises recombinant
expression vectors mainly designed for the in vivo production of a GENSET polypeptide of the
present invention by the introduction of the appropriate genetic material in the organism or the
patient to be treated. This genetic material may be introduced in vitro in a cell that has been
previously extracted from the organism, the modified cell being subsequently reintroduced in the
said organism, or directly in vivo into the appropriate tissue.

Regulatory Elements

The suitable promoter regions used in the expression vectors according to the present
invention are chosen taking into account the cell host in which the heterologous gene has to be
expressed. The particular promoter employed to control the expression of a nucleic acid sequence
of interest is not believed to be important, so long as it is capable of directing the expression of the
nucleic acid in the targeted cell. Thus, where a human cell is targeted, it is preferable to position the
nucleic acid coding region adjacent to and under the control of a promoter that is capable of being
expressed in a human cell, such as, for example, a human or a viral promoter.

A suitable promoter may be heterologous with respect to the nucleic acid for which it
controls the expression or alternatively can be endogenous to the native polynucleotide containing
the coding sequence to be expressed. Additionally, the promoter is generally heterologous with
respect to the recombinant vector sequences within which the construct promoter/coding sequence
has been inserted.

Promoter regions can be selected from any desired gene using, for example, CAT
(chloramphenicol acetyltransferase) vectors and more preferably pKK232-8 and pCM7 vectors.

Preferred bacterial promoters are the LacI, LacZ, the T3 or T7 bacteriophage RNA
polymerase promoters, the gpt, lambda PR, PL and trp promoters (EP 0036776), the polyhedrin
promoter, or the p10 protein promoter from baculovirus (Kit Novagen) (Smith et al., 1983;
O'Reilly et al., 1992; which disclosures are hereby incorporated by reference in their entireties), the
lambda PR promoter or also the trc promoter.

Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and late
SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of a convenient vector and
promoter is well within the level of ordinary skill in the art. The choice of a promoter is well within
the ability of a person skilled in the field of genetic engineering. For example, one may refer to the
book of Sambrook et al. (1989) or also to the procedures described by Fuller et al. (1996), which
disclosures are hereby incorporated by reference in their entireties.

Other regulatory elements 

Where a cDNA insert is employed, one will typically desire to include a polyadenylation
signal to effect proper polyadenylation of the gene transcript. The nature of the polyadenylation
signal is not believed to be crucial to the successful practice of the invention, and any such sequence
may be employed such as human growth hormone and SV40 polyadenylation signals. Also
contemplated as an element of the expression cassette is a terminator. These elements can serve to
enhance message levels and to minimize read through from the cassette into other sequences.

Selectable Markers 

Selectable markers confer an identifiable change to the cell permitting easy identification of
cells containing the expression construct. The selectable marker genes for selection of transformed
host cells are preferably dihydrofolate reductase or neomycin resistance for eukaryotic cell culture,
TRP1 for S. cerevisiae or tetracycline, rifampicin or ampicillin resistance in E. coli, or levan
saccharase for mycobacteria, this latter marker being a negative selection marker.

Preferred Vectors 
Bacterial vectors 

As a representative but non-limiting example, useful expression vectors for bacterial use
can comprise a selectable marker and a bacterial origin of replication derived from commercially
available plasmids comprising genetic elements of pBR322 (ATCC 37017). Such commercial
vectors include, for example, pKK223-3 (Pharmacia, Uppsala, Sweden), and pGEM1 (Promega
Biotec, Madison, WI, USA).

Large numbers of other suitable vectors are known to those of skill in the art, and
commercially available, such as the following bacterial vectors: pQE70, pQE60, pQE-9 (Qiagen),
pbs, pD10, phagescript, psiX174, pbluescript SK, pbsks, pNH8A, pNH16A, pNH18A, pNH46A
(Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia); pWLNEO, pSV2CAT,
pOG44, pXT1, pSG (Stratagene); pSVK3, pBPV, pMSG, pSVL (Pharmacia); pQE-30
(QIAexpress).

The P1 bacteriophage vector may contain large inserts ranging from about 80 to about 100
kb. The construction of P1 bacteriophage vectors such as p158 or p158/neo8 is notably described
by Sternberg (1992, 1994), which disclosure is hereby incorporated by reference in its entirety.
Recombinant P1 clones comprising GENSET nucleotide sequences may be designed for inserting
large polynucleotides of more than 40 kb (See Linton et al., 1993), which disclosure is hereby
incorporated by reference in its entirety. To generate P1 DNA for transgenic experiments, a
preferred protocol is the protocol described by McCormick et al. (1994), which disclosure is hereby
incorporated by reference in its entirety. Briefly, E. coli (preferably strain NS3529) harboring the
P1 plasmid are grown overnight in a suitable broth medium containing 25 ug/ml of kanamycin.
The P1 DNA is prepared from the E. coli by alkaline lysis using the Qiagen Plasmid Maxi kit
(Qiagen, Chatsworth, CA, USA), according to the manufacturer's instructions. The P1 DNA is
purified from the bacterial lysate on two Qiagen-tip 500 columns, using the washing and elution
buffers contained in the kit. A phenol/chloroform extraction is then performed before precipitating
the DNA with 70% ethanol. After solubilizing the DNA in TE (10 mM Tris-HCl, pH 7.4, 1 mM
EDTA), the concentration of the DNA is assessed by spectrophotometry.
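
The spectrophotometric step can be illustrated with a short Python sketch based on the
widely used convention that an absorbance of 1.0 at 260 nm over a 1 cm path corresponds to
approximately 50 ug/ml of double stranded DNA. The function name and parameters are hypothetical,
and the cited protocol may specify a different calculation.

```python
DSDNA_UG_PER_ML_PER_A260 = 50.0  # conventional conversion factor for double stranded DNA


def dsdna_concentration(a260, dilution_factor=1.0, path_length_cm=1.0):
    """Estimate double stranded DNA concentration (ug/ml) from absorbance at 260 nm."""
    if a260 < 0 or dilution_factor <= 0 or path_length_cm <= 0:
        raise ValueError("absorbance must be >= 0; dilution and path length must be > 0")
    return a260 / path_length_cm * DSDNA_UG_PER_ML_PER_A260 * dilution_factor
```

For example, a reading of 0.2 on a 50-fold dilution corresponds to roughly 500 ug/ml in the
undiluted preparation.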

When the goal is to express a P1 clone comprising GENSET polypeptide-encoding
nucleotide sequences in a transgenic animal, typically in transgenic mice, it is desirable to remove
vector sequences from the P1 DNA fragment, for example by cleaving the P1 DNA at rare-cutting
sites within the P1 polylinker (SfiI, NotI or SalI). The P1 insert is then purified from vector
sequences on a pulsed-field agarose gel, using methods similar to those originally reported for the
isolation of DNA from YACs (See e.g., Schedl et al., 1993a; Peterson et al., 1993), which
disclosures are hereby incorporated by reference in their entireties. At this stage, the resulting
purified insert DNA can be concentrated, if necessary, on a Millipore Ultrafree-MC Filter Unit
(Millipore, Bedford, MA, USA - 30,000 molecular weight limit) and then dialyzed against
microinjection buffer (10 mM Tris-HCl, pH 7.4; 250 uM EDTA) containing 100 mM NaCl, 30 uM
spermine, 70 uM spermidine on a microdialysis membrane (type VS, 0.025 um from Millipore).
The intactness of the purified P1 DNA insert is assessed by electrophoresis on 1% agarose (Sea
Kem GTG; FMC Bio-products) pulsed-field gel and staining with ethidium bromide.

Viral vectors

In one specific embodiment, the vector is derived from an adenovirus. Preferred adenovirus
vectors according to the invention are those described by Feldman and Steg (1996), or Ohno et al.
(1994), which disclosures are hereby incorporated by reference in their entireties. Another
preferred recombinant adenovirus according to this specific embodiment of the present invention is
the human adenovirus type 2 or 5 (Ad 2 or Ad 5) or an adenovirus of animal origin (French patent
application No. FR-93.05954), which disclosure is hereby incorporated by reference in its entirety.

Retrovirus vectors and adeno-associated virus vectors are generally understood to be the
recombinant gene delivery systems of choice for the transfer of exogenous polynucleotides in vivo,
particularly to mammals, including humans. These vectors provide efficient delivery of genes into
cells, and the transferred nucleic acids are stably integrated into the chromosomal DNA of the host.
Particularly preferred retroviruses for the preparation or construction of retroviral in vitro or in vivo
gene delivery vehicles of the present invention include retroviruses selected from the group
consisting of Mink-Cell Focus Inducing Virus, Murine Sarcoma Virus, Reticuloendotheliosis virus
and Rous Sarcoma virus. Particularly preferred Murine Leukemia Viruses include the 4070A and
the 1504A viruses, Abelson (ATCC No VR-999), Friend (ATCC No VR-245), Gross (ATCC No
VR-590), Rauscher (ATCC No VR-998) and Moloney Murine Leukemia Virus (ATCC No
VR-190; PCT Application No WO 94/24298). Particularly preferred Rous Sarcoma Viruses include
Bryan high titer (ATCC Nos VR-334, VR-657, VR-726, VR-659 and VR-728). Other preferred
retroviral vectors are those described in Roth et al. (1996), PCT Application No WO 93/25234,
PCT Application No WO 94/06920, Roux et al. (1989), Julan et al. (1992), and Neda et al.
(1991), which disclosures are hereby incorporated by reference in their entireties.

Yet another viral vector system that is contemplated by the invention comprises the
adeno-associated virus (AAV). The adeno-associated virus is a naturally occurring defective virus that
requires another virus, such as an adenovirus or a herpes virus, as a helper virus for efficient
replication and a productive life cycle (Muzyczka et al., 1992), which disclosure is hereby
incorporated by reference in its entirety. It is also one of the few viruses that may integrate its DNA
into non-dividing cells, and exhibits a high frequency of stable integration (Flotte et al., 1992;
Samulski et al., 1989; McLaughlin et al., 1989), which disclosures are hereby incorporated by
reference in their entireties. One advantageous feature of AAV derives from its reduced efficacy
for transducing primary cells relative to transformed cells.

BAC vectors 

The bacterial artificial chromosome (BAC) cloning system (Shizuya et al., 1992), which
disclosure is hereby incorporated by reference in its entirety, has been developed to stably maintain
large fragments of genomic DNA (100-300 kb) in E. coli. A preferred BAC vector comprises a
pBeloBAC11 vector that has been described by Kim et al. (1996), which disclosure is hereby
incorporated by reference in its entirety. BAC libraries are prepared with this vector using
size-selected genomic DNA that has been partially digested using enzymes that permit ligation into
either the BamHI or HindIII sites in the vector. Flanking these cloning sites are T7 and SP6 RNA
polymerase transcription initiation sites that can be used to generate end probes by either RNA
transcription or PCR methods. After the construction of a BAC library in E. coli, BAC DNA is
purified from the host cell as a supercoiled circle. Converting these circular molecules into a linear
form precedes both size determination and introduction of the BACs into recipient cells. The
cloning site is flanked by two NotI sites, permitting cloned segments to be excised from the vector
by NotI digestion. Alternatively, the DNA insert contained in the pBeloBAC11 vector may be
linearized by treatment of the BAC vector with the commercially available enzyme lambda
terminase that leads to the cleavage at the unique cosN site, but this cleavage method results in a
full length BAC clone containing both the insert DNA and the BAC sequences.
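
The NotI excision can be illustrated in silico with the Python sketch below, which locates
the NotI recognition sequence (GCGGCCGC) in a construct and returns the segment lying between the
two flanking sites. The function names are hypothetical. As simplifying assumptions, the construct
is treated as a linear string, the exact cut position within the recognition site is ignored, and
an insert that itself contains a NotI site is rejected rather than fragmented.

```python
NOT_I_SITE = "GCGGCCGC"  # NotI recognition sequence


def find_sites(seq: str, site: str = NOT_I_SITE):
    """Return the 0-based start positions of every occurrence of a recognition site."""
    seq = seq.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


def excise_insert(construct: str) -> str:
    """Return the sequence lying between the two flanking NotI sites of a construct."""
    sites = find_sites(construct)
    if len(sites) != 2:
        raise ValueError(f"expected exactly two NotI sites, found {len(sites)}")
    first, second = sites
    return construct.upper()[first + len(NOT_I_SITE):second]
```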

Baculovirus 

Another specific suitable host vector system is the pVL1392/1393 baculovirus transfer
vector (Pharmingen) that is used to transfect the Sf9 cell line (ATCC No. CRL 1711) which is
derived from Spodoptera frugiperda. Other suitable vectors for the expression of the GENSET
polypeptide of the present invention in a baculovirus expression system include those described by
Chai et al. (1993), Vlasak et al. (1983), and Lenhard et al. (1996), which disclosures are hereby
incorporated by reference in their entireties.

Delivery Of The Recombinant Vectors

To effect expression of the polynucleotides and polynucleotide constructs of the invention, 
the constructs must be delivered into a cell. This delivery may be accomplished in vitro, as in 
laboratory procedures for transforming cell lines, or in vivo or ex vivo, as in the treatment of certain
disease states. One mechanism is viral infection where the expression construct is encapsulated in
an infectious viral particle.

Several non-viral methods for the transfer of polynucleotides into cultured mammalian cells 
are also contemplated by the present invention, and include, without being limited to, calcium 
phosphate precipitation (Graham et al., 1973; Chen et al., 1987); DEAE-dextran (Gopal, 1985);
electroporation (Tur-Kaspa et al., 1986; Potter et al., 1984); direct microinjection (Harland et al.,
1985); DNA-loaded liposomes (Nicolau et al., 1982; Fraley et al., 1979); and receptor-mediated
transfection (Wu and Wu, 1987, 1988), which disclosures are hereby incorporated by reference in
their entireties. Some of these techniques may be successfully adapted for in vivo or ex vivo use. 

Once the expression polynucleotide has been delivered into the cell, it may be stably 
integrated into the genome of the recipient cell. This integration may be in the cognate location and
orientation via homologous recombination (gene replacement) or it may be integrated in a random,
non-specific location (gene augmentation). In yet further embodiments, the nucleic acid may be 
stably maintained in the cell as a separate, episomal segment of DNA. Such nucleic acid segments 
or "episomes" encode sequences sufficient to permit maintenance and replication independent of or 
in synchronization with the host cell cycle. 

One specific embodiment for a method for delivering a protein or peptide to the interior of a
cell of a vertebrate in vivo comprises the step of introducing a preparation comprising a
physiologically acceptable carrier and a naked polynucleotide operatively coding for the
polypeptide of interest into the interstitial space of a tissue comprising the cell, whereby the naked
polynucleotide is taken up into the interior of the cell and has a physiological effect. This is
particularly applicable for transfer in vitro but it may be applied in vivo as well.

Compositions for use in vitro and in vivo comprising a "naked" polynucleotide are 
described in PCT application No. WO 90/11092 (Vical Inc.) and also in PCT application No. WO
95/11307 (Institut Pasteur, INSERM, Universite d'Ottawa) as well as in the articles of Tascon et al.
(1996) and of Huygen et al. (1996), which disclosures are hereby incorporated by reference in their
entireties. 

In still another embodiment of the invention, the transfer of a naked polynucleotide of the 
invention, including a polynucleotide construct of the invention, into cells may be accomplished
with particle bombardment (biolistic), said particles being DNA-coated microprojectiles accelerated
to a high velocity allowing them to pierce cell membranes and enter cells without killing them, such
as described by Klein et al. (1987), which disclosure is hereby incorporated by reference in its
entirety. 

In a further embodiment, the polynucleotide of the invention may be entrapped in a
liposome (Ghosh and Bacchawat, 1991; Wong et al., 1980; Nicolau et al., 1987, which disclosures
are hereby incorporated by reference in their entireties).

In a specific embodiment, the invention provides a composition for the in vivo production 
of the GENSET polypeptides described herein. It comprises a naked polynucleotide operatively
coding for this polypeptide, in solution in a physiologically acceptable carrier, and suitable for
introduction into a tissue to cause cells of the tissue to express the said protein or polypeptide. 

The amount of vector to be injected into the desired host organism varies according to the site
of injection. As an indicative dose, between 0.1 and 100 µg of the vector will be injected into an
animal body, preferably a mammal body, for example a mouse body.

In another embodiment, the vector according to the invention may be introduced in
vitro into a host cell, preferably a host cell previously harvested from the animal to be treated and
more preferably a somatic cell such as a muscle cell. In a subsequent step, the cell that has been
transformed with the vector coding for the desired GENSET polypeptide or the desired fragment
thereof is reintroduced into the animal body in order to deliver the recombinant protein within the
body either locally or systemically.



Secretion vectors 

Some of the GENSET cDNAs or genomic DNAs of the invention may also be used to 
construct secretion vectors capable of directing the secretion of the proteins encoded by genes 
inserted in the vectors. Such secretion vectors may facilitate the purification or enrichment of the
proteins encoded by genes inserted therein by reducing the number of background proteins from 
which the desired protein must be purified or enriched. Exemplary secretion vectors are described 
below. 

The secretion vectors of the present invention include a promoter capable of directing gene 
expression in the host cell, tissue, or organism of interest. Such promoters include the Rous
Sarcoma Virus promoter, the SV40 promoter, the human cytomegalovirus promoter, and other
promoters familiar to those skilled in the art.

A signal sequence from a polynucleotide of the invention, preferably a signal sequence
selected from the group of signal sequences of SEQ ID NOs: 1-85, 339-400, 406-407, 413-415,
561-594, and 634-651 and signal sequences of clone inserts of the deposited clone pool, is operably
linked to the promoter such that the mRNA transcribed from the promoter will direct the translation
of the signal peptide. The host cell, tissue, or organism may be any cell, tissue, or organism which
recognizes the signal peptide encoded by the signal sequence in the GENSET cDNA or genomic 
DNA. Suitable hosts include mammalian cells, tissues or organisms, avian cells, tissues, or 
organisms, insect cells, tissues or organisms, or yeast. 

In addition, the secretion vector contains cloning sites for inserting genes encoding the 
proteins which are to be secreted. The cloning sites facilitate the cloning of the insert gene in frame
with the signal sequence such that a fusion protein in which the signal peptide is fused to the protein 
encoded by the inserted gene is expressed from the mRNA transcribed from the promoter. The 
signal peptide directs the extracellular secretion of the fusion protein. 
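
By way of a hedged illustration, the in-frame requirement described above can be checked computationally before a construct is assembled. The following Python sketch is not part of the described vectors; the function names and the example sequences are hypothetical, and the check is deliberately minimal: the signal sequence must begin with a start codon and span whole codons, the insert must span whole codons, and no stop codon may interrupt the fused reading frame before its final codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons (DNA alphabet)


def codons(seq):
    """Split a DNA string into consecutive complete codons, discarding any remainder."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def is_in_frame_fusion(signal_seq, insert_seq):
    """Return True if insert_seq is read in frame with signal_seq as one open reading frame."""
    signal_seq = signal_seq.upper()
    insert_seq = insert_seq.upper()
    if not signal_seq.startswith("ATG"):
        return False  # no start codon at the beginning of the signal sequence
    if len(signal_seq) % 3 != 0 or len(insert_seq) % 3 != 0:
        return False  # a partial codon would shift the reading frame
    fused = codons(signal_seq + insert_seq)
    # A stop codon is tolerated only as the very last codon of the fusion.
    return not any(c in STOP_CODONS for c in fused[:-1])


# Hypothetical toy sequences, for illustration only.
signal = "ATGAAACTG"
print(is_in_frame_fusion(signal, "GCTGGTTAA"))   # insert of whole codons ending in a stop
print(is_in_frame_fusion(signal, "GGCTGGTTAA"))  # one extra base shifts the frame
```

A real design check would also account for linker bases contributed by the cloning site itself, which this sketch assumes are already included in the insert string.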

The secretion vector may be DNA or RNA and may integrate into the chromosome of the 
host, be stably maintained as an extrachromosomal replicon in the host, be an artificial
chromosome, or be transiently present in the host. Preferably, the secretion vector is maintained in
multiple copies in each host cell. As used herein, multiple copies means at least 2, 5, 10, 20, 25, 50
or more than 50 copies per cell. In some embodiments, the multiple copies are maintained
extrachromosomally. In other embodiments, the multiple copies result from amplification of a
chromosomal sequence.

Many nucleic acid backbones suitable for use as secretion vectors are known to those 
skilled in the art, including retroviral vectors, SV40 vectors, Bovine Papilloma Virus vectors, yeast
integrating plasmids, yeast episomal plasmids, yeast artificial chromosomes, human artificial
chromosomes, P element vectors, baculovirus vectors, or bacterial plasmids capable of being
transiently introduced into the host.

The secretion vector may also contain a polyA signal such that the polyA signal is located 
downstream of the gene inserted into the secretion vector. 

After the gene encoding the protein for which secretion is desired is inserted into the 
secretion vector, the secretion vector is introduced into the host cell, tissue, or organism using 
calcium phosphate precipitation, DEAE-Dextran, electroporation, liposome-mediated transfection,
viral particles or as naked DNA. The protein encoded by the inserted gene is then purified or
enriched from the supernatant using conventional techniques such as ammonium sulfate
precipitation, immunoprecipitation, immunochromatography, size exclusion chromatography, ion
exchange chromatography, and HPLC. Alternatively, the secreted protein may be in a sufficiently
enriched or pure state in the supernatant or growth media of the host to permit it to be used for its
intended purpose without further enrichment. 

The signal sequences may also be inserted into vectors designed for gene therapy. In such 
vectors, the signal sequence is operably linked to a promoter such that mRNA transcribed from the 
promoter encodes the signal peptide. A cloning site is located downstream of the signal sequence
such that a gene encoding a protein whose secretion is desired may readily be inserted into the
vector and fused to the signal sequence. The vector is introduced into an appropriate host cell. The 
protein expressed from the promoter is secreted extracellularly, thereby producing a therapeutic 
effect. 

Cell Hosts 

Another object of the invention comprises a host cell that has been transformed or 
transfected with one of the polynucleotides described herein, and in particular a polynucleotide 
either comprising a GENSET polypeptide-encoding polynucleotide regulatory sequence or the
polynucleotide coding for a GENSET polypeptide. Also included are host cells that are transformed
(prokaryotic cells) or that are transfected (eukaryotic cells) with a recombinant vector such as one of
those described above. However, the cell hosts of the present invention can comprise any of the
polynucleotides of the present invention. In a preferred embodiment, host cells contain a
polynucleotide sequence comprising a sequence selected from the group consisting of sequences of
SEQ ID NOs: 1-169, 339-455, 561-784, sequences of clone inserts of the deposited clone pool,
variants and fragments thereof. Preferred host cells used as recipients for the expression vectors of 
the invention are the following: 

a) Prokaryotic host cells: Escherichia coli strains (i.e., DH5-α strain), Bacillus subtilis,
Salmonella typhimurium, and strains from species like Pseudomonas, Streptomyces and
Staphylococcus.

b) Eukaryotic host cells: HeLa cells (ATCC No. CCL2; No. CCL2.1; No. CCL2.2), Cv 1
cells (ATCC No. CCL70), COS cells (ATCC No. CRL1650; No. CRL1651), Sf-9 cells (ATCC
No. CRL1711), C127 cells (ATCC No. CRL-1804), 3T3 (ATCC No. CRL-6361), CHO (ATCC No.
CCL-61), human kidney 293 (ATCC No. 45504; No. CRL-1573) and BHK (ECACC No.
84100501; No. 84111301).

c) Other mammalian host cells. 

The present invention also encompasses primary, secondary, and immortalized
homologously recombinant host cells of vertebrate origin, preferably mammalian origin and
particularly human origin, that have been engineered to: a) insert exogenous (heterologous)
polynucleotides into the endogenous chromosomal DNA of a targeted gene, b) delete endogenous
chromosomal DNA, and/or c) replace endogenous chromosomal DNA with exogenous
polynucleotides. Insertions, deletions, and/or replacements of polynucleotide sequences may be to
the coding sequences of the targeted gene and/or to regulatory regions, such as promoter and
enhancer sequences, operably associated with the targeted gene.

In addition to encompassing host cells containing the vector constructs discussed herein, the 
invention also encompasses primary, secondary, and immortalized host cells of vertebrate origin, 
particularly mammalian origin, that have been engineered to delete or replace endogenous genetic
material (e.g., coding sequence), and/or to include genetic material (e.g., heterologous
polynucleotide sequences) that is operably associated with the polynucleotides of the invention, and
which activates, alters, and/or amplifies endogenous polynucleotides. For example, techniques
known in the art may be used to operably associate heterologous control regions (e.g., promoter
and/or enhancer) and endogenous polynucleotide sequences via homologous recombination, see,
e.g., U.S. Patent No. 5,641,670, issued June 24, 1997; International Publication No. WO 96/29411,
published September 26, 1996; International Publication No. WO 94/12650, published August 4,
1994; Koller et al. (1989); and Zijlstra et al. (1989) (the disclosures of each of which are
incorporated by reference in their entireties). 

The present invention further relates to a method of making a homologously recombinant
host cell in vitro or in vivo, wherein the expression of a targeted gene not normally expressed in the
cell is altered. Preferably the alteration causes expression of the targeted gene under normal growth
conditions or under conditions suitable for producing the polypeptide encoded by the targeted gene.
The method comprises the steps of: (a) transfecting the cell in vitro or in vivo with a polynucleotide
construct, said polynucleotide construct comprising: (i) a targeting sequence; (ii) a regulatory
sequence and/or a coding sequence; and (iii) an unpaired splice donor site, if necessary, thereby 
producing a transfected cell; and (b) maintaining the transfected cell in vitro or in vivo under 
conditions appropriate for homologous recombination. 

The present invention further relates to a method of altering the expression of a targeted
gene in a cell in vitro or in vivo wherein the gene is not normally expressed in the cell, comprising
the steps of: (a) transfecting the cell in vitro or in vivo with a polynucleotide construct, said
polynucleotide construct comprising: (i) a targeting sequence; (ii) a regulatory sequence and/or a
coding sequence; and (iii) an unpaired splice donor site, if necessary, thereby producing a
transfected cell; and (b) maintaining the transfected cell in vitro or in vivo under conditions
appropriate for homologous recombination, thereby producing a homologously recombinant cell;
and (c) maintaining the homologously recombinant cell in vitro or in vivo under conditions 
appropriate for expression of the gene. 

The present invention further relates to a method of making a polypeptide of the present
invention by altering the expression of a targeted endogenous gene in a cell in vitro or in vivo
wherein the gene is not normally expressed in the cell, comprising the steps of: (a) transfecting the
cell in vitro with a polynucleotide construct, said polynucleotide construct comprising: (i) a
targeting sequence; (ii) a regulatory sequence and/or a coding sequence; and (iii) an unpaired splice
donor site, if necessary, thereby producing a transfected cell; (b) maintaining the transfected cell in
vitro or in vivo under conditions appropriate for homologous recombination, thereby producing a
homologously recombinant cell; and (c) maintaining the homologously recombinant cell in vitro or
in vivo under conditions appropriate for expression of the gene, thereby making the polypeptide.

The present invention further relates to a polynucleotide construct which alters the 
expression of a targeted gene in a cell type in which the gene is not normally expressed. This 
occurs when the polynucleotide construct is inserted into the chromosomal DNA of the target cell,
wherein said polynucleotide construct comprises: a) a targeting sequence; b) a regulatory sequence
and/or coding sequence; and c) an unpaired splice-donor site, if necessary. Further included is a
polynucleotide construct, as described above, wherein said polynucleotide construct further
comprises a polynucleotide which encodes a polypeptide and is in-frame with the targeted
endogenous gene after homologous recombination with chromosomal DNA.

The compositions may be produced, and methods performed, by techniques known in the 
art, such as those described in U.S. Patent Nos. 6,054,288; 6,048,729; 6,048,724; 6,048,524;
5,994,127; 5,968,502; 5,965,125; 5,869,239; 5,817,789; 5,783,385; 5,733,761; 5,641,670;
5,580,734; International Publication Nos. WO 96/29411, WO 94/12650; and scientific articles
described by Koller et al. (1994) (the disclosures of each of which are incorporated by reference
in their entireties).

GENSET gene expression in mammalian cells, preferably human cells, may be rendered 
defective, or alternatively may be altered by replacing endogenous GENSET polypeptide-encoding 
genes in the genome of an animal cell by a GENSET polypeptide-encoding polynucleotide
according to the invention. These genetic alterations may be generated by homologous
recombination using previously described specific polynucleotide constructs. 

Mammal zygotes, such as murine zygotes, may be used as cell hosts. For example, murine
zygotes may undergo microinjection with a purified DNA molecule of interest, for example a
purified DNA molecule that has previously been adjusted to a concentration ranging from 1 ng/ml
(for BAC inserts) to 3 ng/µl (for P1 bacteriophage inserts) in 10 mM Tris-HCl, pH 7.4, 250 µM
EDTA containing 100 mM NaCl, 30 µM spermine, and 70 µM spermidine. When the DNA to be
microinjected has a large size, polyamines and high salt concentrations can be used in order to avoid
mechanical breakage of this DNA, as described by Schedl et al. (1993b), which disclosure is hereby
incorporated by reference in its entirety.

Any one of the polynucleotides of the invention, including the polynucleotide constructs
described herein, may be introduced in an embryonic stem (ES) cell line, preferably a mouse ES
cell line. ES cell lines are derived from pluripotent, uncommitted cells of the inner cell mass of
pre-implantation blastocysts. Preferred ES cell lines are the following: ES-E14TG2a (ATCC No.
CRL-1821), ES-D3 (ATCC No. CRL1934 and No. CRL-11632), YS001 (ATCC No. CRL-11776), 36.5
(ATCC No. CRL-11116). ES cells are maintained in an uncommitted state by culture in the
presence of growth-inhibited feeder cells which provide the appropriate signals to preserve this
embryonic phenotype and serve as a matrix for ES cell adherence. Preferred feeder cells are
primary embryonic fibroblasts that are established from tissue of day 13- day 14 embryos of
virtually any mouse strain, that are maintained in culture, such as described by Abbondanzo et al.
(1993) and are growth-inhibited by irradiation, such as described by Robertson (1987), or by the
presence of an inhibitory concentration of LIF, such as described by Pease and Williams (1990), 
which disclosures are hereby incorporated by reference in their entireties. 

The constructs in the host cells can be used in a conventional manner to produce the gene 
product encoded by the recombinant sequence.

Following transformation of a suitable host and growth of the host to an appropriate cell
density, the selected promoter is induced by appropriate means, such as temperature shift or
chemical induction, and cells are cultivated for an additional period. Cells are typically harvested
by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained
for further purification. Microbial cells employed in the expression of proteins can be disrupted by
any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of 
cell lysing agents. Such methods are well known by the skilled artisan. 

Transgenic Animals 

The terms "transgenic animals" or "host animals" are used herein to designate animals that
have their genome genetically and artificially manipulated so as to include one of the nucleic acids
according to the invention. Preferred animals are non-human mammals and include those
belonging to a genus selected from Mus (e.g. mice), Rattus (e.g. rats) and Oryctolagus (e.g. rabbits)
which have their genome artificially and genetically altered by the insertion of a nucleic acid
according to the invention. In one embodiment, the invention encompasses non-human host
mammals and animals comprising a recombinant vector of the invention or a GENSET gene
disrupted by homologous recombination with a knock out vector. 

The transgenic animals of the invention all include within a plurality of their cells a cloned 
recombinant or synthetic DNA sequence, more specifically one of the purified or isolated nucleic 
acids comprising a GENSET polypeptide coding sequence, a GENSET polynucleotide regulatory
sequence, a polynucleotide construct, or a DNA sequence encoding an antisense polynucleotide
such as described in the present specification. 

Generally, a transgenic animal according to the present invention comprises any of the
polynucleotides, the recombinant vectors and the cell hosts described in the present invention. In a
first preferred embodiment, these transgenic animals may be good experimental models in order to
study the diverse pathologies related to the dysregulation of the expression of a given GENSET
gene, in particular the transgenic animals containing within their genome one or several copies of an 
inserted polynucleotide encoding a native GENSET polypeptide, or alternatively a mutant GENSET 
polypeptide. 

In a second preferred embodiment, these transgenic animals may express a desired
polypeptide of interest under the control of the regulatory polynucleotides of the GENSET gene,
leading to high yields in the synthesis of this protein of interest, and eventually to tissue specific
expression of the protein of interest.

The design of the transgenic animals of the invention may be made according to
conventional techniques well known to one skilled in the art. For more details regarding the
production of transgenic animals, and specifically transgenic mice, reference may be made to US
Patent Nos. 4,873,191, issued Oct. 10, 1989; 5,464,764, issued Nov. 7, 1995; and 5,789,215, issued
Aug. 4, 1998; these documents being herein incorporated by reference to disclose methods of
producing transgenic mice.

Transgenic animals of the present invention are produced by the application of procedures
which result in an animal with a genome that has incorporated exogenous genetic material. The 
procedure involves obtaining the genetic material which encodes either a GENSET polypeptide 
coding sequence, a GENSET polynucleotide regulatory sequence, or a DNA sequence encoding a
GENSET polynucleotide antisense sequence, or a portion thereof, such as described in the present
specification. A recombinant polynucleotide of the invention is inserted into an embryonic stem
(ES) cell line. The insertion is preferably made using electroporation, such as described by Thomas
et al. (1987), which disclosure is hereby incorporated by reference in its entirety. The cells
subjected to electroporation are screened (e.g. by selection via selectable markers, by PCR or by
Southern blot analysis) to find positive cells which have integrated the exogenous recombinant
polynucleotide into their genome, preferably via a homologous recombination event. An
illustrative positive-negative selection procedure that may be used according to the invention is
described by Mansour et al. (1988), which disclosure is hereby incorporated by reference in its
entirety. 

The positive cells are then isolated, cloned and injected into 3.5-day-old blastocysts from
mice, such as described by Bradley (1987), which disclosure is hereby incorporated by reference in
its entirety. The blastocysts are then inserted into a female host animal and allowed to grow to
term. Alternatively, the positive ES cells are brought into contact with 2.5-day-old embryos at the
8-16 cell stage (morulae) such as described by Wood et al. (1993), or by Nagy et al. (1993), which
disclosures are hereby incorporated by reference in their entireties, the ES cells being internalized to
colonize extensively the blastocyst including the cells which will give rise to the germ line. 

The offspring of the female host are tested to determine which animals are transgenic, i.e.,
include the inserted exogenous DNA sequence, and which ones are wild type.

Thus, the present invention also concerns a transgenic animal containing a nucleic acid, a
recombinant expression vector or a recombinant host cell according to the invention.

In another embodiment, transgenic animals are produced by microinjecting polynucleotides
into a fertilized oocyte. Typically, fertilized oocytes are microinjected using
standard techniques, and then cultured in vitro until a "pre-implantation embryo" is obtained. Such
pre-implantation embryos preferably contain approximately 16 to 150 cells. Methods for culturing
fertilized oocytes to the pre-implantation stage are described, e.g., by Gordon et al. ((1984) Methods
in Enzymology, 101, 414); Hogan et al. ((1986) in Manipulating the mouse embryo. A Laboratory
Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y) (for the mouse embryo);
Hammer et al. ((1985) Nature, 315, 680) (for rabbit and porcine embryos); Gandolfi et al. ((1987) J.
Reprod. Fert. 81, 23-28); Rexroad et al. ((1988) J. Anim. Sci. 66, 947-953) (for ovine embryos); and
Eyestone et al. ((1989) J. Reprod. Fert. 85, 715-720); Camous et al. ((1984) J. Reprod. Fert. 72,
779-785); and Heyman et al. ((1987) Theriogenology 27, 59-68) (for bovine embryos); the
disclosures of each of which are incorporated herein in their entireties. Pre-implantation embryos 
are then transferred to an appropriate female by standard methods to permit the birth of a transgenic 
or chimeric animal, depending upon the stage of development when the transgene is introduced. 



As the frequency of transgene incorporation is often low, the detection of transgene 
integration in pre-implantation embryos is often desirable using any of the herein-described 
methods. Any of a number of methods can be used to detect the presence of a transgene in a
pre-implantation embryo. For example, one or more cells may be removed from the pre-implantation
embryo, and the presence or absence of the transgene in the removed cell or cells can be detected
using any standard method e.g. PCR. Alternatively, the presence of a transgene can be detected in
utero or post partum using standard methods. 

In a particularly preferred embodiment of the present invention, transgenic mammals are 
generated that secrete recombinant GENSET polypeptides in their milk. As the mammary gland is
a highly efficient protein-producing organ, such methods can be used to produce protein
concentrations in the gram per liter range, and often significantly more. Preferably, expression in
the mammary gland is accomplished by operably linking the polynucleotide encoding the GENSET
polypeptide to a mammary gland specific promoter and, optionally, other regulatory elements.
Suitable promoters and other elements include, but are not limited to, those derived from
mammalian short and long WAP, alpha, beta, and kappa casein, alpha and beta lactoglobulin,
beta-CN 5' genes, as well as the mouse mammary tumor virus (MMTV) promoter. Such promoters
and other elements may be derived from any mammal, including, but not limited to, cows, goats, 
sheep, pigs, mice, rabbits, and guinea pigs. Promoter and other regulatory sequences, vectors, and 
other relevant teachings are provided, e.g., by Clark (1998) J Mammary Gland Biol Neoplasia
3:337-50; Jost et al. (1999) Nat. Biotechnol. 17:160-4; U.S. Patent Nos. 5,994,616; 6,140,552;
6,013,857; Sohn et al. (1999) DNA Cell Biol. 18:845-52; Kim et al. (1999) J. Biochem. (Japan)
126:320-5; Soulier et al. (1999) Euro. J. Biochem. 260:533-9; Zhang et al. (1997) Chin. J. Biotech.
13:271-6; Rijnkels et al. (1998) Transgen. Res. 7:5-14; Korhonen et al. (1997) Euro. J. Biochem.
245:482-9; Uusi-Oukari et al. (1997) Transgen. Res. 6:75-84; Hitchin et al. (1996) Prot. Expr. Purif.
7:247-52; Platenburg et al. (1994) Transgen. Res. 3:99-108; Heng-Cherl et al. (1993) Animal
Biotech. 4:89-107; and Christa et al. (2000) Euro. J. Biochem. 267:1665-71; the entire disclosures
of each of which are herein incorporated by reference.

In another embodiment, the polypeptides of the invention can be produced in milk by 
introducing polynucleotides encoding the polypeptides into somatic cells of the mammary gland in
vivo, e.g. mammary secreting epithelial cells. For example, plasmid DNA can be infused through
the nipple canal, e.g. in association with DEAE-dextran (see, e.g., Hens et al. (2000) Biochim.
Biophys. Acta 1523:161-171), in association with a ligand that can lead to receptor-mediated
endocytosis of the construct (see, e.g., Sobolev et al. (1998) 273:7928-33), or in a viral vector such
as a retroviral vector, e.g. the Gibbon ape leukemia virus (see, e.g., Archer et al. (1994) PNAS
91:6840-6844). In any of these embodiments, the polynucleotide may be operably linked to a
mammary gland specific promoter, as described above, or, alternatively, any strongly expressing 
promoter such as CMV or MoMLV LTR. 

The suitability of any vector, promoter, regulatory element, etc. for use in the present 
invention can be assessed beforehand by transfecting cells such as mammary epithelial cells, e.g.
MacT cells (bovine mammary epithelial cells) or GME cells (goat mammary epithelial cells), in
vitro and assessing the efficiency of transfection and expression of the transgene in the cells. 

For in vivo administration, the polynucleotides can be administered in any suitable 
formulation, at any of a range of concentrations (e.g. 1-500 µg/ml, preferably 50-100 µg/ml), at any
volume (e.g. 1-100 ml, preferably 1 to 20 ml), and can be administered any number of times (e.g. 1,
2, 3, 5, or 10 times), at any frequency (e.g. every 1, 2, 3, 5, 10, or any number of days). Suitable
concentrations, frequencies, modes of administration, etc. will depend upon the particular 
polynucleotide, vector, animal, etc., and can readily be determined by one of skill in the art. 
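
As a purely arithmetical illustration of how the concentration, volume, and number of administrations combine, the total mass of polynucleotide delivered can be sketched as follows. The function name is hypothetical, and the sketch assumes that concentrations are expressed in µg/ml and volumes in ml; the example values are simply picked from within the ranges given above and do not represent a recommended regimen.

```python
def total_dna_mass_ug(conc_ug_per_ml, volume_ml, n_doses=1):
    """Total polynucleotide mass (in µg) delivered over n_doses administrations.

    Assumes every administration uses the same concentration and volume.
    """
    if conc_ug_per_ml < 0 or volume_ml < 0 or n_doses < 0:
        raise ValueError("concentration, volume and dose count must be non-negative")
    return conc_ug_per_ml * volume_ml * n_doses


# Hypothetical example: 50 µg/ml, 20 ml per administration, 3 administrations.
per_dose = total_dna_mass_ug(50, 20)
cumulative = total_dna_mass_ug(50, 20, n_doses=3)
print(per_dose, cumulative)
```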

In a preferred embodiment, a retroviral vector such as a Gibbon ape leukemia viral vector
is used, as described in Archer et al. ((1994) PNAS 91:6840-6844). As retroviral infection typically
requires cell division, cell division in the mammary glands can be stimulated in conjunction with
the administration of the vector, e.g. using a factor such as estradiol benzoate, progesterone,
reserpine, or dexamethasone. Further, retroviral and other methods of infection can be facilitated 
using accessory compounds such as polybrene. 

In any of the herein-described methods for obtaining GENSET polypeptides from milk, the
quantity of milk obtained, and thus the quantity of GENSET polypeptides produced, can be
enhanced using any standard method of lactation induction, e.g. using hexestrol, estrogen, and/or
progesterone. 

The polynucleotides used in such embodiments can either encode a full-length GENSET 
protein or a GENSET fragment. Typically, the encoded polypeptide will include a signal sequence
to ensure the secretion of the protein into the milk. 

Recombinant Cell Lines Derived From The Transgenic Animals Of The Invention: 

A further object of the invention comprises recombinant host cells obtained from a 
transgenic animal described herein. In one embodiment the invention encompasses cells derived
from non-human host mammals and animals comprising a recombinant vector of the invention or a 
GENSET gene disrupted by homologous recombination with a knock out vector. 

Recombinant cell lines may be established in vitro from cells obtained from any tissue of a 
transgenic animal according to the invention, for example by transfection of primary cell cultures 
with vectors expressing onc-genes such as SV40 large T antigen, as described by Chou (1989), and
Shay et al. (1991), which disclosures are hereby incorporated by reference in their entireties.

Uses of polypeptides of the invention

The polypeptides and polynucleotides of the present invention can be used in any of a large 
number of ways, including numerous in vitro and in vivo uses. Specific uses for many of the
herein-described polypeptides and polynucleotides are described in detail below.

Protein of SEQ ID NO:255 (internal designation 500762786_255-24-5-0-A2-R_104)

The cDNA of clone 500762786_255-24-5-0-A2-R_104 (SEQ ID NO:86) encodes the 
human ERD4 protein

LFPAPAPPPAPAFAPPPKVPSPERSAPRVPLPSPQPSYPFRPAASGGTPPPACLPPAQPCQGSP
AMNLFRFLGDLSHLLAIILLLLKIWKSRSCAAHPQLPLSFCLSVCLSVSLSLSXSLSLSFSVSK 
KKKK (SEQ ID NO:255). It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NO:86 and polypeptides of SEQ ID NO:255, described throughout the
present application also pertain to the human cDNA of clone 500762786_255-24-5-0-A2-R_104
and polypeptides encoded thereby. Polypeptide fragments having a biological activity described
herein and polynucleotides encoding the same are included in the present invention. Related 
polynucleotide and polypeptide sequences included in the present invention are SEQ ID NOs:406 
and 520. 

The normal functioning of the eukaryotic cell requires that all newly synthesized proteins
be correctly folded, modified, and delivered to specific intra- and extracellular sites. Newly
synthesized membrane and secretory proteins enter a cellular sorting and distribution network 
during or immediately after synthesis (cotranslationally or posttranslationally) and are routed to 
specific locations inside and outside of the cell. The initial compartment in this process is the 
endoplasmic reticulum (ER) where proteins undergo modifications such as glycosylation, disulfide
bond formation, and assembly into oligomers. The proteins are then transported through an
additional series of membrane-bound compartments which include the various cisternae of the
Golgi complex, where further carbohydrate modifications occur. Transport between compartments 
occurs by means of vesicles that bud and fuse in a specific manner; once within the secretory 
pathway, proteins do not have to cross a membrane to reach the cell surface. 

The complexity of this system has advantages for the cell because it allows proteins to fold
and mature in closed compartments that contain the appropriate enzyme catalysts. It is, however,
dependent on sorting mechanisms that position the enzymes correctly and maintain them in place. 

The first organelle in this system, the ER, contains multiple enzymes involved in protein 
structure modifications. Among these are BiP (binding protein), which directs the correct folding of
proteins, and PDI (protein disulfide isomerase) and a homologue of the 90 kDa heat-shock protein,
both of which catalyze the formation and rearrangement of disulfide bonds (Gething, M. J. and 
Sambrook, J. (1992) Nature 355:33-45). These abundant soluble proteins must be retained in the ER 
and must be distinguished from the newly synthesized secretory proteins which are rapidly 
transported to the Golgi apparatus. The signal for retention in the ER in mammalian cells consists of
the tetrapeptide sequence, KDEL, located at the carboxy terminus of proteins. This sequence was
first identified when the sequences of rat BiP and PDI were compared and it was subsequently 
found at the carboxy terminus of other luminal ER proteins from a number of species (Munro, S. 
(1986) Cell 46:291-300; Pelham, H. R. (1989) Ann. Rev. Cell. Biol. 5:1-23). Proteins containing
this sequence leave the ER but are quickly retrieved from the early Golgi compartment and returned
to the ER, while proteins without this signal continue through the distribution pathway. 

Two endoplasmic retrieval receptors were first identified in S. cerevisiae; two human
endoplasmic retrieval receptors were subsequently isolated by the use of degenerate PCR primers
based on the S. cerevisiae sequences (Hardwick, K. G. (1990) EMBO J. 9:623-630; Semenza, J. C.
(1990) Cell 61:1349-1357; Lewis, M. J. and Pelham, H. R. (1990) Nature 348:162-163; Lewis, M.
J. and Pelham, H. R. (1992) J. Mol. Biol. 226:913-916). Comparison of these sequences shows that
they consist of a conserved 7-transmembrane domain structure with only short loops in the cell
cytoplasm and the ER lumen. Studies with these endoplasmic retrieval receptors show that ligand
binding controls the movement of the receptor; when expressed in COS cells, the human receptor is
normally concentrated in the Golgi, but moves to the ER when bound to a ligand such as KDEL- 
tagged hen lysozyme (Lewis, M. J. and Pelham, H. R. (1992) Cell 68:353-364). 

The ER retrieval function of these molecules serves to maintain the pool of enzymes in the
ER that are necessary to perform protein structure modifications, retains newly synthesized proteins
in the ER until they have been correctly modified, and regulates the structure of the Golgi
apparatus. Saccharomyces cerevisiae cells that lack an ER retrieval receptor (Erd2) have a defective
Golgi apparatus and fail to grow. Analysis of yeast Erd2 mutants suggests that their growth requires
both the retention of multiple proteins in the ER and the selective removal of specific proteins from
the Golgi (Townsley, F. M. (1994) J. Cell Biol. 127:21-28). Overexpression of a human ER
retrieval receptor in COS cells results in hyperactive retrograde traffic from the Golgi to the ER
leading to a loss of the Golgi structure and the breakdown of the secretory pathway (Hsu, V. W.
(1992) Cell 69:625-635).

Disruptions in the cellular secretory pathway have been implicated in several human
diseases. In familial hypercholesterolemia the low density lipoprotein receptors remain in the ER,
rather than moving to the cell surface (Pathak, R. K. (1988) J. Cell Biol. 106:1831-1841). A form of
congenital hypothyroidism is produced by a deficiency of thyroglobulin, the thyroid prohormone. In
this disease the thyroglobulin is incorrectly folded and is therefore retained in the ER (Kim, P. S.
(1996) J. Cell Biol. 133:517-527). Mutant forms of proteolipid protein (PLP) have been examined as
they play a role in generating dysmyelinating or hypomyelinating diseases. In this case, the
mutations that result in disease are mutations that arrest transport of PLP in the ER and the early
Golgi; the subsequent accumulation of PLP in the ER results in rapid oligodendrocyte death (Gow,
A. (1994) J. Neurosci. Res. 37:574-583).

The human ER retrieval receptor function is necessary for processing and presentation of
specific antigens to T cells. Many antigens must be processed intracellularly before they can be
presented, in association with major histocompatibility complex (MHC) molecules at the cell
surface, for recognition by the antigen-specific receptor of T cells. Disruption of the ER retrieval
receptor function with an antibiotic, Brefeldin A, abolishes the ability of a cell to present these
specific antigen complexes to T cells. These antigenic proteins must be retained in the ER for
cleavage to smaller peptides which can then bind to MHC molecules and be released for
presentation at the cell surface (Kakiuchi, T. (1991) J. Immunol. 147:3289-3295).

The discovery of polynucleotides encoding a novel human KDEL receptor, and the 
molecules themselves, provides the means to further investigate the regulation of the cellular 
protein secretory pathway. Discovery of molecules related to a novel human KDEL receptor
satisfies a need in the art by providing a means or a tool for the study of this pathway and the 
diseases that involve the dysfunction of this pathway. 

In an embodiment of the present invention, ERD4 polypeptides of the present invention are
used to purify KDEL containing proteins and other homologous proteins with similar signals for ER
retention such as "HDEL", "DDEL", "ADEL", "SDEL", "RDEL", "KEEL", "QEDL", "HIEL",
"HTEL" and "KQDL". This may be carried out by covalently or non-covalently attaching the ERD4
polypeptides of the present invention to a column or other solid support using techniques well
known in the art (e.g., affinity chromatography, panning, etc.). Once bound to the ERD4
polypeptide, the complex is washed to remove contaminants. The target protein is released using
increasing salt concentrations either in a gradient or step type purification. The bound target protein
may also be released from the ERD4 polypeptide by a single step up in salt concentration.
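
As an illustration of the retention signals listed above, the following minimal Python sketch flags candidate ligands by their C-terminal tetrapeptide. It assumes, as stated above for KDEL, that each listed signal sits at the carboxy terminus of the protein; the function name and the example sequences are hypothetical and are not taken from the present application.

```python
# Retention signals as listed in the text above.
ER_RETENTION_SIGNALS = frozenset({
    "KDEL", "HDEL", "DDEL", "ADEL", "SDEL", "RDEL",
    "KEEL", "QEDL", "HIEL", "HTEL", "KQDL",
})


def er_retention_signal(sequence):
    """Return the C-terminal tetrapeptide if it is a listed signal, else None."""
    tail = sequence.strip().upper()[-4:]
    return tail if tail in ER_RETENTION_SIGNALS else None


# Hypothetical sequences, for illustration only.
print(er_retention_signal("MAAGTLSKDEL"))   # signal at the carboxy terminus
print(er_retention_signal("MAAGTLSKDELA"))  # signal present but not C-terminal
```

A sequence in which the tetrapeptide is internal rather than C-terminal is deliberately not reported, in keeping with the assumption stated above.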

In another embodiment of the present invention, the ERD4 polypeptides of the present
invention are used to detect KDEL containing proteins and other homologous sequences as
described above by methods comprising the steps of contacting KDEL or other homologous
sequences with an ERD4 polypeptide under conditions that allow binding to said sequence, and
detecting the presence of bound ERD4. The presence of bound ERD4 can be detected using
methods known in the art, such as by labeling ERD4 directly or indirectly. Bound ERD4 can be
detected, for example, by using an antibody that specifically binds to ERD4 or another ERD4-
binding compound that is detectable directly or indirectly.

Preferred ERD4 polypeptides for binding KDEL containing proteins and other homologous
sequences described above comprise the amino acid sequence -KIWK- or -
MNLFRFLGDLSHLLAIILLLLKIWKSRSCA-.

The present invention is further directed to a transformant comprising the following 
expression units in a co-expressible state: an expression unit containing a gene coding for an ERD4
polypeptide which is capable of binding to a protein localizing in the endoplasmic reticulum and
having a signal for staying therein; an expression unit containing a gene coding for said protein
localizing in endoplasmic reticulum; and an expression unit containing a foreign gene coding for a
polypeptide which is a subject of function of said protein localizing in endoplasmic reticulum, and
to a transformant comprising, in a co-expressible state, a fusion gene which is composed of a DNA
fragment coding for a human serum albumin prepro-sequence and a foreign gene coding for a useful
polypeptide. The present invention is also directed to a process for producing said polypeptide by 
co-expressing said genes in said transformant such that the polypeptide is predominantly secreted 
out of the transformant cell. Consequently, the invention has an advantage of improving the 
productivity of said polypeptide.

More particularly, the invention relates to: A transformed yeast cell comprising the
following expression units integrated on a yeast chromosome in a co-expressible state: a first
expression unit containing a gene coding for a receptor for an endoplasmic reticulum retention
signal, wherein the receptor is the receptor protein ERD4 or a fragment thereof which is capable of
binding to a retention signal selected from the group consisting of "KDEL", "HDEL", "DDEL",
"ADEL", "SDEL", "RDEL", "KEEL", "QEDL", "HIEL", "HTEL" and "KQDL"; and a second
expression unit containing a gene encoding a protein disulfide isomerase, wherein said isomerase
comprises an endoplasmic reticulum retention signal, or a gene encoding a fusion protein
comprising the amino acid sequence of said isomerase and a human serum albumin
prepro-sequence. These methods can be carried out using methods known in the art or described in U.S.
Patent 5,578,466, incorporated herein by reference in its entirety.

Proteins of SEQ ID NO:193 and 194 (internal designation 585770_215-16-5-0-E8-F and
123996_140-002-5-0-B4-F)

The cDNA of clones 585770_215-16-5-0-E8-F (SEQ ID NO:24) and 123996_140-002-5-0-
B4-F (SEQ ID NO:25) encode the human Smooth Muscle and Pain Effector (SMPE) proteins:
MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGRXGEEC
HPGSHKIPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF (SEQ ID NO:193) and
MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEEC
HPGSHKIPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF (SEQ ID NO:194),
respectively. It will be appreciated that all characteristics and uses of the polynucleotides of SEQ
ID NOs:24 and 25 and polypeptides of SEQ ID NO:193 and 194, described throughout the present
application also pertain to the human cDNA of clones 585770_215-16-5-0-E8-F and 123996_140-
002-5-0-B4-F, and the polypeptides encoded thereby. Polypeptide fragments having a biological
activity described herein and polynucleotides encoding the same are also included in the present
invention. Related polynucleotide and polypeptide sequences included in the present invention are
SEQ ID NOs:360 and 447.

SMPE contracts longitudinal ileal muscle and distal colon, and relaxes the proximal colon.
SMPE binds with a high affinity to both ileum and brain membranes. Therefore, included as
an embodiment of the present invention is a method of causing gastrointestinal smooth muscle cells to
contract, in vitro or in vivo, comprising the steps of contacting said cells with a contracting effective
amount of an SMPE polypeptide. Preferably, the gastrointestinal smooth muscle cells are those of
the longitudinal ileal muscle or distal colon. A further embodiment of the present invention is a method of
causing gastrointestinal smooth muscle cells to relax comprising the steps of contacting said cells
with a relaxing effective amount of an SMPE polypeptide. Preferably, the gastrointestinal smooth
muscle cells are proximal colon cells. SMPE can also be used in the same manner to contract
uterine cells. Therefore, included in the present invention is a method of causing uterine smooth
muscle cells to contract comprising contacting said cells with a contracting effective amount of an
SMPE polypeptide. Further included in the present invention is a method of causing smooth muscle
cells (e.g., bladder, vascular) to contract comprising contacting said cells with a contracting
effective amount of an SMPE polypeptide. Further included in the present invention is a method of
inhibiting angiogenesis comprising contacting vascular endothelial cells with an angiogenesis
inhibiting effective amount of an SMPE polypeptide. The SMPE anti-angiogenic effect can be
measured using assays known in the art. For example, the anti-angiogenic effect in vivo can be
assayed by using the 10-day-old embryo chick chorioallantoic membrane model.

SMPE binds with a high affinity to both ileum and brain membranes. Therefore, a further
embodiment of the present invention is a method of binding an SMPE polypeptide to ileum or brain
membranes. The method can be further used as a method of detecting ileum or brain membranes
comprising the steps of contacting ileum or brain membranes with an SMPE polypeptide under
conditions that allow binding to said membranes, and detecting the presence of SMPE. The
presence of SMPE can be detected using methods known in the art, such as by labeling SMPE
directly or indirectly. Bound SMPE can be detected, for example, by using an antibody that
specifically binds to SMPE or another SMPE-binding compound that is detectable directly or
indirectly.

SMPE is also expressed in spermatocytes. Therefore, a further embodiment of the present
invention is a method of detecting testes or spermatocytes by detecting an SMPE polypeptide or
nucleic acid. An SMPE polypeptide can be detected using anti-SMPE antibodies or other SMPE-
binding compounds. SMPE polynucleotides, such as mRNA, can be detected using methods known
in the art such as PCR (RT-PCR), hybridization (Northern blot analysis), etc.

SMPE elicits hyperalgesia when it contacts the CNS, e.g., the brain. Therefore, the present
invention includes a method of causing hyperalgesia comprising contacting the CNS with a
hyperalgesia effecting amount of an SMPE polypeptide. SMPE can be delivered to the CNS using
methods well known in the art including those described in PCT application WO9906060,
incorporated herein by reference in its entirety. Using the methods of WO9906060, the TGF-alpha
or other polypeptide that binds the epidermal growth factor (EGF) receptor, is substituted with an
SMPE polypeptide of the present invention.

Further included in the present invention are methods of inhibiting the above SMPE
activities using an inhibitor of SMPE. A preferred inhibitor of SMPE is an anti-SMPE antibody.
Thus, an embodiment of the present invention is a method of inhibiting smooth muscle contraction
(bladder, gastrointestinal cells, uterine) or pain comprising the step of contacting said cells with an
effective contraction- or pain-inhibiting amount of an anti-SMPE antibody or other SMPE inhibitor.

The invention further relates to a method of screening for test compounds that bind and/or
inhibit an SMPE activity above comprising the steps of contacting an SMPE polypeptide with said
test compound and detecting or measuring whether said test compound binds said SMPE
polypeptide. Alternatively, the method comprises the steps of contacting an SMPE polypeptide
with a binding target (e.g., smooth muscle cells or brain cells) of said SMPE polypeptide in the
presence of a test compound, and detecting or measuring the binding of the SMPE polypeptide to
said binding target, wherein a difference in the amount of said binding in the presence of said test
compound relative to the amount of binding in the absence of the test compound indicates that the
test compound modulates, preferably inhibits, the binding of said polypeptide to said binding target.
The method may alternatively comprise the steps of contacting an SMPE polypeptide with a
binding target in the presence of a test compound, wherein the binding of said SMPE polypeptide
with said binding target elicits or causes a biological activity (e.g., activities described above) which
is detected or measured, and further wherein a difference in the level of said biological activity in
the presence of the test compound relative to the amount of biological activity in the absence of the
test compound indicates that the test compound modulates, preferably inhibits or activates, the
biological activity of said SMPE polypeptide.

Preferred SMPE polypeptides for use in the methods described herein include the amino
acid sequences -
AVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEECHPGSHKIPFFRKRKHH- or -
TGACERDVQCGAGTCCAISLWLRGLRMCT- of SEQ ID NO:193 or 194.

Protein of SEQ ID NO:305 (internal designation 500691428_255-2-5-0-D4-R_104)

The human cDNA of clone 500691428_255-2-5-0-D4-R_104 (SEQ ID NO:136) encodes
the human VESICLE-ASSOCIATED MEMBRANE PROTEIN 10 or VAMP-10 protein:
MSATAATAPPAAPAGEGGPPAPPPNLTSNRRLQQTQAQVDEVVDIMRVNVDKVLERDQKL
SELDDRADALQAGPSQFETSAAKLKRKYWWKNLKMMIILGVICAIIUIIIVYFST (SEQ ID
NO:305). It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID
NO:136 and polypeptides of SEQ ID NO:305 described throughout the present application also
pertain to the human cDNA of clone 500691428_255-2-5-0-D4-R_104 and the polypeptides
encoded thereby. Polypeptide fragments having a biological activity described herein and
polynucleotides encoding the same are also included in the present invention. Related
polynucleotide and polypeptide sequences included in the present invention are SEQ ID NOs:432
and 546. 

VAMP-10 is an integral membrane protein involved in the movement of vesicles from the
plasmalemma of one cell, across the synapse, to the plasma membrane of the receptive neuron. This
regulated vesicle trafficking pathway and the endocytotic process may be blocked by the highly
specific action of clostridial, tetanus toxin (TeTx) and botulinum toxin (BoNT) and other
metalloendoprotease neurotoxins which prevents neurotransmitter release by cleaving VAMPs.
VAMP-10 is important in membrane trafficking. It participates in axon extension via exocytosis
during development, in the release of neurotransmitters and modulatory peptides, and in
endocytosis. The tightly-regulated synaptic vesicle cycle at the nerve terminal consists of the
formation of synaptic vesicles, the docking of vesicles comprising VAMP-10 to the presynaptic
plasma membrane, the fusion of these membranes and consequent neurotransmitter release,
endocytosis of the empty vesicles and the regeneration of fresh vesicles. Endocytotic vesicular
transport includes such intracellular events as the fusions and fissions of the nuclear membrane,
endoplasmic reticulum, Golgi apparatus, and various inclusion bodies such as peroxisomes or
lysosomes.

VAMP-10, like other VAMPs, has a three domain organization. The domains include a
variable proline-rich, N-terminal sequence, a highly conserved central hydrophilic core of amino
acids, and a hydrophobic sequence of amino acids presumed to be the membrane anchor. 

In one aspect, the invention includes a VAMP-10 polypeptide composition for use in
delivering a second composition, preferably nucleic acids, polypeptides, or small molecules such as
therapeutic drugs, to target biological cells either in vitro or in vivo. The composition comprises a
VAMP-10 polypeptide as a first molecule and a second molecule. The second molecule may, if
desirable, be covalently or non-covalently attached or fused to the VAMP-10 polypeptide. The
VAMP-10 polypeptide composition may further comprise artificial lipids to facilitate delivery of
the second molecule by liposomes or lipid vesicles. Methods for using VAMP-10 polypeptides in
these methods are known in the art and include U.S. Patents 6,074,844, 6,203,794 and 6,099,857,
incorporated by reference in their entireties. In a preferred embodiment, VAMP-10 polypeptides
are used to facilitate delivery of a second composition, e.g., liposome mediated DNA transfection, to
cells in culture, preferably neuronal cells, and further preferably to the presynaptic membrane.

VAMP-10 polypeptides are also useful in methods of inhibiting the release of
neurotransmitters by preventing the docking and/or fusing of a presynaptic vesicle to the
presynaptic membrane. These polypeptides may be referred to as excitation-secretion uncoupling
peptides (ESUPs). Fragments of VAMP-10 having this blocking activity can be identified using
methods known in the art (See e.g., U.S. Patents 6,090,631 and 6,169,074 incorporated by reference
in their entireties). ESUPs of the present invention comprise synthetic and purified VAMP-10
peptide fragments which correspond in primary structure to peptides which serve as binding
domains for the assembly of a ternary protein complex ("docking complex") which is critical to
neuronal vesicle docking with the cellular plasma membrane prior to neurotransmitter secretion.
Preferably, the primary sequence of the ESUPs of the invention also includes amino acids which are
identical in sequence to the VAMP-10 peptide products of BoTx and TeTx proteolytic cleavage in
neuronal cells, or fragments thereof ("proteolytic products"). For optimal activity, ESUPs of the
invention have a minimum length of about 20 amino acids and a maximal length of about 28 amino
acids, although they may be larger or smaller. Preferably, the ESUPs correspond in primary
structure to binding domains in the docking complex, most preferably the region of such binding
domains that are involved in the formation of a coiled-coil structure in the native docking complex
proteins. ESUPs may also be used as pharmaceutical carriers as part of fusion proteins to deliver
substances of interest into neural cells in a targeted manner. Preferred VAMP-10, or ESUP,
polypeptides for use in inhibiting the release of neurotransmitters include those comprising -
NRRLQQTQAQVDEVVDIMRVNVDKVLERDQKLSELDDRADALQAGPSQFETSAAKLKRK
- of SEQ ID NO:305. More preferred ESUP polypeptides comprise an amino acid sequence portion
of SEQ ID NO:305 selected from the group consisting of: RVNVDKVLERDQKLSELDD;
KVLERDQKLSELDDRA; VNVDKVLERDQKLSELDDRA;
DIMRVNVDKVLERDQKLSELDDRADAL; DEVVDIMRVNVD; QAQVDEVVDIMRVNVD;
LQQTQAQVDEVVDIMRVNVD; QQTQAQVDEVVD; NRRLQQTQAQVDEVVD; and
NLTSNRRLQQTQAQVDEVVD.
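
The preferred fragments listed above are described as portions of SEQ ID NO:305, which can be checked mechanically. The Python sketch below is offered only as an illustration: it locates each fragment within the N-terminal portion of SEQ ID NO:305 as printed in this section (through ...KLKRK) and reports its 1-based start, end, and length. The helper name `locate` is hypothetical.

```python
# N-terminal portion of SEQ ID NO:305 as printed in this section (through ...KLKRK).
VAMP10_N_TERMINAL = (
    "MSATAATAPPAAPAGEGGPPAPPPNLTSNRRLQQTQAQVDEVVDIMRVNVDKVLERDQKL"
    "SELDDRADALQAGPSQFETSAAKLKRK"
)

# Preferred ESUP fragments as listed in the text above.
ESUP_FRAGMENTS = [
    "RVNVDKVLERDQKLSELDD",
    "KVLERDQKLSELDDRA",
    "VNVDKVLERDQKLSELDDRA",
    "DIMRVNVDKVLERDQKLSELDDRADAL",
    "DEVVDIMRVNVD",
    "QAQVDEVVDIMRVNVD",
    "LQQTQAQVDEVVDIMRVNVD",
    "QQTQAQVDEVVD",
    "NRRLQQTQAQVDEVVD",
    "NLTSNRRLQQTQAQVDEVVD",
]


def locate(fragment, parent):
    """Return (start, end, length) using 1-based inclusive positions, or None."""
    index = parent.find(fragment)
    if index < 0:
        return None
    return index + 1, index + len(fragment), len(fragment)


for fragment in ESUP_FRAGMENTS:
    print(fragment, locate(fragment, VAMP10_N_TERMINAL))
```

A fragment that returns None would indicate a transcription error in either the fragment or the parent sequence.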

The ESUPs above may be used to inhibit or treat pain according to U.S. Patent 6,113,915 or
5,989,545 (incorporated by reference herein in their entireties) by substituting the polypeptides of
the present invention for BoTx type A. 

Because VAMP-10 is a component of vesicles, antibodies to VAMP-10 are useful in the
detection of vesicles, preferably neuronal vesicles transporting neurotransmitters. VAMP-10 can be
used during purification of vesicles as a marker for vesicles or vesicles can be detected using
antibodies to VAMP-10 in assays such as immunohistochemistry. Following exocytosis of vesicles,
a portion of the VAMP-10 inserted in the vesicle appears on the surface of the axon, thus making
VAMP-10 useful for the detection and monitoring of exocytosis of synaptic vesicles. 

Detection of VAMP-10 expression (mRNA or protein) levels or mutated forms of VAMP-10
is further useful in the determination or diagnosing of whether someone is at risk of developing
or has a neurological disorder (such as mood disorders selected from depression, bipolar disorder,
schizophrenia, etc.), wherein a decreased level in expression of VAMP-10, mRNA or protein, as
compared to an individual without a neurological disorder indicates the individual has the disorder
or is at risk of having the disorder in the future.

The present invention further includes a novel assay system for toxins, such as clostridial,
tetanus toxin (TeTx) and botulinum toxin (BoNT), using novel reagents. Preferably, methods of
U.S. Patent 6,043,042, incorporated by reference in its entirety, are used to perform the assay,
wherein a VAMP-10 polypeptide is the substrate cleaved by the test compound.

More specifically, the invention relates to an assay for botulinum toxin or tetanus toxin
comprising the steps of:

(a) combining a test compound with a substrate and with antibody, wherein the substrate
has a cleavage site for the toxin and when cleaved by toxin forms a product, and wherein the
antibody binds to the product but not to the substrate; and wherein the substrate is a VAMP-10
polypeptide; and

(b) testing for the presence of antibody bound to the product, which product is attached to a 
solid phase assay component. 

Preferably, in the practice of this invention, the VAMP-10 polypeptide is cleaved by the 
toxin to generate new peptides having N- and C-terminal ends. In addition, the peptide substrate is 
attached to a solid phase component of the assay.

The assay according to the invention may utilize assay components (a) and (b): 
(a) a peptide linked to a solid-phase, the peptide being cleavable by the toxin to generate a 
cleavage product.

(b) an antibody that binds to the cleavage product but not to the uncleaved polypeptide or 
an antibody that binds a cleavage product that is either the N-terminal or C-terminal portion of the 
VAMP-10, and the assay may comprise the steps of:

(i) combining a test compound that may contain or consist of the toxin with the solid-phase 
peptide to form an assay mixture,

(ii) subsequently or simultaneously combining the assay mixture with the antibody, and 

(iii) subsequently or simultaneously determining whether there has been formed any 
conjugate between the antibody and the cleavage product. 

Preferably, the step (i) of the assay is carried out in the presence of a zinc compound and a 
VAMP-10 polypeptide.

In this embodiment, the assay comprises: 

(i) combining the test compound with a solid phase comprising a VAMP-10 polypeptide,

(ii) washing the test compound from the solid phase, 

(iii) combining the solid phase with an antibody adapted for binding selectively with 
peptide cleaved by toxin, and

(iv) detecting a conjugate of the antibody with cleaved peptide.

In another embodiment, the assay comprises:

(i) adding a test solution to an assay plate comprising immobilized peptide, the peptide
being a VAMP-10 polypeptide;

(ii) incubating the assay plate,

(iii) washing the plate with a buffer,

(iv) adding to the plate an antibody solution, said solution comprising an antibody adapted
selectively to bind to a peptide selected from the group consisting of (1) the 50 C-terminal amino
acid residues of SEQ ID NO:305, the 30 C-terminal amino acid residues of SEQ ID NO:305, and the 20
C-terminal amino acid residues of SEQ ID NO:305 (any other VAMP-10 polypeptide of the present
invention may also be selected), and

(2) a peptide the N-terminal end of which is selected from the group consisting of: (1) the 50
N-terminal amino acid residues of SEQ ID NO:305, the 30 N-terminal amino acid residues of SEQ ID
NO:305, and the 25 N-terminal amino acid residues of SEQ ID NO:305 (any other VAMP-10
polypeptide of the present invention may also be selected).

(v) incubating the assay plate, 

(vi) washing the plate with a buffer, and 

(vii) measuring the presence of antibody on the assay plate. 
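
The terminal peptides named in step (iv) can be derived from a sequence by simple slicing. The following Python sketch is illustrative only: the helper names are hypothetical, the N-terminal example uses the first 60 residues of SEQ ID NO:305 as printed in this section, and the C-terminal example uses a made-up toy peptide rather than a sequence of the invention.

```python
# First 60 residues of SEQ ID NO:305 as printed in this section.
VAMP10_FIRST_60 = "MSATAATAPPAAPAGEGGPPAPPPNLTSNRRLQQTQAQVDEVVDIMRVNVDKVLERDQKL"


def n_terminal(sequence, count):
    """Return the first `count` residues of `sequence`."""
    if not 0 < count <= len(sequence):
        raise ValueError("count must be between 1 and the sequence length")
    return sequence[:count]


def c_terminal(sequence, count):
    """Return the last `count` residues of `sequence`."""
    if not 0 < count <= len(sequence):
        raise ValueError("count must be between 1 and the sequence length")
    return sequence[-count:]


# The 50, 30, and 25 N-terminal residues named in the text.
for count in (50, 30, 25):
    print(count, n_terminal(VAMP10_FIRST_60, count))

# Toy peptide (an assumption, not from the application) to show C-terminal slicing.
print(c_terminal("ACDEFGHIKL", 3))
```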

In this embodiment, the antibody may be linked to an enzyme and the presence of antibody 
on the plate is measured by adding an enzyme substrate and measuring the conversion of the
substrate into detectable product. The detectable product may be colored and measured by 
absorbance at a selected wavelength. 

In the practice of the invention, the inactive toxin present in the test compound may be 
converted to active toxin. This may be accomplished by adding a protease to the test compound.

The antibody-peptide conjugate may be detected using a further antibody specific to the
first antibody and linked to an enzyme. 



Proteins of SEQ ID NO:171 (internal designation Clone ID:589115) and related protein of SEQ ID
NO:457.

The polynucleotides of SEQ ID NO:2 and SEQ ID NO:340 and polypeptides of SEQ ID 
NO:171 and 457 encode a C-terminal variant of Apolipoprotein AI, herein referred to as ApoAI-CTV. 
An embodiment of the invention includes compositions of SEQ ID NO:2, 340, 171, and 457 
which encode this novel variant of the apolipoprotein family of lipid transporting proteins. 
Specifically, ApoAI-CTV is a component of high density lipoprotein, which functions to remove 
cholesterol from circulation and thus provides protection against the development of 
atherosclerosis, coronary atherosclerotic lesions and subsequent microvascular and cardiovascular 
disease. 

Preferred polynucleotides of the invention are compositions of the novel portion of the 
cDNA from bases 465 to 521 of SEQ ID NO:2 including the nucleic acids comprising the 
sequences - 

GCAGCTTTCTTAACTATCCTAACAAGCCTTGGACCAAATGGAAATAAAGCTTTTTGA-, - 
GAAGGCAGCTTTCTTAACTATCCTAACAAGCCTTGGACCAAATGGAAATAAAGCTTTTTGA-, or - 
AGCTCTACCGCCAGAAGGCAGCTTTCTTAACTATCCTAACAAGCCTTGGACCAAATGGAAATAAAGCTTTTTGATGAAAAAA- 
of SEQ ID NO:2 and 340. 

Preferred polypeptides of the invention are compositions of the novel C-terminal portion 
comprising the amino acid sequence -AAFLTILTSLGPNGNKAF, -MELYRQKAAFLTILTSLGPNGNKAF, 
or -QKKWQEEMELYRQKAAFLTILTSLGPNGNKAF 
of SEQ ID NO:171 and SEQ ID NO:457. 
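
The correspondence between the shortest polynucleotide fragment above (bases 465 to 521 of SEQ ID NO:2) and the shortest C-terminal peptide can be checked by conceptual translation. The following is a minimal sketch, assuming the standard genetic code and a reading frame starting at the first base of the fragment; the names `translate`, `NOVEL_CDNA` and `NOVEL_PEPTIDE` are illustrative only and are not part of the disclosure.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Fragment as printed in the text (bases 465 to 521 of SEQ ID NO:2).
NOVEL_CDNA = "GCAGCTTTCTTAACTATCCTAACAAGCCTTGGACCAAATGGAAATAAAGCTTTTTGA"
# Shortest preferred C-terminal peptide as printed in the text.
NOVEL_PEPTIDE = "AAFLTILTSLGPNGNKAF"

protein = translate(NOVEL_CDNA)
print(len(NOVEL_CDNA), protein)
```

Under these assumptions the 57-base fragment reads as the 18-residue peptide followed by a stop codon, which is consistent with the fragment encoding the C-terminus of the variant.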

Further preferred polypeptides of the invention include the compositions comprising the 
apolipoprotein domain 

KAAVLTLAVLFLTGSQARHFWQQDEPPQSPWDRVKDLATVYVDVLKDSGRDYVS 
QFEGSALGKQLNLKLLDNWDSVTSAFSNLREQLGPVTQEXWDNLEKETEGLRQEMSKDLE 
EVKAKVQPYLDDFQKKWQEEMELYRQKAAFLTILTSLGPNGNKA of SEQ ID NO:171 and 
457, or the amino acid residue positions -17 to +141 of SEQ ID NO:171. 

An embodiment of the invention includes a method for treatment of atherosclerosis or 
cardiovascular diseases, comprising administering to an individual a therapeutically effective 
amount of ApoAI-CTV or variants or mixtures thereof to lower total plasma cholesterol by at least 5% 
relative to pretreatment levels. 

Further utility of the polypeptides of the present invention may be confirmed by 
methods of production and use of other apolipoproteins known to those skilled in the art or as described by 
Ageland et al. in US Patent 5,990,081, which disclosure is hereby incorporated by reference in its 
entirety. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:2 and SEQ ID NO:340 and polypeptides of SEQ ID NO:171 and 457 described throughout the 
present application also pertain to the human cDNA of clone 589115. 

Proteins of SEQ ID NO:302 (internal designation Clone ID:1000853793) and related protein of 
SEQ ID NO:543. 

MRLFLSLPVLVVVLSIVLEGPAPAQGTPDVSSALDKLKEFGNTLEDKARELISRIKQ 
SELSAKMREWFSETFQKVKDKLKIDS 

The polynucleotides of SEQ ID NO:133 and SEQ ID NO:429 encode human apolipoprotein 
CI (ApoCI) polypeptide of SEQ ID NO:302 and SEQ ID NO:543, respectively. The ApoCI of the 
invention differs by one amino acid within the sequence FQKVKDKLKI, where 
aspartate (D at position 77 of SEQ ID NO:302) replaces a glutamate (E) of the ApoCI of GENPEP 
accession X00570, AF050154, and M20902. ApoCI is a member of the apolipoprotein family of 
lipid binding and transporting proteins specifically functioning to transport cholesterol esters. 
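
The position of the substitution can be verified against the amino acid sequence printed under the heading above. The sketch below is illustrative only: the comparison sequence is not retrieved from GENPEP but is reconstructed here, as an assumption, by placing glutamate (E) back at position 77; `substitute`, `differences` and `HYPOTHETICAL_REFERENCE` are hypothetical names.

```python
# ApoCI sequence as printed in the text (83 residues).
APOCI_VARIANT = (
    "MRLFLSLPVLVVVLSIVLEGPAPAQGTPDVSSALDKLKEFGNTLEDKARELISRIKQ"
    "SELSAKMREWFSETFQKVKDKLKIDS"
)


def substitute(sequence: str, position: int, residue: str) -> str:
    """Return a copy of sequence with residue placed at a 1-based position."""
    return sequence[:position - 1] + residue + sequence[position:]


def differences(reference: str, variant: str):
    """List (1-based position, reference residue, variant residue) mismatches."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length")
    return [
        (i + 1, ref, var)
        for i, (ref, var) in enumerate(zip(reference, variant))
        if ref != var
    ]


# Assumption: a stand-in for the published ApoCI, built only for illustration.
HYPOTHETICAL_REFERENCE = substitute(APOCI_VARIANT, 77, "E")

diffs = differences(HYPOTHETICAL_REFERENCE, APOCI_VARIANT)
motif_start = APOCI_VARIANT.find("FQKVKDKLKI") + 1  # 1-based start of the motif
print(diffs, motif_start)
```

In the printed sequence the motif FQKVKDKLKI begins at residue 72, so its sixth residue, the aspartate, falls at position 77 as stated.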

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:133 and SEQ ID NO:429 and polypeptides of SEQ ID NO:302 and SEQ ID NO:543, described 
throughout the present application also pertain to the human cDNA of clone 1000853793, and the 
polypeptides encoded thereby. 

Proteins of SEQ ID NO:295 (internal designation Clone ID:642948), SEQ ID NO:296 (internal 
designation Clone ID:638743), and SEQ ID NO:539. 

MEASALTSSAVTSVAKWRVASGSAVVLPLARIATVVIGGVVAMAAVPMVLSAMGFTAA 
GIASSSIAAKMMSAAAIANGGGVASGSLVATLQSLGATGLSGLTKFILGSIGSAIAAVIARFY 

The polynucleotides of SEQ ID NO:126, 127, and 425 and the polypeptides of SEQ ID 
NO:295, 296 and 539 encode human transmembrane, alpha-interferon-inducible polypeptides, 
aINFIP-1, aINFIP-2, and aINFIP-3, respectively. Preferred polynucleotides and polypeptides of the 
invention comprise the nucleic acid sequences of SEQ ID NO:126, 127, and 425 and amino acid 
sequences of SEQ ID NO:295, 296 and 539. 

Preferred polypeptides of SEQ ID NO:295 and SEQ ID NO:539 for use in the methods 
described herein include the amino acid sequences comprising - 
VLSAMGFTAAGIASSSIAAKMMSAAAIANGGGVASG-, - 
SSIAAKMMSAAAIANGGGVASGSLVATLQSLGAT-, or - 
VIGGVVAMAAVPMVLSAMGFTAAGIASSSIAAKMMSAAAIANGGGVASGSLVATLQSLG 
ATGLSGLTK-. 

Preferred polypeptides of SEQ ID NO:296 for use in the methods described herein include 
the amino acid sequence -AAAIANXGGVASGSLVATLQSLGATGLSGLTKF- or - 
LSAMGFTAAGIASSSIAAKMMSAAAIANXGGVASGSLVATLQSLGATGLS-. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NOs:126, 127, and 425 and the polypeptides of SEQ ID NOs:295, 296 and 539, described 
throughout the present application also pertain to the human cDNA of Clone ID:642948 and Clone 
ID:638743, and the polypeptides encoded thereby. 

Sites of glycine myristylation within a polypeptide function to modulate the activity and 
compartmentalization of the protein (Resh, M. D. Biochim Biophys Acta 1451:1-16 (1999)). 
Preferred polypeptides of the invention include fragments comprising the sites of 
N-myristylation. Preferred amino acids of said sites within SEQ ID NO:295 and SEQ ID NO:539 
include GGVVAM (positions 39-44), GIASSS (positions 60-65), GGGVAS (positions 79-84), 
GSLVAT (positions 85-90), and GSIGSX (positions 108-113). Further preferred are amino acids 
within 6 residues preceding or 6 residues following said amino acid sequences. Further preferred 
amino acids include sequences comprising the sites of N-myristylation in the polypeptides of SEQ 
ID NO:296. Preferred amino acids of said sites within SEQ ID NO:296 include 
IATVVIGGVVAMAAVPMV, MGFTAAGIASSSIAAKMM, AAIANXGGVASGSLVATL, 
ISIXGGVASGSLVATLQSLGA, and LTKFILGSIGSAIAAVIAR. 
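
Candidate N-myristylation sites such as those listed above can be located by a simple pattern scan. The following is a minimal sketch that assumes the PROSITE-style consensus G-{EDRKHPFYW}-x(2)-[STAGCN]-{P}; the text does not state which pattern was used, so this choice is an assumption. The sequence is the one printed under the heading for this protein family, and residue numbering in this printed copy may not coincide with the numbering of the sequence listing. `find_sites` is a hypothetical helper name.

```python
import re

# Assumed consensus: G-{EDRKHPFYW}-x(2)-[STAGCN]-{P}.
# The lookahead lets overlapping candidate sites all be reported.
MYRISTOYLATION = re.compile(r"(?=(G[^EDRKHPFYW].{2}[STAGCN][^P]))")

# Sequence as printed in the text for this protein family.
PRINTED_SEQUENCE = (
    "MEASALTSSAVTSVAKWRVASGSAVVLPLARIATVVIGGVVAMAAVPMVLSAMGFTAA"
    "GIASSSIAAKMMSAAAIANGGGVASGSLVATLQSLGATGLSGLTKFILGSIGSAIAAVIARFY"
)


def find_sites(sequence: str):
    """Return (1-based start, six-residue motif) for each candidate site."""
    return [(m.start() + 1, m.group(1)) for m in MYRISTOYLATION.finditer(sequence)]


sites = find_sites(PRINTED_SEQUENCE)
motifs = {motif for _, motif in sites}
for start, motif in sites:
    print(start, motif)
```

A scan of this kind only flags sequence motifs; whether a given glycine is actually myristylated would need to be established experimentally.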

Interferons (IFNs) are a part of the group of intercellular messenger proteins known as 
cytokines and are part of the body's natural defense to viruses and tumors. Type I IFNs (alpha and 
beta interferons) are produced in a variety of cell types and their biosynthesis is stimulated by 
viruses and other pathogens, and by various cytokines and growth factors. Both α- and γ-IFNs are 
immunomodulators and anti-inflammatory agents, activating macrophages, T-cells and natural killer 
cells (reviewed in Jonasch and Haluska, Oncologist 6(1):34 (2001)). As part of the body's natural 
defense to viruses and tumors, IFNs affect the function of the immune system and have direct action 
on pathogens and tumor cells. IFNs mediate these multiple effects by inducing the synthesis of 
cellular proteins, including the polypeptides of the present invention, aINFIP-1, aINFIP-2, and 
aINFIP-3. 

Antiviral activity of the aINFIP polypeptides is assayed according to conventional 
methods (Tovey et al., Proc. Soc. Exp. Biol. Med., 1974 146: 809-815). Preferred polypeptides 
of SEQ ID NO:295, 296 and 539 and fragments thereof include those which possess antiviral 
function, where preferred antiviral activity is against herpes simplex virus and hepatitis C virus, 
alone or in combination with known antiviral treatments such as interferon alpha. 

The antitumor activity of the aINFIP polypeptides of the invention can be demonstrated by 
similar methods using tumor cell lines rather than treatment of cells with virus as used to test 
antiviral activity. Examples of tumor cell lines include MCF-7 (human breast cancer derived), NOS-1 
(human oral primary squamous cell carcinoma derived), and MedB-1 (human primary mediastinal 
large B-cell lymphoma derived). 

Further utility of the polypeptides of the present invention may be confirmed by 
methods of interferon inducible proteins in the inhibition of viral functions such as cell penetration, 
uncoating, RNA and protein synthesis, assembly and release described in Hardman et al., 
Pharmacological Basis of Therapeutics, McGraw-Hill, New York NY pp 1211-1214, 25 (1996), 
disclosure of which is hereby incorporated by reference in its entirety. 

Another embodiment of the present invention relates to the use of aINFIP polypeptides or 
fragments thereof to treat and/or prevent the ill-effect of bacterial infection. In a preferred 
embodiment, the protein of the invention may be used to counteract the effects of the bacterial 
endotoxin lipopolysaccharide (LPS). The methods for using such compositions are described in 
Dziegielewska and Andersen, Biol. Neonate, 74:372-5 (1998), the disclosure of which is 
incorporated herein by reference in its entirety. 

Furthermore, the aINFIP polypeptides or fragments thereof may be used to identify specific 
molecules to which they bind, such as agonists, antagonists or inhibitors. Another embodiment of 
the present invention relates to methods of using the aINFIP polypeptides or fragments thereof to 
identify and/or quantify cytokines of the interferon family as well as other cytokines such as IL-10 
and tumor antigens, which may interact with the aINFIP polypeptides of the invention. 

The aINFIP polypeptides of the invention or fragments thereof are included in 
pharmaceutical preparations for treatment, prevention or alleviation of cancers. In another 
embodiment of the present invention, the aINFIP polypeptides of the invention or fragments thereof 
are included in pharmaceutical preparations for treatment, prevention or alleviation of viral or 
bacterial infections. In another embodiment of the present invention, the aINFIP polypeptides of 
the invention or fragments thereof are used to inhibit and/or modulate the effect of cytokines and 
related molecules such as IL-2, TNF alpha, CTLA4, CD28, and others, by preventing the binding of 
the endogenous cytokines to their natural receptors, thereby blocking cell proliferation or inhibitory 
signals generated by the ligand-receptor binding event. 

In another embodiment of the present invention, the aINFIP polypeptides of the invention 
or fragments thereof are useful to correct defects in in vivo models of disease such as autoimmune, 
inflammation and tumor models, by injecting the protein either intraperitoneally, intravenously, 
subcutaneously or directly in the diseased tissue. 

The polynucleotides of SEQ ID NO:126, 127, and 425 or fragments thereof are useful in 
diagnostic assays for aINFIP-1, aINFIP-2, or aINFIP-3 gene expression in in vitro models or in 
conditions associated with expression of the aINFIP polypeptides of the invention. The diagnostic 
assay is useful to distinguish between absence, presence, and excess expression of the gene and to 
monitor regulation of levels of the gene of the invention during therapeutic intervention. The DNA 
may also be incorporated into effective eukaryotic expression vectors and directly targeted to a 
specific tissue, organ, or cell population for use in gene therapy to treat the above mentioned 
conditions, including tumors and/or to correct disease- or genetic-induced defects in any of the 
above mentioned proteins including the protein of the invention. 

Protein of SEQ ID NO:170 (internal designation Clone ID:502084) and related protein of SEQ ID 
NO:456. 

The polynucleotides of SEQ ID NO:339 and polypeptides of SEQ ID NO:456 encode 
neutrophil stimulating protein 2, previously described in WO 9006321 (GENPEP accession 
A01319) as a novel factor having neutrophil-stimulating activity. The polynucleotide of SEQ ID 
NO:1 encodes a novel polypeptide variant, neutrophil stimulating protein 2v, comprising the amino 
acid sequence of SEQ ID NO:170 in which an aspartate (D) residue is located at position +16 of 
SEQ ID NO:170 rather than a glutamate (E). Preferred compositions of the invention include the 
polypeptides of SEQ ID NO:170. Further preferred amino acids of SEQ ID NO:170 comprise the 
sequence LAKGKDESLDS, QXKRNLAKGKDESLDSDLYAE, or 
SSTKGQXKRNLAKGKDESLDSDLYAELRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIA 
TLKDGRKICLDPDAPRIKKIVQKKLAGDESAD. It will be appreciated that all characteristics 
and uses of the polynucleotides of SEQ ID NO:1 and SEQ ID NO:339 and polypeptides of SEQ ID 
NO:170 and SEQ ID NO:456, described throughout the present application also pertain to the 
human cDNA of clone 502084, and the polypeptides encoded thereby. 

A preferred embodiment of the invention includes use of the novel neutrophil stimulating 
protein 2v of SEQ ID NO:170 in a method to stimulate wound healing by contacting the wound area 
with an effective amount of the polypeptide of SEQ ID NO:170 or further use as described in US Patent 
5,804,176, which disclosure is hereby incorporated by reference in its entirety. A further preferred 
embodiment includes use of neutrophil stimulating protein 2v in the enhancement of angiogenesis for 
revascularization after injury, such as following myocardial infarction, wherein the site of injury is 
contacted with an effective amount of the polypeptide of SEQ ID NO:170 or use as further described in 
US Patent 5,871,723, which disclosure is hereby incorporated by reference in its entirety. 
Antibodies against neutrophil stimulating protein 2v, by preventing or blocking the deposition of 
connective tissue matrix, are useful in the treatment of fibrotic disorders by contacting the 
polypeptides of SEQ ID NO:170 with fibrotic tissue, such as in scleroderma, liver cirrhosis, and 
myelofibrosis. 

An embodiment of the invention includes fragments of SEQ ID NO:170 which comprise 
domains which impart function to this cytokine. Preferred fragments include the amino acid 
sequence comprising the IL8 domain, 
DSDLYAELRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKI 
VQKKL. Further preferred amino acids include the small cytokines (intercrine/chemokine) C-x-C 
subfamily signature of the amino acid sequence comprising 
CMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLD. 

Further preferred polypeptides include portions comprising sites of Protein Kinase C 
phosphorylation including amino acid residues 2 to 4, residues 13 to 15, residues 36 to 38 and 
residues 97 to 99 of SEQ ID NO: or amino acid sequences comprising SLR, SAR, STK, and TLK. 
Further preferred polypeptides include portions of the amino acid sequence comprising sites of 
Casein kinase II phosphorylation including amino acid residues 97 to 100 or the amino acid 
sequence comprising TLKD. 

Further preferred polypeptides include portions of the amino acid sequence comprising sites 
of N-myristylation or the amino acid residues comprising GTHCNQ. 

Further preferred polypeptides include the small cytokines (intercrine/chemokine) C-x-C 
subfamily signature of the amino acid sequence comprising 
CMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLD. 
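
The phosphorylation and myristylation motifs named above can be located within the IL8 domain by pattern matching. The sketch below assumes PROSITE-style consensus patterns ([ST]-x-[RK] for protein kinase C, [ST]-x(2)-[DE] for casein kinase II, and G-{EDRKHPFYW}-x(2)-[STAGCN]-{P} for N-myristylation); the text does not name the patterns used, so these are assumptions, and `PATTERNS` and `scan` are hypothetical names. Only the IL8 domain as printed is scanned, so motifs lying outside it (SLR, SAR, STK) are not expected to appear.

```python
import re

# Assumed PROSITE-style consensus patterns, wrapped in lookaheads so that
# overlapping candidate sites are all reported.
PATTERNS = {
    "PKC": re.compile(r"(?=([ST].[RK]))"),
    "CK2": re.compile(r"(?=([ST].{2}[DE]))"),
    "MYR": re.compile(r"(?=(G[^EDRKHPFYW].{2}[STAGCN][^P]))"),
}

# IL8 domain as printed in the text.
IL8_DOMAIN = (
    "DSDLYAELRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKI"
    "VQKKL"
)


def scan(sequence: str):
    """Map each site type to the list of motif strings found in sequence."""
    return {
        name: [m.group(1) for m in pattern.finditer(sequence)]
        for name, pattern in PATTERNS.items()
    }


hits = scan(IL8_DOMAIN)
print(hits)
```

Within the printed domain this scan reports TLK, TLKD and GTHCNQ, which agree with the motifs named in the text for that region.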

Proteins of SEQ ID NO:227 (internal designation Clone ID:166601) and related protein of SEQ ID 
NO:502. 

Polynucleotides of SEQ ID NO:58 and SEQ ID NO:385 encode the polypeptides of SEQ 
ID NO:227 and SEQ ID NO:502, respectively, with amino acid sequence 
MAAAAVPSLLLSLPPHQGLTFSNKIQPFGAQGVLHPEPGLRDWLLPTCSRQLRVALPEKGS 
EGSLCQTQLPATPCFLPSNTVRT. It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NOs:58 and 385 and polypeptides of SEQ ID NO:227 and 502, 
described throughout the present application also pertain to the human cDNA of clone 166601, and 
the polypeptides encoded thereby. 

The polynucleotides of SEQ ID NO:58 and 385 and polypeptides of SEQ ID NO:227 and 
502 encode a transcriptional regulatory protein. Preferred polynucleotides of the invention include 
the nucleic acid sequences comprising Clone 166601, the polynucleotides comprising SEQ ID 
NO:58 and the polynucleotides comprising SEQ ID NO:385. Preferred polypeptides of the 
invention include the amino acid sequences derived from the nucleic acid sequence comprising 
Clone 166601, the polypeptides comprising the amino acid sequences of SEQ ID NO:227 and the 
polypeptides comprising the amino acid sequences of SEQ ID NO:502. 
In an embodiment of the invention, preferred polypeptides include the portion comprising 
the site of protein kinase C phosphorylation or the amino acid sequences comprising SNK or TVR 
of SEQ ID NO:227 and 502. 

In another embodiment, preferred polypeptides of the invention include the portion of the 
amino acid sequence comprising sites of myristylation or the amino acids comprising the sequence 
GLTFSN or GSEGSL of SEQ ID NO:227 or 502. 

Proteins of SEQ ID NO:268 (internal designation Clone ID:211056) and related protein of SEQ ID 
NO:530. 

The polynucleotides of SEQ ID NO:99 and SEQ ID NO:416 and polypeptides of SEQ ID 
NO:268 and SEQ ID NO:530, respectively, encode a novel human tryptophan hydroxylase, 
including the amino acid sequence hereafter referred to as nhTPH. Tryptophan is taken up by 
active transport into the neurons where it is hydroxylated to 5-hydroxytryptophan (5HTP). The 
latter is then decarboxylated to serotonin, a neurotransmitter involved in central nervous disorders, 
especially mood disorders, sleep disorders, and eating disorders. Activity of the polypeptide of the 
invention increases serotonin production and the metabolism of tryptophan. Thus 
polypeptides of the invention are useful in the in vitro production of serotonin and metabolism 
of tryptophan. As an example, an expression vector containing the polynucleotides of SEQ ID NO:99 
or SEQ ID NO:416 can be introduced into a cell line by methods known in the art such as by 
calcium precipitation; tryptophan can be supplied in the media; and serotonin produced by the cells 
can be extracted by known methods. 

The invention further relates to a method of screening for test compounds that bind hnTOH 
comprising the steps of contacting a hnTOH polypeptide with said test compound and detecting or 
measuring whether said test compound binds said hnTOH polypeptide. The invention further 
relates to a method of screening for test compounds that activate hnTOH comprising the steps of 
contacting a hnTOH polypeptide with said test compound and detecting or measuring whether said 
test compound activates said hnTOH polypeptide, for example by measuring serotonin production 
or tryptophan depletion. 

Another embodiment includes physiologically acceptable compositions of test compounds 
found to increase serotonin production, referred to as activators, in a screen. Further embodiments 
include methods to use activators that have been identified in a screen or previously known in the 
art in the preparation of physiologically acceptable formulations for use in vivo. Further preferred 
are methods to use activators in a physiologically acceptable formulation in the treatment of CNS 
disorders in which tryptophan and serotonin levels are aberrant, particularly depression, anxiety 
disorder, bipolar disorder, and eating disorders. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:99 and SEQ ID NO:416 and polypeptides of SEQ ID NO:268 and SEQ ID NO:530, described 
throughout the present application also pertain to the human cDNA of clone 211056, and the 
polypeptides encoded thereby. 

Proteins of SEQ ID NO:190 (internal designation Clone ID:147648) and related protein of SEQ ID 
NO:474. 

The polynucleotides of SEQ ID NO:21 and SEQ ID NO:357 and polypeptides of SEQ ID 
NO:190 and SEQ ID NO:474 encode a novel DNA binding polypeptide containing a leucine zipper 
pattern multimerization domain, hereafter referred to as LZP, also known as bZIP transcription 
factor basic domain signature (Hai et al., Genes Dev. 3:2083 (1989)). An embodiment of the present 
invention includes the polynucleotides, polypeptides and fragments thereof comprising the 
sequences of SEQ ID NO:21, 357, 190, and 474 of the invention. Preferred polypeptides of the 
present invention are directed to the amino acid sequences which comprise the leucine zipper 
domain selected from the following amino acids of SEQ ID NO:190 and 474 including 
LAAGAVTLGIGFFALASALWFL; PKGFFNYLTYFLAAGAVTLGIG; or 
FFALASALWFLICKRREIFQNS. It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NO:21 and SEQ ID NO:357 and polypeptides of SEQ ID NO:190 and 
SEQ ID NO:474, described throughout the present application also pertain to the human cDNA of 
clone 147648, and the polypeptides encoded thereby. 
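
The leucine zipper pattern referred to above can be illustrated with a simple heptad-repeat check. This is a minimal sketch assuming the PROSITE-style pattern L-x(6)-L-x(6)-L-x(6)-L, that is, four leucines spaced seven residues apart; the text does not specify the pattern, so this is an assumption, and `has_leucine_zipper` is a hypothetical helper name. The three fragments are those printed in the text.

```python
import re

# Assumed pattern: four leucines, each seven residues after the previous one.
LEUCINE_ZIPPER = re.compile(r"L.{6}L.{6}L.{6}L")

# Preferred fragments as printed in the text.
FRAGMENTS = [
    "LAAGAVTLGIGFFALASALWFL",
    "PKGFFNYLTYFLAAGAVTLGIG",
    "FFALASALWFLICKRREIFQNS",
]


def has_leucine_zipper(sequence: str) -> bool:
    """True if the sequence contains a complete four-leucine heptad pattern."""
    return LEUCINE_ZIPPER.search(sequence) is not None


results = {fragment: has_leucine_zipper(fragment) for fragment in FRAGMENTS}
for fragment, found in results.items():
    print(fragment, found)
```

Under this assumed pattern only the first fragment contains the complete 22-residue repeat; the other two overlap it at either end and would carry only part of the pattern on their own.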

Leucine zippers permit dimerization of various cytoplasmic hormone receptors and 
enzymes (Forman, et al., Mol Endocrinol, 3, 1610-1626 (1989)). Leucine zippers are also a 
common feature of transcription factors, where they permit homo- or heterodimerization resulting in 
tight binding to DNA strands (for reviews, see Abel, et al., Nature 341, 24-25 (1989); Jones, et al., 
Cell 61, 9-11 (1990); Lamb, et al., Trends in Biochemical Sciences 16, 417-422 (1991)). Therefore, 
preferred polypeptides of the present invention are useful tools in several areas of biotechnology, 
especially in protein engineering, where their ability to mediate homo-dimerization or 
hetero-dimerization has found several applications, including but not limited to immunochemistry, 
antibody generation, preparation of soluble oligomeric proteins, and complementation assays. The 
utility of the present invention may be further confirmed by methods described, for example, by 
Bosslet et al. (US patent 5,643,731), which describes the use of a pair of leucine zippers for in vitro 
diagnosis, in particular for the immunochemical detection and determination of an analyte in a 
biological liquid; by Tso et al. (US patent 5,932,448), which describes the use of leucine zippers for 
producing bispecific antibody heterodimers; by Conrad et al. (US patent 5,965,712), Ciardelli et al. 
(US patent 5,837,816), and Spriggs et al. (WO9410308), in which methods of preparing soluble 
oligomeric proteins using leucine zippers have been described; and by Pelletier et al. (WO9834120), 
in which methods to use leucine zipper forming sequences in protein fragment complementation 
assays to detect biomolecular interactions have been described, the disclosures of all of which are 
hereby incorporated by reference in their entireties. 

The multimerization activity of the polypeptides of the present invention containing leucine 
zipper domains may be assayed using any of the assays known to those skilled in the art including 
circular dichroism spectrum and thermal melting analyses as described in US patent 5,942,433, 
which disclosure is hereby incorporated by reference in its entirety. Alternatively, the leucine 
zipper motif in LZP could be used by those skilled in the art as a "bait protein" in a well established 
yeast two-hybrid system to identify its interacting protein partners in vivo from a cDNA 
library derived from different tissues or cell types of a given organism. Alternatively, LZP or part 
thereof could be used by those skilled in the art in mammalian cell transfection experiments. When 
fused to a suitable peptide tag such as a [His]6 tag in a protein expression vector and introduced into 
cultured cells, this expressed fusion protein can be immunoprecipitated with its potential interacting 
proteins by using an anti-tag peptide antibody. This method could be chosen either to identify the 
associated partner or to confirm the results obtained by other methods such as those just mentioned. 
In a preferred embodiment, the invention relates to compositions and methods of using the 
LZP polynucleotides and polypeptides of SEQ ID NO: and SEQ ID NO: or fragment thereof for 
preparing soluble multimeric proteins, which consist of multimers of fusion proteins containing a 
leucine zipper fused to a protein of interest, using any technique known to those skilled in the art 
including those described in international patent WO9410308, which disclosure is hereby 
incorporated by reference in its entirety. In another preferred embodiment, LZP or derivative 
thereof is used to produce bispecific antibody heterodimers as described in US patent 5,932,448, 
which disclosure is hereby incorporated by reference in its entirety. Briefly, leucine zippers capable 
of forming heterodimers are respectively linked to epitope binding components with different 
specificities. Bispecific antibodies are formed by pairwise association of the leucine zippers, 
forming a heterodimer which links two distinct epitope binding components. In still another 
preferred embodiment, LZP or part thereof or derivative thereof is used for detection and 
determination of an analyte in a biological liquid as described in US patent 5,643,731, which 
disclosure is hereby incorporated by reference in its entirety. Briefly, a first leucine zipper is 
immobilized on a solid support and the second leucine zipper is coupled to a specific binding 
partner for an analyte in a biological fluid. The two peptides are then brought into contact thereby 
immobilizing the binding partner on the solid phase. The biological sample is then contacted with 
the immobilized binding partner and the amount of analyte in the sample bound to the binding 
partner determined. In still another preferred embodiment, the LZP or part thereof may be used to 
synthesize novel nucleic acid binding proteins which are able to multimerize with proteins of 
interest, for example to inhibit and/or control cellular growth using any genetic engineering 
technique known to those skilled in the art including the ones described in US patent 5,942,433, 
which disclosure is hereby incorporated by reference in its entirety. 
In another embodiment, the invention relates to compositions and methods using the LZP or 
part thereof or derivative thereof in protein fragment complementation assays to detect 
biomolecular interactions in vivo and in vitro as described in international patent WO9834120, 
which disclosure is hereby incorporated by reference in its entirety. Such assays may be used to 
study the equilibrium and kinetic aspects of molecular interactions including protein-protein, 
protein-nucleic acid, protein-carbohydrate and protein-small molecule interactions, for screening 
cDNA libraries for binding to a target protein with unknown proteins or libraries of small organic 
molecules for biological activity. 

Still another object of the present invention relates to the use of the LZP or part thereof for 
identifying new leucine zipper domains using any techniques for detecting protein-protein 
interaction known to those skilled in the art. Among the traditional methods which may be 
employed are co-immunoprecipitation, crosslinking and co-purification through gradients or 
chromatographic columns of cell lysates. Once isolated as a protein interacting with the LZP, such 
an intracellular protein can be identified (e.g. its amino acid sequence determined) and can, in turn, 
be used, in conjunction with standard techniques, to identify other proteins with which it interacts. 
The amino acid sequence thus obtained may be used as a guide for the generation of oligonucleotide 
mixtures that can be used to screen for gene sequences encoding such intracellular proteins. 
Screening may be accomplished, for example, by standard hybridization or PCR techniques. 
Techniques for the generation of oligonucleotide mixtures and the screening are well-known. (See, 
e.g., Ausubel et al., eds., Current Protocols in Molecular Biology, J. Wiley and Sons (New York, 
NY 1993) and PCR Protocols: A Guide to Methods and Applications, 1990, Innis, M. et al., eds. 
Academic Press, Inc., New York). 

Alternatively, methods may be employed which result in the simultaneous identification of 
genes which encode the intracellular proteins that can dimerize with the LZP or part thereof using 
any technique known to those skilled in the art. These methods include, for example, probing cDNA 
expression libraries, in a manner similar to the well known technique of antibody probing of 
lambda gt11 libraries, using as a probe a labeled version of the LZP or part thereof, or fusion 
protein, e.g., the LZP or part thereof fused to a marker (e.g., an enzyme, fluor, luminescent protein, 
or dye), or an Ig-Fc domain (for technical details on screening of cDNA expression libraries, see 
Ausubel et al., supra). Alternatively, another method for the detection of protein interaction in vivo, 
the two-hybrid system, may be used. 

Proteins of SEQ ID NO:318 (internal designation Clone ID:124608) and related protein of SEQ ID 
NO:556. 

The polynucleotides of SEQ ID NO:149 and SEQ ID NO:442 and polypeptides of SEQ ID 
NO:318 and SEQ ID NO:556 encode an RNA-binding protein, hgRBP, which functions in RNA 
processing and protein expression. The preferred compositions of SEQ ID NO:318 and 556 include 
MERPDKAALNALQPPEFRNESSLASTLKTLLFFTALMITVPIGLYFTTKSYIFEGALGMSNR 
DSYFYAAIVAVVAVFFVVLALFVYVAWNEGSRQWREGKQD. 

Further preferred polypeptides include those of SEQ ID NO:318 or SEQ ID NO:556 
comprising an N-myristoylation site or the amino acid sequence at positions 43-48 or comprising 
the amino acid sequence GLYFTT which targets the protein to the membrane of the endoplasmic 
reticulum for function of hgRBP in translation of cellular mRNA into protein. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:149 and SEQ ID NO:442 and polypeptides of SEQ ID NO:318 and SEQ ID NO:556, described 
throughout the present application also pertain to the human cDNA of clone 124608, and the 
polypeptides encoded thereby. 

Protein of SEQ ID NO:337 (internal designation Clone ID:113448) 

The polynucleotides of SEQ ID NO:168 and related SEQ ID NO:454 and polypeptides of 
SEQ ID NO:337 encode a novel human RNA-binding protein involved in RNA processing and 
protein expression which is related to Clone ID:183902 and Clone ID:635993. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:168 and related SEQ ID NO:454 and polypeptides of SEQ ID NO:337, described throughout 
the present application also pertain to the human cDNA of clone 113448, and the polypeptides 
encoded thereby. 

Protein of SEQ ID NO:328 (internal designation Clone ID:183902) 

Polynucleotides of SEQ ID NO:159 and related SEQ ID NO:450 and polypeptides of SEQ 
ID NO:328 encode a novel human RNA-binding protein involved in RNA processing and protein 
expression which is related to Clone ID:113448 and Clone ID:635993. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:159 and related SEQ ID NO:450 and polypeptides of SEQ ID NO:328, described throughout 
the present application also pertain to the human cDNA of clone 183902, and the polypeptides 
encoded thereby. 

Protein of SEQ ID NO:329 (internal designation Clone ID:635993) 

Polynucleotides of SEQ ID NO:160 and related SEQ ID NO:451 and polypeptides of SEQ 
ID NO:329 encode a novel human RNA-binding protein involved in RNA processing and protein 
expression which is related to Clone ID:183902 and Clone ID:113448. It will be appreciated that 
all characteristics and uses of the polynucleotides of SEQ ID NO:160 and related SEQ ID 
NO:451 and polypeptides of SEQ ID NO:329, described throughout the present application also 
pertain to the human cDNA of clone 635993, and the polypeptides encoded thereby. 

Many eukaryotic proteins that bind single-stranded RNA contain one or more copies of a
putative RNA-binding domain of about 90 amino acids. This is known as the eukaryotic RNA-binding
region, RNP-1 signature or RNA recognition motif (RRM) (Bandziulis et al., Genes Dev.
3:431 (1989); Swanson et al., Trends Biochem. Sci. 13:86-91 (1988)). RRMs are found in a
variety of RNA binding proteins, including heterogeneous nuclear ribonucleoproteins (hnRNPs),
proteins implicated in regulation of alternative splicing, and protein components of small nuclear
ribonucleoproteins (snRNPs). The polypeptides of SEQ ID NO:337, 328, and 329 are novel
human RNA binding proteins, hereafter referred to as ghRBP, each of which contains one copy of an RRM.
As is further characteristic of a protein which binds to nucleic acids, ghRBP contains a zinc finger motif
comprising the amino acid sequence. Preferred polynucleotides of the invention include
polynucleotides comprising the nucleic acids of SEQ ID NO:159, 160, 168, 450, 451, and 454.
Preferred polypeptides of the invention are polypeptides comprising the amino acids of SEQ ID
NO:337, 328 and 329. It will be appreciated that all characteristics and uses of the polynucleotides
of SEQ ID NO:159, 160, 168, 450, 451, and 454 described throughout the present application also
pertain to the human cDNA of Clone ID:183902, Clone ID:635993 and Clone ID:113448.

Preferred amino acids of the invention are residues which comprise the RNA-binding
domain or portion thereof. Preferred amino acid sequences are selected from the following set of
sequences including

AFVRRXPWTAASSQLKEHFAQFGHVRRCILPFDKETGFHRGLGWVQFSSEEGLRNALQQE
NHIIDGVKVQV;

SINQPVAFVRRXPWTAASSQLKEHFAQFGHVRRCILPFDKETGFHRGLGWVQFSSEEGLRN
ALQQENHIID;

PWTAASSQLKEHFAQFGHVRRCILPFDKETGFHRGLGWVQFSSEEGLRNALQQENHIIDGV
KVQVHTRRP. Further preferred polypeptides of the invention include any fragment of SEQ
ID NO: which binds to RNA.

An embodiment of the invention relates to methods of using the polypeptides of the
invention to bind to RNA molecules in vitro by techniques that are known in the art. Preferred use
of the polypeptides of the invention includes extraction of RNA from biological samples, chemical
reagents, cell homogenates and tissue homogenates. Further utility of the polypeptides of the
present invention or part thereof may be further confirmed by binding methods described in
Trifillis, et al., RNA 5(8):1071-82 (1999) and U.S. Patent 6,107,029, which disclosures are hereby
incorporated by reference in their entireties.

Proteins of SEQ ID NO:248 (internal designation Clone ID:199782), SEQ ID NO:249 (internal
designation Clone ID:821212), SEQ ID NO:250 (internal designation Clone ID:202863) and related
protein of SEQ ID NO:518.

The polynucleotides of SEQ ID NOs:79, 80, 81 and 401 and polypeptides of SEQ ID
NOs:248, 249, 250 and 518 encode human RNA-associated polypeptides which act as splicing
factors. It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID
NOs:79, 80, 81 and 401 and polypeptides of SEQ ID NOs:248, 249, 250 and 518, described
throughout the present application also pertain to the human cDNA of clones 199782, 821212, and
202863, and the polypeptides encoded thereby.

The translation of genetic information into protein depends on RNA. The first step in this
process is the transcription of DNA into RNA, which retains all the genetic information encoded in
the DNA. The RNA transcript undergoes various processing steps which include splicing and
polyadenylation. The mature RNA transcript is translated into protein by the ribosomal machinery.
Nascent RNA transcripts are spliced in the nucleus by the spliceosomal complex which catalyzes
the removal of introns and the rejoining of exons. At least 40 splicing factors have been identified
and interactions of these factors are important in the conformational changes needed for the
enzymatic removal of introns and religation of the exons. Both protein and RNA components are
involved in the spliceosome assembly and the splicing reaction. There are two distinct catalytic steps
involved in the RNA splicing reaction, each with distinct proteins and RNA species. Alternative splicing
factors include developmentally regulated proteins that play key roles in developmental processes
such as pattern formation and sex determination (Hodgkin, J. et al. (1994)
Development 120:3681-3689). Alternative splicing is also involved in the tissue specific expression
of isoforms of proteins, including structural proteins and enzymes.

An embodiment of the present invention relates to compositions of the polynucleotides of
SEQ ID NO:79, 80, 81 and 401 and polypeptides of SEQ ID NO:248, 249, 250 and 518. Preferred
amino acids of the invention comprise the zinc finger region or fragment thereof and are selected
from the following sequences of amino acids from SEQ ID NO:248, 249, 250 and 518 including
GACENCGAMTHKKKDCFE; NSIITKYRKGACENCGAM; or THKKKDCFERPRRVGAKF.

The polypeptides of SEQ ID NO:248, 249, 250 and 518 are involved in the spliceosome
complex and function in RNA processing. Alternatively, the polypeptides of
the present invention are involved in RNA processing and thus involved in protein expression. A
preferred embodiment of the invention relates to a method of using the polynucleotides of
SEQ ID NO:79, 80, 81 and 401 in vitro. A preferred method of use relates to
introduction of said polypeptides or fragments thereof into cells by techniques known in the art such
as transfection or microinjection. Further preferred are methods to use the polynucleotides of the
invention to alter protein expression in the given cell. Alternatively, polypeptides of the present
invention can be used in combination with reagents known in the art to alter protein expression in
cell free expression systems, mammalian expression systems, insect expression systems, or bacterial
expression systems. Furthermore, methods to use the polynucleotides or polypeptides of the present
invention to increase or decrease protein expression are preferred. The utility of the polypeptides of
the invention or part thereof may be further confirmed using methods described in US Patent
6020164 and Chua and Reed, Genes Dev. 13:841-850 (1999), which disclosures are hereby
incorporated by reference in their entireties.

In another embodiment, methods to screen for inhibitors and activators of the polypeptides
of the invention are preferred. In another embodiment, molecules or compounds which are
identified in such a screen are further preferred. Further preferred are compounds which activate or
inhibit activity of the polypeptides of the current invention. Activity of the polypeptides of the
invention is modified by phosphorylation at cAMP and cGMP dependent phosphorylation sites
(including 3-6; 55-58; 107-110) and casein kinase II phosphorylation sites (including 33-36; 58-61;
126-129). Preferred activators include but are not limited to compounds which promote
accumulation of intracellular cAMP and cGMP. Further preferred activators include those
compounds which activate casein kinase II. Preferred inhibitors are those compounds which inhibit
intracellular cAMP and cGMP accumulation, or those compounds which promote cAMP and cGMP
degradation. Further preferred inhibitors include compounds which promote the deactivation of
casein kinase II. Furthermore, inhibitors and activators of the polypeptides of the present invention
include compounds known in the art as well as compounds to be identified by the method of
screening. Furthermore, compounds that inhibit or activate the activity of the polypeptides of the
present invention by means other than phosphorylation or dephosphorylation are also preferred.

In another embodiment, a method for the use of the polynucleotides or polypeptides of the
present invention in the treatment, prevention, attenuation or diagnosis of disorders of RNA
processing or protein processing is preferred. Such disorders are selected from a group which
includes but is not limited to cancers such as adenocarcinoma, leukemia, sarcoma, teratocarcinoma,
and any disorder associated with cell growth and differentiation, embryogenesis, and
morphogenesis involving any tissue, organ, or system, e.g., the brain, adrenal gland, or reproductive
system.

Proteins of SEQ ID NO:256 (internal designation Clone ID:822794), SEQ ID NO:257 (internal
designation Clone ID:337572) and related protein of SEQ ID NO:521.

The polynucleotides of SEQ ID NO:87, 88 and 407 and polypeptides of SEQ ID NO:256,
257 and 521 encode human nuclear polypeptides which interact with transcription factors of the
Signal Transducers and Activators of Transcription (STAT) family of proteins involved in the
regulation of cell division. It will be appreciated that all characteristics and uses of the
polynucleotides of SEQ ID NO:87, 88 and 407 and polypeptides of SEQ ID NO:256, 257 and 521,
described throughout the present application also pertain to the human cDNA of clones 822794 and
337572, and the polypeptides encoded thereby.

STATs are pleiotropic transcription factors which mediate cytokine-stimulated gene
expression in multiple cell populations (Levy, Cytokine Growth Factor Rev., 8:81 (1997)). All
STAT proteins contain a DNA binding domain, a Src homology 2 (SH2) domain, and a
transactivation domain necessary for transcriptional activation of target gene expression. Janus
kinases (JAK), including JAK1, JAK2, Tyk, and JAK3, are cytoplasmic protein tyrosine kinases
(PTKs) which play pivotal roles in initiation of cytokine-triggered signaling events by activating the
cytoplasmic latent forms of STAT proteins via tyrosine phosphorylation on a specific tyrosine
residue near the SH2 domain (Ihle et al., Trends Genet., 11:69 (1995); Darnell, Science
277(5332):1630 (1997); Johnston et al., Nature, 370:1513 (1994)). Tyrosine phosphorylated STAT
proteins dimerize through specific reciprocal SH2-phosphotyrosine interactions and translocate
from the cytoplasm to the nucleus where they stimulate the transcription of specific target genes by
binding to response elements in their promoters (Leonard, Nature Medicine, 2:968 (1996); Zhong
et al., PNAS USA, 91:4806 (1994); Darnell, Science, 277:1630 (1997)).

In an embodiment of the present invention, compositions of the polynucleotides and
polypeptides or fragments thereof of SEQ ID NO:87, 88 and 407 and SEQ ID NO:256, 257 and 521,
respectively, are included. Further preferred are polypeptides of the present invention which interact
with activated STAT3, but may also interact with STAT1, STAT2 or other STAT homologues.
Preferred polypeptides of the invention act to inhibit or decrease the activity of STATs. Further
preferred amino acids of SEQ ID NO:256, 257 and 521 include the SAP domain
VSSFRVSELQVLLGFAGRNKSGRKHDLLMRALHLL. Activation of cytokine receptors by
their cognate ligands activates JAKs which, in turn, activate STATs. Therefore cytokines and other
hormones which signal through cytokine-like receptors may be modulated by polypeptides or
polynucleotides of the present invention. Cytokines and other hormones which can thus be
modulated by the present invention include but are not limited to interferons, interleukins, prolactin,
and growth hormone. The utility of the polypeptides of the present invention or part thereof may be
further confirmed using the methods described in WIPO Publication WO9928465 which disclosure
is hereby incorporated by reference in its entirety.

The polypeptides of this invention can be used in a method of inhibiting the activity of
STAT proteins in a cell in vitro, the method comprising introducing a nucleic acid into the cell,
wherein the nucleic acid comprises a nucleotide sequence encoding the amino acid sequence of
SEQ ID NO:256, 257 and 521 or the amino acid sequence of SEQ ID NO:256, 257 and 521 with
one or more conservative amino acid alterations, and wherein the nucleic acid expresses the amino
acid sequence in an amount and for a time sufficient for the amino acid sequence to specifically
bind to STAT proteins and to decrease STAT activity, thereby decreasing STAT activity in the cell.

Suitable compositions of polypeptides or polynucleotides of the present invention are useful
as a method of treatment of pathologies such as diseases, syndromes, or other undesirable
conditions resulting from defects in cell cycle progression. Such cell cycle defects may result from
defects in the regulation of activated STAT or an upstream factor such as activated JNKs or
activated cytokine receptors. Alternatively, polypeptides or polynucleotides of the present
invention may be used in a method of treating pathologies resulting from defects in cell cycle
progression due to defects in a step "downstream" of STAT regulation of cell cycle progression. In
preferred embodiments, agonists of polypeptides or polynucleotides of the present invention are
useful in the treatment of pathologies such as but not limited to hyperproliferative diseases such as
cancer (e.g., leukemia, lymphoma, breast cancer, colon cancer, prostate cancer, Wilms' tumor),
coronary artery disease, pulmonary vascular obstructive disease, either primary or as a feature of
Eisenmenger's syndrome, and other disorders of abnormal cellular proliferation. Cells to be treated
include but are not limited to hyperproliferative cells, cancer cells, vascular smooth muscle cells,
endothelial cells, and gametes.

In some embodiments of the invention, antagonists of the polypeptides or polynucleotides
of the present invention are used to stimulate, promote, or facilitate progression through the cell
cycle, such as in the cellular regeneration of terminally differentiated cardiac myocytes or tissues,
e.g., striated muscle myocytes. For example, this could allow restoration of damaged myocardium
after cardiac injury, myocardial infarction, myocarditis, cardiomyopathy, trauma, as a consequence
of cardiac surgery, etc., or repletion of striated muscle exhausted by muscular dystrophy.

In further embodiments, expression of the polypeptides encoded by the nucleic acids is
expected to prevent, ameliorate, or lessen the cell cycle defect of the host cell, or to restore normal
cell cycle progression of the host cell. Whether provided via nucleic acid or polypeptides delivered
directly to cells, the therapeutic formulations of the invention can also be used as adjuncts to other
forms of therapy, including but not limited to chemotherapy and radiation therapy.

Protein of SEQ ID NO:330 (internal designation Clone ID:398703) and related protein of SEQ ID
NO:330.

The polynucleotides of SEQ ID NO:161 and SEQ ID NO:452 and polypeptides of SEQ ID
NO:330 encode a novel human deubiquitinating enzyme (GNP:AF017306). Deubiquitinating
enzymes serve a number of functions (Hochstrasser, Curr Opin Cell Biol 4:1024 (1992); Rose, In:
Ubiquitin, Plenum Press, New York (1988)). First, ubiquitin must be cleaved from a set of
biosynthetic precursors, which occur either as a series of ubiquitin monomers in head-to-tail linkage
or as fusions to certain ribosomal proteins (Finley & Chau, Annu Rev Cell Biol 7, 25-69 (1991)).
Secondly, ubiquitin must be recycled from intracellular conjugates, both to maintain adequate pools
of free ubiquitin and, in principle at least, to reverse the modification of inappropriately targeted
proteins. Finally, deubiquitinating reactions may be integral to the degradation of ubiquitinated
proteins by the 26S proteasome, a complex ATP-dependent enzyme whose exact composition and
range of activities remain poorly characterized (Hershko & Ciechanover, Annu Rev Biochem 61,
761-807 (1992); Hadari et al., J Biol Chem 267, 719-727 (1992); Murakami et al., Nature 360,
597-9 (1992); Rechsteiner, J. Biol. Chem. 268, 6065-6068 (1993)).

An embodiment of the invention includes preferred polypeptides with ubiquitin-specific
protease activity with a novel N-terminus of Clone ID:398703 comprising the amino acid sequence
MCTTSLPCPIIMEPWGLATTKAAYVLFYQRRDDEFYKTPSLSSSGSSDGGTRPSSSQQGFGD
DEACSMDTN encoded by
ATGTGTACGACCTCATTGCCGTGTCCAATCATTATGGAGCCATGGGGGTTGGCCACTAC
TAAAGCAGCTTATGTGCTATTTTACCAACGTCGAGATGATGAATTTTATAAGACACCTT
CACTTAGCAGTTCTGGTTCCTCTGATGGAGGGACACGACCAAGCAGCTCTCAGCAGGG
CTTTGGGGATGATGAGGCTTGCAGCATGGACACCAACTAA of SEQ ID NO:161.

The preferred polypeptides of the invention are those which prevent or reverse
ubiquitination of cellular proteins in vitro or in vivo. Further preferred are polypeptides of the
invention which prevent or reverse ubiquitination of extracellular proteins in vitro or in vivo.
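
The correspondence between the N-terminal peptide and the nucleotide sequence quoted above can be checked by translating the open reading frame with the standard genetic code. The following is a minimal sketch only; the names `translate`, `ORF` and `PEPTIDE` are illustrative and are not taken from the application, and the two strings are copied from the text as printed.

```python
# Standard genetic code, built in the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(orf: str) -> str:
    """Translate an open reading frame, stopping at the first stop codon."""
    orf = "".join(orf.split()).upper()
    protein = []
    for i in range(0, len(orf) - 2, 3):
        amino_acid = CODON_TABLE[orf[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# Nucleotide and peptide sequences as quoted in the text (hypothetical variable names).
ORF = (
    "ATGTGTACGACCTCATTGCCGTGTCCAATCATTATGGAGCCATGGGGGTTGGCCACTAC"
    "TAAAGCAGCTTATGTGCTATTTTACCAACGTCGAGATGATGAATTTTATAAGACACCTT"
    "CACTTAGCAGTTCTGGTTCCTCTGATGGAGGGACACGACCAAGCAGCTCTCAGCAGGG"
    "CTTTGGGGATGATGAGGCTTGCAGCATGGACACCAACTAA"
)
PEPTIDE = (
    "MCTTSLPCPIIMEPWGLATTKAAYVLFYQRRDDEFYKTPSLSSSGSSDGGTRPSSSQQGFGD"
    "DEACSMDTN"
)

if __name__ == "__main__":
    # Compare the conceptual translation with the quoted peptide.
    print(translate(ORF) == PEPTIDE)
```

The comparison is a consistency check on the quoted strings only; it says nothing about protease activity, which would be assessed by the assays referenced elsewhere in this section.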

The polynucleotides of SEQ ID NO:161 and SEQ ID NO:452 encode polypeptides of SEQ
ID NO:330 which contain protein domains or motifs including but not limited to a Protein kinase C
phosphorylation site comprising the amino acid fragment SAR, and an N-myristylation site
comprising amino acid fragment GLNMSE. Further preferred amino acids of SEQ ID NO:330
include the Ubiquitin carboxyl-terminal hydrolases family 2 signature or amino acid sequence
YDLIAVSNHYGAMGVGHY.

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID
NO:161 and SEQ ID NO:452 and polypeptides of SEQ ID NO:330, described throughout the
present application also pertain to the human cDNA of clone 398703, and the polypeptides encoded
thereby.

The utility of the polynucleotides and polypeptides of the present invention or part thereof
may be further confirmed using methods which assess activity or function of deubiquitinating
enzymes described in United States Patents 5391490 and 5565352 which disclosures are hereby
incorporated by reference in their entireties.

Proteins of SEQ ID NO:277 (internal designation Clone ID:653966) and related protein of SEQ ID
NO:535.

The polynucleotides of SEQ ID NO:108 and SEQ ID NO:421 and polypeptides of SEQ ID
NO:277 and SEQ ID NO:535 encode human liver fatty acid binding protein (L-FABP) comprising
the amino acid sequence of SEQ ID NO:277 and 535. The amino acid sequences of SEQ ID NO:277
and 535 are the same as human L-FABP (Genbank accession GNP:M10617; Lowe et al., JBC
260:3413-17 (1985)) and homologous to human FABP (Genbank accession GNP:M10050). The
polypeptides of the present invention belong to the FABP/P2/CRBP/CRABP family of transporters
and functionally bind to free fatty acids and derivatives thereof. L-FABP is normally expressed in
the cytoplasm of hepatocytes, but preferred embodiments include use of the polypeptides of the
present invention as extracellular polypeptides. Further preferred embodiments include use of the
polypeptides of the present invention as serum or plasma polypeptides. Further preferred
embodiments use polypeptides of the invention in vitro. Still further preferred embodiments
include use of the polypeptides of the present invention in vivo.

Preferred amino acids of the invention include the lipocalin domain, from residues 2 to 127, or
polypeptides comprising the amino acid sequence
SFSGKYQLQSQENFEAFMKAIGLPEELIQKGKDIKGVSEIVQNGKHFKFTITAGSKVIQNEFT
VGEECELETMTGEKVKTVVQLEGDNKLVTTFKNIKSVTELNGDIITNTMTLGDIVFKRISKR
I. The polypeptides of SEQ ID NO:277 and SEQ ID NO:535 contain a cytosolic fatty-acid binding
protein signature comprising the amino acid sequence GKYQLQSQENFEAFMKAI which
functions in the polypeptide's ability to bind small hydrophobic molecules, such as lipids, steroid
hormones, and retinoids. Preferred amino acids of SEQ ID NO:277 and SEQ ID NO:535 include
GKYQLQSQENFEAFMKAI, MSFSGKYQLQSQENFEAF, and LQSQENFEAFMKAIGLPE.

Phosphorylation status modulates the activity of L-FABP. Preferred polypeptides of the
invention include the amino acid sequence comprising the sites of cAMP- and cGMP-dependent
protein kinase phosphorylation including residues of SEQ ID NO:277 and 535 comprising the
sequence KRIS.

Further preferred polypeptides of the invention include the amino acid sequence comprising
the sites of Protein kinase C phosphorylation including residues at positions 4 to 6, 94 to 96, and
124 to 126 of SEQ ID NO:277 and 535. Still further preferred polypeptides of SEQ ID NO:277 and
535 include the amino acid sequence comprising SGK, TFK, and SKR.

Further preferred polypeptides of SEQ ID NO:277 and 535 include the amino acid
sequence comprising Casein kinase II phosphorylation sites. Preferred amino acids of SEQ ID
NO:277 and 535 include positions 64 to 67, 100 to 103, and 114 to 117. Further preferred amino
acids comprise the sequences TVGE, SVTE, and TLGD.
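
The phosphorylation sites listed above can be located by scanning the L-FABP sequence with consensus patterns. The sketch below is illustrative only and rests on two assumptions: the patterns are the commonly used PROSITE-style consensus motifs for these site types (the text names the site types but not the patterns), and the sequence string is the 127-residue form (initiator Met plus residues 2 to 127) with TVVQ at residues 81 to 84, which is the reading consistent with the positions quoted above and should be checked against SEQ ID NO:277. The names `PATTERNS`, `L_FABP` and `scan` are hypothetical.

```python
import re

# PROSITE-style consensus patterns (assumed; not stated in the application).
PATTERNS = {
    "cAMP/cGMP-dependent protein kinase": r"[RK]{2}.[ST]",
    "protein kinase C": r"[ST].[RK]",
    "casein kinase II": r"[ST].{2}[DE]",
}

# Assumed 127-residue L-FABP sequence: initiator Met plus residues 2 to 127.
L_FABP = (
    "MSFSGKYQLQSQENFEAFMKAIGLPEELIQKGKDIKGVSEIVQNGKHFKFTITAGSKVIQNEFT"
    "VGEECELETMTGEKVKTVVQLEGDNKLVTTFKNIKSVTELNGDIITNTMTLGDIVFKRISKR"
    "I"
)


def scan(sequence, pattern):
    """Return (start, end, fragment) for every match; 1-based, inclusive, overlaps allowed."""
    hits = []
    # A lookahead with a capture group reports overlapping matches as well.
    for match in re.finditer(f"(?=({pattern}))", sequence):
        fragment = match.group(1)
        start = match.start() + 1
        hits.append((start, start + len(fragment) - 1, fragment))
    return hits


if __name__ == "__main__":
    for name, pattern in PATTERNS.items():
        print(name, scan(L_FABP, pattern))
    # Residue 105 distinguishes the N-isoform (Asn) from the D-isoform (Asp).
    print("residue 105:", L_FABP[104])
```

A consensus match indicates only a candidate site; whether a given residue is phosphorylated in the polypeptide would have to be established experimentally.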

A preferred polypeptide of SEQ ID NO:277 and 535 is one in which the amino acid
asparagine (Asn) is located at residue 105, further referred to as the N-isoform. Further preferred is
the polypeptide of SEQ ID NO:277 and 535 in which the amino acid aspartate (Asp) is located at
residue 105, further referred to as the D-isoform. The rat homologue of the human D-isoform of the
present invention was shown to have a greater affinity for lysophospholipids, prostaglandins,
retinoids, bilirubin and bile salts compared to the rat homologue of the human N-isoform of the
present invention by methods described by DiPietro and Santome, Biochim Biophys Acta
1478:186-200 (2000), which disclosure is hereby incorporated by reference in its entirety. The rat
homologues share only 82% identity with the human D- and N-isoforms; it is therefore not
predictable that the human D-isoform has equal or greater affinity for lysophospholipids,
prostaglandins, retinoids, bilirubin, bile salts and fatty acids compared to the human N-isoform.

Further preferred polypeptides of the present invention include the D-isoform polypeptide
and fragments thereof which have an equal or at least 10%, 20%, 30%, 40%, 50%, 60% or 75%
greater affinity for fatty acids, and lipophilic compounds selected from a group including but not
limited to lysophospholipids, prostaglandins, retinoids, bilirubin, bile salts, steroid hormones (such
as testosterone and estradiol), and cholesterol compared to the N-isoform.

Another embodiment of the invention includes polynucleotides or polypeptides of the
invention or fragments thereof which bind lipophilic compounds selected from a group including
but not limited to free fatty acids, lysophospholipids, prostaglandins, retinoids, bilirubin, bile salts,
steroid hormones (such as testosterone and estradiol), and cholesterol in serum or plasma. Further
preferred are polypeptides of the invention which bind lipophilic compounds in serum or plasma
separated from whole blood in a process of purifying serum or plasma for use in vitro or in vivo.
Further preferred are polypeptides of the invention which bind lipophilic compounds in serum or
plasma in vivo.

In a further embodiment, polynucleotides or polypeptides of the invention or fragments
thereof, in physiologically appropriate formulations, are useful in the prevention, treatment or
attenuation of conditions in which lipophilic compounds are elevated in the serum of mammals,
preferably humans. Such conditions are selected from a group which includes but is not limited to
obesity, hyperlipidemia, hypercholesterolemia, hypertriglyceridemia, diabetes type I (IDDM),
diabetes type II (NIDDM), atherosclerosis, and hypertension.

Mammary-derived growth inhibitor (MDGI) and heart-fatty acid binding protein (FABP),
which belong to the FABP family, specifically inhibit growth of normal mouse mammary epithelial
cells (MEC), promote morphological differentiation, stimulate their own expression and promote
milk protein synthesis (US Patent 5977309, 24 March 1995). In further preferred embodiments,
polypeptides of the invention include those which locally signal growth cessation and stimulate
differentiation of the developing epithelium. Further preferred polypeptides of the invention
suppress the mitogenic effects of EGF family members, and inhibit c-fos, c-myc and c-ras
expression.

In a further aspect of the present invention, there is provided a method for producing such
polypeptide by recombinant techniques comprising culturing recombinant prokaryotic and/or
eukaryotic host cells, containing the human fatty acid binding polypeptides or polynucleotides of the
invention, under conditions promoting expression of said protein and subsequent recovery of
said protein.

In a further embodiment of the present invention, there is provided a method for utilizing
such polypeptides, or polynucleotides of the invention, for therapeutic purposes, for example, as a
cell growth inhibitor and to stimulate differentiation of various responsive types
of tissues and cells in vitro. Further preferred are methods for use of polypeptides, or
polynucleotides of the invention, in appropriate physiological form, for therapeutic purposes to
inhibit cell proliferation or to induce cell differentiation in mammals, preferably humans.

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID
NO:108 and SEQ ID NO:421 and polypeptides of SEQ ID NO:277 and SEQ ID NO:535, described
throughout the present application also pertain to the human cDNA of clone 653966, and the
polypeptides encoded thereby.

Proteins of SEQ ID NO:313 (internal designation Clone ID:633418), SEQ ID NO:314 (internal
designation Clone ID:422878) and related protein of SEQ ID NO:552.

The polynucleotides of SEQ ID NO:144, 145 and 438 and polypeptides of SEQ ID NO:313,
314 and 552 encode a cleavage stimulation factor important in mRNA processing and protein
expression. Protein kinase C phosphorylation increases activity of said polypeptides and preferred
amino acids include SEK and SGR. Further, sites of tyrosine kinase phosphorylation increase
activity of said polypeptides and preferred amino acids of SEQ ID NO:314 and 552 include
KKLEENPY.

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID
NO:144, 145 and 438 and polypeptides of SEQ ID NO:313, 314 and 552, described throughout the
present application also pertain to the human cDNA of clones 633418 and 422878, and the
polypeptides encoded thereby.

Proteins of SEQ ID NO:219 (internal designation Clone ID:589848), SEQ ID NO:220 (internal
designation Clone ID:211883), SEQ ID NO:221 (internal designation Clone ID:642603), SEQ ID
NO:222 (internal designation Clone ID:193316), and related protein of SEQ ID NO:497.

Polynucleotides of SEQ ID NO:50, 51, 52, 53, 380 and polypeptides of SEQ ID NO:219,
220, 221 and 497 encode RNA associated proteins with a ribosomal L34 domain comprising the
amino acid sequence NEYQPSNIKRKNKHGWVRRLXTPAGXXXILRRMLKGRKSLSH or
NEYQPSNIKRKNKHGWVRRLXTPAGVQVILRRMLKGRKSLSH. It will be appreciated that
all characteristics and uses of the polynucleotides of SEQ ID NOs:50, 51, 52, 53, 380 and
polypeptides of SEQ ID NOs:219, 220, 221 and 497, described throughout the present application
also pertain to the human cDNA of clones 589848, 211883, 642603 and 193316, and the
polypeptides encoded thereby.

Proteins of SEQ ID NO:302 (internal designation Clone ID:1000891255) and related protein of
SEQ ID NO:543.

Polynucleotides of SEQ ID NO:133 and 429 and polypeptides of SEQ ID NO:302 and 543
encode human ribosomal protein, hRIBPRT. An embodiment of the invention includes the
compositions of the polypeptides of SEQ ID NO:302 and 543, comprising the amino acid sequence
MVAAKKTKKSLESIKSRLQLVMKSGKYVLGYKQTLKMIRQGKAKLVILANNCPALRKSEI
EYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDPXDSXIIRSMPEQTGEK, and the
polynucleotides of SEQ ID NO:133 and 429, respectively, which encode human ribosomal protein,
hRIBPRT. The polypeptides of the invention contain the ribosomal protein
L30e/L7Ae/S12e/Gadd45 signature (Koonin EV, J Mol Med 75:236-238 (1997) and Nakanishi et al.,
Gene 35:289-96 (1985)).

Preferred polypeptides of the invention include the amino acid sequence comprising
KSLESIKSRLQLVMKSGKYVLGYKQTLKMIRQGKAKLVILANNCPALRKSEIEYYAMLAK
TGVHHYSGNNIELGTACGKYYRVCTLAIIDPXDSXIIR;
KSLESIKSRLQLVMKSGKYVLGYKQTLKMIRQGKAKLVILANNCPALRK; and
SEIEYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDPXDSXIIR. Further preferred
amino acids of the invention include sites of PKC phosphorylation, comprising the amino acid
sequences of SEQ ID NO:302 and 543 including TKK (positions 7-9); SIK (positions 13-15);
SGK (positions 24-26); and TLK (positions 34-36). Further preferred amino acids of the invention
include sites of Casein Kinase II phosphorylation, comprising the amino acid sequences SEIE
(positions 58-61) and SMPE (positions 107-110).
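
The coordinates of the quoted fragments can be cross-checked against the hRIBPRT sequence with a simple substring lookup. This is a minimal sketch; `locate`, `HRIBPRT` and `FRAGMENTS` are hypothetical names, the sequence is the one quoted earlier in this section (X denoting an undetermined residue), and only the first occurrence of each fragment is reported.

```python
def locate(sequence: str, fragment: str):
    """Return the 1-based inclusive (start, end) of the first occurrence, or None."""
    index = sequence.find(fragment)
    if index < 0:
        return None
    return index + 1, index + len(fragment)


# hRIBPRT sequence as quoted in the text (X marks an undetermined residue).
HRIBPRT = (
    "MVAAKKTKKSLESIKSRLQLVMKSGKYVLGYKQTLKMIRQGKAKLVILANNCPALRKSEI"
    "EYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDPXDSXIIRSMPEQTGEK"
)

# Fragments named in the text as candidate PKC and Casein Kinase II sites.
FRAGMENTS = ["TKK", "SIK", "SGK", "TLK", "SEIE", "SMPE"]

if __name__ == "__main__":
    for fragment in FRAGMENTS:
        print(fragment, locate(HRIBPRT, fragment))
```

Because coordinates are inclusive, a three-residue fragment spans exactly three positions and a four-residue fragment spans four, which makes a mistyped range easy to spot.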

In another embodiment, the proteins of SEQ ID NO:302 and 543 can be used to bind to 

10 nucleic acids, preferably RNA, alone or in combination with other substances. For example, the 
proteins of the invention or part thereof can be added to a sample containing RNAs in optimum 
conditions for binding, and allowed to bind to RNAs. In a preferred such embodiment, the proteins 
of the invention or part thereof may be used to purify mRNAs, for example to specifically isolate 
RNA, e.g. from a specific cell type or from cells grown under particular conditions. Such RNAs 

15 could then be reverse transcribed and cloned, could be analyzed for relative expression analyses, 
etc. In addition, such methods may be used to specifically remove RNA from a sample, for 
example during the purification of DNA. To carry out any of these methods, the proteins of the 
invention or part thereof may be bound to a chromatographic support, either alone or in 
combination with other RNA binding proteins, to form an affinity chromatography column. A 

20 sample containing a mixture of nucleic acids to purify is then run through the column. 

Immobilizing the proteins of the invention or part thereof on a support is particularly advantageous 
for embodiments in which the method is to be practiced on a commercial scale. This 
immobilization facilitates the removal of RNAs from the batch of resin-coupled protein after 
binding, and allows subsequent re-use of the protein. Immobilization of the proteins of the 

25 invention or part thereof can be accomplished, for example, by inserting any matrix binding domain 
in the protein according to methods known to those skilled in the art. The resulting fusion product 
including the proteins of the invention or part thereof is then covalently, or by any other means, 
bound to a protein, carbohydrate or matrix (such as gold, "Sephadex" particles, polymeric surfaces). 
Another embodiment of the present invention relates to methods and compositions using 

30 the proteins of the invention, or part thereof, to associate specific mRNAs to the inner face of lipidic 
bilayers of liposomes in order to further introduce these mRNAs into the cytoplasm of eukaryotic 
cells. Preferably, specific mRNAs are first associated with the protein of the invention and the 
RNA/protein complex formed in that way is then mixed with liposomes according to methods 
known to those skilled in the art. These liposomes are added to an in vitro culture of eukaryotic
cells. In vivo, such a method might treat and/or prevent disorders linked to dysregulation of gene
transcription such as cancer and other disorders relating to abnormal cellular differentiation, 
proliferation, or degeneration. 

A decrease in ribosome function results in a significant inhibition of cell growth. 
Therefore, in another embodiment, the present proteins and nucleic acids can be used to modulate
the rate of cell growth in vitro or in vivo. Accordingly, compounds that inhibit the expression or
function of the proteins of the invention can be used to inhibit the growth rate of cells, and can thus
be used, e.g. in the treatment or prevention of diseases or conditions associated with excessive cell
growth, such as cancer or inflammatory conditions. Such compounds include, but are not limited
to, antibodies, antisense molecules, dominant negative forms of the proteins, and any heterologous
compounds that inhibit the expression or the activity of the proteins.

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID
NO:133 and 429 and polypeptides of SEQ ID NO:302 and 543, described throughout the present
application also pertain to the human cDNA of clone 1000891255, and the polypeptides encoded
thereby.

Proteins of SEQ ID NO:271 (internal designation Clone ID:493328), related clones 153261,
152042, 599054, and 650872 and related protein of SEQ ID NO:533

The polynucleotides of SEQ ID Nos:102, 103, 104, 105, and 106 and polypeptides of SEQ
ID Nos:271, 272, 273, 274, 275 and 533 encode for the HUMAN GENSET BINDING PROTEIN
or HGBP-1. Preferred polypeptides comprise the amino acid sequence
MKVKIKCWNGVATWLWVANDENCGICRMAFNGCCPDCKVPGDDCPLVWGQCSHCFHMHCILKWLHAQQVQQHCPMCRQEWKFKE.
It will be appreciated that all characteristics and
uses of the polynucleotides of SEQ ID Nos:102, 103, 104, 105, and 106 and polypeptides of SEQ
ID Nos:271, 272, 273, 274, 275 and 533, described throughout the present application also pertain
to the human cDNA of clones 493328, 153261, 152042, 599054, and 650872, and the polypeptides
encoded thereby.

The protein of SEQ ID NO:271 encoded by the extended cDNA SEQ ID NO:102 is the
same as a hepatocellular carcinoma associated ring finger protein (EMBL AF247565) and Genset
protein in WO0100806 (Genpep accession AX061622) with homology to an anaphase-promoting
complex (APC) subunit from Drosophila (EMBL accession number AJ251510). In addition, HGBP-1
exhibits the pfam PHD zinc finger signature from positions 33 to 79.

Zinc binding domains which contain a C3HC4 sequence motif are known as RING domains
(Lovering, R. et al. (1993) Proc. Natl. Acad. Sci. USA 90:2112-2116). Zinc finger domains are
found in numerous zinc binding proteins which are involved in protein-protein and protein-nucleic
acid interactions. They are independently folded zinc-containing mini-domains which are used in a
modular repeating fashion to achieve sequence-specific recognition of DNA (Klug 1993 Gene 135,
83-92). Such zinc binding proteins are commonly involved in the regulation of gene expression,
and usually serve as transcription factors, either by directly affecting transcription or recruiting
co-activators or co-repressors (see US patents 5,866,325; 6,013,453 and 5,861,495). PHD fingers are
C4HC3 zinc fingers spanning approximately 50-80 residues and distinct from RING fingers or LIM
domains. They are thought to be mostly DNA or RNA binding domains but may also be involved in
protein-protein interactions (for a review see Aasland et al. Trends Biochem Sci 20:56-59 (1995)).

HGBP-1 or part thereof is a zinc binding protein, which is able to bind nucleic acids, more
preferably a transcription factor. Preferred polypeptides of the invention are polypeptides
comprising the amino acids of SEQ ID NO:271 from positions 33 to 79. Other preferred
polypeptides of the invention are fragments of SEQ ID NO:271 having any of the biological
activities described herein. The nucleic acid binding activity of the protein of the invention or part
thereof may be assayed using any of the assays known to those skilled in the art including those
described in US patent 6,013,453.
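
As an illustration of the fragment defined above, the following minimal sketch extracts residues 33 to 79 from the preferred amino acid sequence quoted earlier and counts the cysteine and histidine residues that could serve as zinc ligands. It assumes that the quoted sequence is numbered from its first methionine in the same way as SEQ ID NO:271, which the text does not state explicitly; the names `HGBP1_QUOTED` and `fragment` are hypothetical and are used only for this example.

```python
# Preferred HGBP-1 amino acid sequence as quoted in the text (one-letter code).
# Assumption: residue 1 is the initial methionine of this quoted sequence.
HGBP1_QUOTED = (
    "MKVKIKCWNGVATWLWVANDENCGICRMAFNGCCPDCKVPGDDCPLVWGQCSHCFHM"
    "HCILKWLHAQQVQQHCPMCRQEWKFKE"
)


def fragment(sequence: str, first: int, last: int) -> str:
    """Return residues first..last (1-based, inclusive) of a protein sequence."""
    if not 1 <= first <= last <= len(sequence):
        raise ValueError("positions must satisfy 1 <= first <= last <= len(sequence)")
    return sequence[first - 1:last]


# Region carrying the PHD zinc finger signature according to the text.
zinc_finger = fragment(HGBP1_QUOTED, 33, 79)

# Count candidate zinc-coordinating residues (Cys and His) in the region.
ligand_counts = {residue: zinc_finger.count(residue) for residue in "CH"}

if __name__ == "__main__":
    print(zinc_finger)
    print(ligand_counts)
```

The slice uses 1-based inclusive coordinates to match the position numbering used in the specification rather than Python's 0-based indexing.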

The invention relates to methods and compositions using the protein of the invention or part 
thereof to bind to nucleic acids, preferably DNA, alone or in combination with other substances. 
For example, the protein of the invention or part thereof is added to a sample containing nucleic
acid in conditions allowing binding, and allowed to bind to nucleic acids. In a preferred 
embodiment, the protein of the invention or part thereof may be used to purify nucleic acids such as 
restriction fragments. 

In another preferred embodiment, HGBP polypeptides or parts thereof may be used to
visualize nucleic acids when the polypeptide is linked to an appropriate fusion partner, or is
detected by probing with an antibody. Thus, HGBP polypeptides can be used to diagnose. . . 

Alternatively, the protein of the invention or part thereof may be bound to a 
chromatographic support, either alone or in combination with other DNA binding proteins, using 
techniques well known in the art, to form an affinity chromatography column. A sample containing
nucleic acids to purify is run through the column. Immobilizing the protein of the invention or part
thereof on a support is particularly advantageous for those embodiments in which the method is to
be practiced on a commercial scale. This immobilization facilitates the removal of the protein from
the batch of product and subsequent reuse of the protein. Immobilization of the protein of the
invention or part thereof can be accomplished, for example, by inserting a cellulose-binding domain
in the protein. One of skill in the art will understand that other methods of immobilization could
also be used and are described in the available literature. 

In another embodiment, the present invention relates to compositions and methods using the
protein of the invention or part thereof, especially the zinc binding domain, to alter the expression
of genes of interest in target cells, using any techniques known to those skilled in the art including
those described in US patents 5,861,495; 5,866,325 and 6,013,453. Such genes of interest may be
disease related genes, such as oncogenes, or exogenous genes from pathogens, such as bacteria or
viruses.

In a further embodiment, the protein of the invention or part thereof may be used to 
diagnose, treat and/or prevent disorders linked to dysregulation of gene transcription such as cancer 
and other disorders relating to abnormal cellular differentiation, proliferation, or degeneration,
including hyperaldosteronism, hypocortisolism (Addison's disease), hyperthyroidism (Graves'
disease), hypothyroidism, colorectal polyps, gastritis, gastric and duodenal ulcers, ulcerative colitis,
and Crohn's disease. The invention relates to methods of diagnosing, treating and/or preventing
disorders described herein, comprising delivering to a patient, or causing to be present therein, a
zinc finger polypeptide which inhibits the expression of a gene enabling the cells to divide. The
target could be, for example, an oncogene or a normal gene which is overexpressed in the cancer
cells. 



ISPG Iron-sulfur cluster protein (Clone ID:1000872335)

The polynucleotides of SEQ ID NOs:43 and 374 encode the amino acid sequences of SEQ
ID NOs:212 and 491 respectively, an iron-sulfur protein which mediates electron transfer in
metabolic reactions, also referred to as ISPG. It will be appreciated that all characteristics and uses
of the polynucleotides of SEQ ID NOs:43 and 374 and polypeptides of SEQ ID NOs:212 and 491,
described throughout the present application also pertain to the human cDNA of clone 1000872335,
and the polypeptides encoded thereby.

ISPG is an iron-sulfur protein that belongs to the broad family of the 2Fe-2S-type
ferredoxins. The 2Fe-2S-type ferredoxins are proteins or domains of around one hundred amino
acid residues that bind a single 2Fe-2S iron-sulfur cluster and are found in plants, animals and
bacteria. Iron-sulfur cluster proteins are well known classes of proteins and are recognized as ideal
devices for accepting, donating, storing and shifting electrons. It will be appreciated by the skilled
artisan that structural aspects of iron-sulfur proteins have been studied extensively (reviewed in
Beinert et al. Nature 1 Aug 1997, 277:653-659), allowing modifications of the ISPG protein to fine
tune its properties for desired uses while retaining its biological function in mediating electron
transfer and possible protein stabilization and iron or sulfide storage functions. The ISPG protein of
SEQ ID NO:491 comprises a glycosaminoglycan attachment site at amino acid position 34
(SGSG); protein kinase C phosphorylation sites at amino acid positions 14 (SAR), 44 (TTR), and
86 (SGR); N-myristoylation sites at amino acid positions 11 (GGVSAR), 24 (GTXWNR), 31
(GGTSGS), 39 (GVALGT), and 106 (GACEAS); a cytochrome c family heme-binding site at amino
acid positions 114-119; and an iron-sulfur binding region signature at amino acid positions 108-118.

In view of its role in electron transfer reactions, ISPG is thought to be involved in a wide 
variety of metabolic reactions and disorders, and may be useful in the treatment of disorders of 
metabolism such as obesity, in the detection of toxic compounds, in prediction, diagnosis or 
treatment of conditions or traits related to drug metabolism or in treatments related to the synthesis
of, e.g., steroid hormones.

In a preferred example, overexpression or administration of the ISPG protein may be used 
as a therapeutic treatment for obesity by accelerating the metabolic rate of a subject in need of 
treatment. There is accumulating evidence to support the hypothesis that a low-energy-output 
phenotype is at high risk of weight gain and obesity, irrespective of whether this is owing to a low
resting metabolic rate and/or physical inactivity. The low-energy-output phenotype is associated
with impaired appetite control, which is improved if energy output is increased, serving as the
background for pharmacologic stimulation of energy expenditure as a tool to improve the results of
obesity management. The ISPG protein and agonists or stimulators thereof may serve as a means to
increase electron transfer and hence the metabolic rate of an individual, with a goal similar to that of
commonly cited targets such as leptin receptors, the sympathetic nervous system and its peripheral 
beta-adrenoceptors, selective thyroid hormone derivatives, and stimulation of the mitochondrial 
uncoupling proteins. 

In addition, iron-sulfur proteins such as ISPG are generally recognized as being capable of
several functions that are not of an oxidoreductive nature such as the binding and activation of
substrates at the unique iron site (in the catalytic function of aconitase and related enzymes), and
apparently stabilizing radicals in reactions occurring by a free-radical pathway. There is also
evidence suggesting that such iron-sulfur clusters can function in coupling electron transfer to
proton transport. By binding Cys ligands from different subunits, iron-sulfur clusters effect dimer
formation, as in the Fe protein of nitrogenase. Further, by straddling protein structural elements,
iron-sulfur clusters are able to stabilize structures that are required for specific functions (e.g.
endonuclease III of E. coli). Proteins of ISPG's class have also been shown to protect proteins from
the attack of intracellular proteases. Finally, proteins of ISPG's class are thought to be capable of
serving as storage devices for iron and possibly sulfide.

Thus, in a few examples, ISPG may be used advantageously as an iron or metal biosensor, 
for the treatment and/or diagnosis of iron overload disorders, or in applications involving stabilizing 
target proteins such as for protein production or for mediating protein interactions. 

Additionally, structural aspects of ISPG suggest that it may be capable of mediating steroid
hormone synthesis, either in humans or animals, or in engineered cell culture systems for the large
scale production of hormones. ISPG may be used to act as an adrenal ferredoxin (known as
adrenodoxin (ADX)), a vertebrate mitochondrial protein which transfers electrons from
adrenodoxin reductase to cytochrome P450scc, which is involved in cholesterol side chain cleavage.
Its primary function as a soluble electron carrier between the NADPH-dependent adrenodoxin
reductase and several cytochromes P450 makes it an irreplaceable component of steroid
hormone biosynthesis in the adrenal mitochondria of vertebrates.

DRUG METABOLISM 

Previous studies have revealed that cytochrome P-450 isozymes are responsible for drug
metabolism, and oxidation by P-450 isozymes is a common aspect of the overall clearance of drugs.
Further studies have revealed that genetic polymorphism of cytochrome P-450 isozymes underlies a
wide spectrum of substrate specificity in drug oxidation. In certain cases, genetic mutation and/or
deletion of one critical isozyme gene results in a significant alteration of a phenotype projected on
substrate specificity. It has been reported that CYP2D6 oxidizes more than 30 drugs (for example,
M. Eichelbaum et al., Pharmacol. Ther., Vol. 46, pp. 377-, 1990). Many anti-cancer drugs are
known to be oxygenated by cytochrome P450 enzymes to yield metabolites that are cytotoxic or
cytostatic toward tumor cells. These include several commonly used cancer chemotherapeutic
drugs, such as cyclophosphamide (CPA), its isomer ifosfamide (IFA), dacarbazine, procarbazine,
thio-TEPA, etoposide, 2-aminoanthracene, 4-ipomeanol, and tamoxifen (LeBlanc, G. A. and
Waxman, D. J., Drug Metab. Rev. 20:395-439 (1989); Ng, S. F. and Waxman D. J., Intl. J.
Oncology 2:731-738 (1993); Goeptar, A. R., et al., Cancer Res. 54:2411-2418 (1994); van Maanen,
J. M., et al., Cancer Res. 47:4658-4662 (1987); Dehal, S. S., et al., Cancer Res. 57:3402-3406
(1997); Rainov, N. G., et al., Human Gene Therapy 9:1261-1273 (1998)). Bioreductive metabolism
that results in drug activation is also catalyzed by cytochrome P450 enzymes for a variety of
anti-cancer drugs. Examples of such drugs include Adriamycin, mitomycin C, and
tetramethylbenzoquinone (Goeptar, A. R., et al., Crit. Rev. Toxicol. 25:25-65 (1995); Goeptar, A.
R., et al., Mol. Pharmacol. 44:1267-1277 (1993)). Those who have homozygous alteration in this
recessive gene are so-called "poor metabolizers (PMs)" and may suffer from severe side effects due
to poor metabolism of drugs (for example, see M. Eichelbaum et al., Pharmacol. Ther., Vol. 46, pp.
377-, 1990). Such genetic alterations occur at rates of from 1 to 30% in different ethnic populations
(for example, L. M. Distlerath et al., J. Biol. Chem., Vol. 260, pp. 9057-, 1985).

The ISPG protein of the invention is thought to be capable of functioning as a soluble 
electron carrier in the electron transport chain involving one or more of the several available
cytochrome P450 enzymes. The ISPG protein may thus be useful in methods of killing neoplastic
cells involving P450 (and ISPG) gene transfer and the use of bioreductive drugs that are activated 
by cytochrome P450, and in methods for evaluating the susceptibility of a sample compound to 
metabolism with respect to a specific cytochrome P-450 isozyme system. 

Thus, in a first aspect, a drug activation/gene therapy strategy has been developed based on
a cytochrome P450 gene ("CYP" or "P450") in combination with a cancer chemotherapeutic agent
that is activated through a P450-catalyzed monooxygenase reaction (Chen, L. and Waxman, D. J.,
Cancer Research 55:581-589 (1995); Wei, M. X., et al., Hum. Gene Ther. 5:969-978 (1994); U.S.
Pat. No. 5,688,773, issued Nov. 18, 1997). Presently known drug-enzyme combinations can utilize
established chemotherapeutic drugs widely used in cancer therapy. Such methods to obtain
enhanced chemosensitivity have been demonstrated both in vitro and in studies using a
subcutaneous rodent solid tumor model and human breast tumor grown in nude mice in vivo, and are
strikingly effective in spite of the presence of a substantial liver-associated capacity for drug
activation in these animals (Chen, L., et al., Cancer Res. 55:581-589 (1995); Chen, L., et al., Cancer
Res. 56:1331-1340 (1996)). The P450-based approach also shows significant utility for gene
therapy applications in the treatment of brain tumors (Wei, M. X., et al., Human Gene Ther.
5:969-978 (1994); Manome, Y., et al., Gene Therapy 3:513-520 (1996); Chase, M., et al., Nature
Biotechnol. 16:444-448 (1998)).

Although the P450/drug activation system has shown great promise against several tumor
types, further enhancement of the activity of this system is needed to achieve clinically effective,
durable responses in cancer patients. This requirement is necessitated by two characteristics that are
inherent to the P450 enzyme system: (1) P450 enzymes metabolize drugs and other foreign
chemicals, including cancer chemotherapeutic drugs, at low rates, with a typical P450 turnover
number (moles of metabolite formed/mole P450 enzyme) of only 10-30 per minute; and (2) P450
enzymes metabolize many chemotherapeutic drugs with high Km values, typically in the millimolar
range. This compares to plasma drug concentrations that are only in the micromolar range for many
chemotherapeutic drugs, including drugs such as CPA and IFA. Thus, current approaches to P450
gene therapy may result in intratumoral drug activation at a low absolute rate and under conditions
that are not saturating with respect to drug substrate. Furthermore, since P450 is expressed at a very
high level in liver tissue, only a very small fraction of the administered chemotherapeutic drug is
metabolized via the tumor cell P450 gene product using the currently available methods for P450
gene therapy (Chen, L. and Waxman, D. J., Cancer Res. 55:581-589 (1995)). As described in U.S.
Patent No. 6,207,648, one enhancement involves introducing a P450 reductase (RED) gene in
combination with a cytochrome P450 gene (and thus a P450 gene product) into neoplastic cells,
whereby the enzymatic conversion of a P450-activated chemotherapeutic drug to its therapeutically
active metabolites is greatly enhanced within the cellular and anatomic locale of the tumor, thereby
increasing both the selectivity and efficiency with which neoplastic cells are killed.
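
The kinetic limitation described above can be made concrete with the standard Michaelis-Menten rate law, v = kcat x [S] / (Km + [S]). The following sketch is illustrative only: the turnover number of 20 per minute is chosen from within the 10-30 per minute range cited above and is treated here as the maximal turnover, while the Km of 1 mM and the plasma drug concentration of 10 micromolar are assumed values that merely represent the "millimolar" and "micromolar" ranges mentioned in the text. The function names are hypothetical.

```python
def fractional_saturation(substrate_molar: float, km_molar: float) -> float:
    """Fraction of the maximal rate reached at a given substrate concentration
    under Michaelis-Menten kinetics: [S] / (Km + [S])."""
    if substrate_molar < 0 or km_molar <= 0:
        raise ValueError("substrate must be >= 0 and Km must be > 0")
    return substrate_molar / (km_molar + substrate_molar)


def turnover_per_minute(kcat_per_min: float, substrate_molar: float,
                        km_molar: float) -> float:
    """Moles of metabolite formed per mole of enzyme per minute."""
    return kcat_per_min * fractional_saturation(substrate_molar, km_molar)


if __name__ == "__main__":
    # Assumed, illustrative values (not measurements from the text).
    kcat = 20.0          # per minute, within the cited 10-30 per minute range
    km = 1e-3            # 1 mM, standing in for "millimolar range"
    plasma_drug = 1e-5   # 10 micromolar, standing in for "micromolar range"

    saturation = fractional_saturation(plasma_drug, km)
    rate = turnover_per_minute(kcat, plasma_drug, km)
    print(f"fraction of maximal rate: {saturation:.4f}")
    print(f"effective turnover: {rate:.3f} per minute")
```

Under these assumed values the enzyme operates at roughly one percent of its maximal rate, which is the sense in which the conditions are "not saturating with respect to drug substrate".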

In a preferred embodiment, further enhancements to known prodrug-enzyme strategies may
be achieved by introducing an ISPG gene into neoplastic cells, either alone or in combination with a
P450 gene and/or a P450 reductase gene. Suitable vectors for the introduction and expression of
said ISPG and P450 genes are known to one of skill in the art. The introduction of the ISPG gene
and subsequent expression of the ISPG gene product may increase the enzymatic conversion of a
P450-activated chemotherapeutic drug to its therapeutically active metabolites. Thus, the invention
comprises a method for killing neoplastic cells comprising: (a) infecting the neoplastic cells with a
vector for gene delivery, the vector comprising an ISPG gene capable of mediating enzymatic
conversion of a chemotherapeutic agent by a P450 enzyme; (b) optionally infecting the neoplastic
cells with a vector for gene delivery, the vector comprising a cytochrome P450 gene and/or a gene
encoding RED; (c) treating the neoplastic cells with a chemotherapeutic agent that is activated by
the product of the cytochrome P450 gene; and (d) killing the neoplastic cells.

The present invention also provides a reagent composition for use in evaluating drug
metabolism by a specific cytochrome P-450 isozyme, which comprises a liver microsome lacking
said specific P-450, said specific P-450 isozyme and a carrier material. The liver microsome may be
of human source lacking CYP2D6, CYP2C19, or CYP2A6. The CYP2D6 isozyme, CYP2C19
isozyme, and CYP2A6 isozyme to be added may be a recombinant CYP2D6-expressing
microsome, a recombinant CYP2C19-expressing microsome, or a recombinant CYP2A6-expressing
microsome. The reagent composition may comprise more than one kind of PM
microsome.

According to the present invention, there can be provided a reagent composition and a
method for accurately quantitating the contribution of certain P-450 isozymes such as CYP2D6,
CYP2C19, and CYP2A6 in drug metabolism. The present invention provides a method for
evaluating the susceptibility of a sample compound to metabolism with respect to a specific
cytochrome P-450 isozyme, which comprises contacting the sample compound with a reagent
composition prepared by adding said specific cytochrome P-450 isozyme and an ISPG protein to
liver microsomes lacking said specific cytochrome P-450 isozyme in a carrier material. ISPG
would be useful in order to enhance efficiency of the P-450 isozyme in drug metabolism, thereby
effectively amplifying the power of the assay to detect the contribution of a particular P-450
enzyme. In another embodiment the contribution to drug metabolism of a particular P-450 system
can be assessed by focussing on a particular iron-sulfur protein associated with said specific P-450
system. In this aspect, the method comprises contacting the sample compound with a reagent
composition prepared by adding said specific ISPG protein to liver microsomes lacking said ISPG
protein in a carrier material. The method may further comprise (a) incubating a mixture of the
sample compound and the reagent composition; (b) extracting the reaction mixture obtained in
Step (a); and (c) analyzing the reaction products isolated in Step (b). For the purposes of
quantitating the assay, a plurality of the reagent compositions having different amounts of the
specific P-450 isozyme or ISPG protein may be subjected to Steps (a) to (c), respectively. For
example, the specific P-450 isozyme to be used in the method may be selected from CYP2D6,
CYP2C19, CYP2A6, CYP1A1 and CYP2E1.

IRON BIOSENSOR

Iron-sulfur clusters have been found to serve as sensors of iron, dioxygen, superoxide ion
and possibly nitric oxide. Two main mechanisms of sensing have been described. In one example,
the oxidation [Fe2S2]+ -> [Fe2S2]2+ by dioxygen provides the signal for activation of a defense
mechanism against superoxide, as observed with the SoxR protein of E. coli, and thus may serve a
cytoprotective function (e.g. useful for treatment of ischemia, etc.). In an alternative mechanism,
oxidative disassembly or reassembly of a cluster provides the controlling signal as in the FNR
protein of E. coli.

In a further detailed example, the ISPG protein of the invention may be used as an iron
biosensor. An example of an iron biosensor is provided in U.S. Patent No. 5,516,697 (Kruzel et
al.). In summary, ISPG can be immobilized in the vicinity of a device to measure the change in pH,
causing a detectable variation of the potential upon the binding of iron.

In the process of sequestering iron, the sensing element, the ISPG protein, is expected to
release a number of protons (H+) directly proportional to the atoms of iron bound. The
release of protons during the binding of iron by ISPG becomes the operative feature which is
measured by the biosensors. The release of protons causes a change in pH and is measured by an
ion-selective field effect transistor or by pH sensitive paper. A sample containing iron is placed into
a buffered solution, usually water. The sample may be diluted one or more times. In one
embodiment of the sensors of the present invention, the release of protons is measured as the
variation of the potential on the surface of an ion-selective field effect transistor (an ISFET)
(Reviewed in: Biosensor Technology, edited by Buck et al. and published by Marcel Dekker, Inc.,
1990, entitled "Solid State Potentiometric Sensors" by Jiri Janata, pp 17-34). In another
embodiment of the present invention, the protons released upon binding of iron by ISPG are
detected by the change in pH using pH sensitive paper. Preferably, the iron selective element
(ISPG) is incorporated in close proximity or integrated with the signal transducer, to give a
reagentless sensing system for iron. Since the signal can be amplified, only small quantities of ISPG
are needed for detection of iron. The ISFET is modified by immobilizing ISPG on the surface of the
ISFET or by a disposable membrane with immobilized ISPG that is in close proximity to the ISFET
by attaching the ISPG-modified membrane to the surface of the ISFET. A sample containing iron,
for example a biological sample such as body fluid from a mammal, particularly a human, is then
contacted with the ISPG-modified ISFET.

In order to produce an ISPG-modified ISFET as an independent sensor, an existing system
which uses an ISFET designed to measure pH can be modified. Systems which presently use an
ISFET to measure pH are the Sentron 2001 pH system, manufactured by Integrated Sensor
Technology, Federal Way, Wash.; the Corning 360i pH system, manufactured by Corning
Incorporated, Corning, N.Y.; or Orion 610 pH system, manufactured by Orion Analytical
Technology, Inc., Boston, Mass. The modification required to measure the amount of iron in a
sample is either to place an immobilized layer of ISPG on the ISFET of such a system or,
alternatively, to provide an ISPG-modified membrane, i.e. a membrane coated with ISPG, which
will be in close proximity to the existing ISFET so as to detect the release of protons when the ISPG
binds iron in a sample and to record the change in potential.
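
The measurement principle described above, in which the change in potential recorded at the ISPG-modified ISFET reflects the amount of iron bound, could in practice be reduced to a calibration against standards of known iron content. The following sketch is a hypothetical illustration only: it assumes an approximately linear relationship between iron concentration and the change in ISFET potential over the working range, which the text does not establish, and the calibration values are invented for the example rather than measured. The function names are likewise hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two (x, y) calibration points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("calibration concentrations must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_iron(delta_potential_mv, slope, intercept):
    """Invert the calibration line to estimate iron concentration."""
    if slope == 0:
        raise ValueError("calibration slope is zero; sensor shows no response")
    return (delta_potential_mv - intercept) / slope


if __name__ == "__main__":
    # Invented calibration standards: iron (micromolar) vs. potential change (mV).
    iron_standards_um = [0.0, 5.0, 10.0, 20.0]
    delta_mv = [0.5, 3.0, 5.5, 10.5]

    slope, intercept = fit_line(iron_standards_um, delta_mv)
    unknown = estimate_iron(4.0, slope, intercept)
    print(f"slope={slope:.3f} mV/uM, intercept={intercept:.3f} mV")
    print(f"estimated iron in unknown sample: {unknown:.2f} uM")
```

A real sensor would require calibration in the same buffer as the sample, since the buffering capacity determines how strongly a given proton release shifts the pH.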

Iron sensitive biosensors (as well as treatments for iron overload disorders) are extremely
valuable. It is estimated that 30,000,000 Americans suffer from different types of iron related
disorders, including a substantial proportion with profound iron deficiency syndrome. Detection of
bioaccessible iron is one of the most important measurements that doctors can use for early
detection of iron deficiency, iron overload or other types of immunological disorders. To date iron
is measured through a combination of blood tests that detect iron and iron binding capacity of
transferrin, the protein that transports iron through the body. The current technology involves very
sophisticated instrumentation which makes this analysis prohibitively expensive and often requires
qualified personnel to analyze the sample. Therefore, there is a need for the direct assay of iron that
combines simplicity and economics. ISPG may therefore be advantageously used in development of
a biosensor for detecting the amount of iron in a sample.

STEROID BIOSYNTHESIS 

As noted above, ISPG may be used as an adrenal ferredoxin (known as adrenodoxin
(ADX)), a vertebrate mitochondrial protein which transfers electrons from adrenodoxin reductase to
cytochrome P450scc, which is involved in cholesterol side chain cleavage and is an irreplaceable
component of steroid hormone biosynthesis in the adrenal mitochondria.

In therapeutic embodiments, ISPG may have particular importance in treatment of disorders
where it is desired to increase the level of steroid hormone synthesis. As P450scc has a critical role
in the conversion of cholesterol into pregnenolone, ISPG may be used as a limiter or
enhancer of steroid synthesis. In but one example, evidence has shown that cytochrome
P450scc activity in the human placenta is limited by the supply of electrons to the P450scc.
Furthermore, Tuckey et al. (Eur. J. Biochem. 1999 Jul;263(2):319-325) have shown that P450scc
activity can be increased considerably by adding adrenodoxin reductase and adrenodoxin. Thus,
ISPG may be useful in the treatment of reproductive disorders by augmenting the electron supply to
P450scc, and thus increasing the level of progesterone synthesis. Accordingly, in another example,
ISPG may be used to limit steroid synthesis, whether for therapeutic or for research uses.

ISPG may also be used in biological steroid synthesis processes for the production of
steroid hormones. For example, Duport et al, Nat Biotechnol 1998 Feb;16(2):186-9 report a system
for self-sufficient biosynthesis of pregnenolone and progesterone in engineered yeast wherein the
first two steps of the steroidogenic pathway were reproduced in Saccharomyces cerevisiae.
Engineering of sterol biosynthesis by disruption of the delta 22-desaturase gene and introduction of
the Arabidopsis thaliana delta 7-reductase activity and coexpression of bovine side chain cleavage
cytochrome P450, adrenodoxin, and adrenodoxin reductase, led to pregnenolone biosynthesis from
a simple carbon source. As ISPG is thought to be capable of functioning as an adrenodoxin protein,
ISPG may be used as a functional substitute in the system of Duport et al for adrenodoxin.

MTG (METALLOTHIONEIN) (Clone ID:654627)

SEQ ID NOs:96 and 413 and clone FLl I:654627_182-5-3-0-FI0-F encode the polypeptides
of SEQ ID NOs:265 and 527 respectively, a metallothionein protein which binds heavy metal. Said
polypeptide of the invention is also referred to herein as MTG. It will be appreciated that all
characteristics and uses of the polynucleotides of SEQ ID NOs:96 and 413 and polypeptides of SEQ
ID NOs:265 and 527, described throughout the present application also pertain to the human cDNA
of clone 654627, and the polypeptides encoded thereby.

Metallothioneins (MT) [1,2,3] are small proteins which bind heavy metals such as zinc,
copper, cadmium, nickel, etc., through clusters of thiolate bonds. MTs occur throughout the
animal kingdom and are also found in higher plants, fungi and some prokaryotes and are thought to
play a role in metal detoxification or in the metabolism and homeostasis of metals. On the basis of
structural relationships MTs have been subdivided into three classes. Class I includes mammalian
MTs as well as MTs from crustaceans and molluscs, but with clearly related primary structure.
Class II groups together MTs from various species such as sea urchins, fungi, insects and
cyanobacteria which display none or only very distant correspondence to class I MTs.
Class III MTs are atypical polypeptides containing gamma-glutamylcysteinyl units.

Vertebrate class I MTs such as the MTG protein of the invention are proteins of typically
60 to 68 amino acid residues; 20 of these residues are cysteines that bind to 7 bivalent metal ions.
As a signature pattern we chose a region that spans 19 residues and which contains seven of the
metal-binding cysteines; this region is located in the N-terminal section of class I MTs. A
consensus pattern for class I MTs is as follows:
C-x-C-[GSTAP]-x(2)-C-x-C-x(2)-C-x-C-x(2)-C-x-K.
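By way of illustration only, the consensus pattern above can be translated into a regular
expression and used to scan an amino acid sequence. The sketch below is not part of the disclosed
methods; the function names and the toy sequence are hypothetical, and the toy sequence is an
artificial string constructed to contain one match, not any SEQ ID NO of the present application.

```python
import re

# Consensus pattern for class I MTs as quoted in the text (PROSITE-style syntax).
CLASS_I_MT_PATTERN = "C-x-C-[GSTAP]-x(2)-C-x-C-x(2)-C-x-C-x(2)-C-x-K"

# One pattern element: x, a residue, [allowed], or {forbidden}, with optional (n) or (n,m).
_ELEMENT = re.compile(r"(x|[A-Z]|\[[A-Z]+\]|\{[A-Z]+\})(?:\((\d+)(?:,(\d+))?\))?")


def prosite_to_regex(pattern: str) -> str:
    """Translate a simple PROSITE-style pattern into a Python regular expression."""
    parts = []
    for element in pattern.strip().rstrip(".").split("-"):
        match = _ELEMENT.fullmatch(element)
        if match is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        core, low, high = match.groups()
        if core == "x":
            core = "."                       # any residue
        elif core.startswith("{"):
            core = "[^" + core[1:-1] + "]"   # any residue except those listed
        if low is not None:
            core += "{" + low + ("," + high if high else "") + "}"
        parts.append(core)
    return "".join(parts)


def find_motif(pattern: str, sequence: str) -> list:
    """Return 1-based (start, end) positions of non-overlapping matches."""
    regex = re.compile(prosite_to_regex(pattern))
    return [(m.start() + 1, m.end()) for m in regex.finditer(sequence)]


# Artificial example sequence (hypothetical, for demonstration only).
TOY_SEQUENCE = "MAAA" + "CACGAACACAACACAACAK" + "AAA"

if __name__ == "__main__":
    print(prosite_to_regex(CLASS_I_MT_PATTERN))
    print(find_motif(CLASS_I_MT_PATTERN, TOY_SEQUENCE))
```

The translated expression spans 19 residues and contains seven cysteines, in agreement with the
description of the signature region given above.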

The MTG protein of SEQ ID NO: 527 has a metallothionein domain (Prosite ref PS00203)
at amino acid positions 13-31; an N-glycosylation site at position 4 (NCSC); protein kinase C
phosphorylation sites at amino acid positions 18 (SCK), 28 (SCK) and 55 (SQR); a casein kinase II
phosphorylation site at amino acid position 41 (TLVD); an N-myristoylation site at amino acid
position 10 (GVSCTC); and a prokaryotic membrane lipoprotein lipid attachment site at amino
acid position 3 (PNCSCAAGVSC).
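As a hedged illustration, the short site strings quoted above can be checked against generic
site patterns of the kind used by motif databases. The three regular expressions below are
assumptions supplied for this sketch (they follow the commonly cited forms N-{P}-[ST]-{P},
[ST]-x-[RK] and [ST]-x(2)-[DE]) and are not stated in the present application; the dictionary
and function names are likewise hypothetical.

```python
import re

# Assumed generic site patterns, written directly as regular expressions.
SITE_PATTERNS = {
    "N-glycosylation": r"N[^P][ST][^P]",      # assumed form N-{P}-[ST]-{P}
    "PKC phosphorylation": r"[ST].[RK]",      # assumed form [ST]-x-[RK]
    "CK2 phosphorylation": r"[ST].{2}[DE]",   # assumed form [ST]-x(2)-[DE]
}

# Site strings exactly as quoted in the text for the MTG protein.
QUOTED_SITES = {
    "N-glycosylation": ["NCSC"],
    "PKC phosphorylation": ["SCK", "SQR"],
    "CK2 phosphorylation": ["TLVD"],
}


def check_sites(patterns: dict, sites: dict) -> dict:
    """Report, per site type, whether every quoted string fully matches its pattern."""
    return {
        name: all(re.fullmatch(patterns[name], s) is not None for s in strings)
        for name, strings in sites.items()
    }


if __name__ == "__main__":
    for name, ok in check_sites(SITE_PATTERNS, QUOTED_SITES).items():
        print(name, "consistent" if ok else "inconsistent")
```

Such a check only confirms that the quoted strings are consistent with the assumed patterns; it
says nothing about whether the sites are modified in vivo.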

THERAPEUTICS

Discovery of new proteins related to metallothioneins, and the polynucleotides that encode 
them, satisfies a need in the art by providing new diagnostic or therapeutic compositions useful in 
diagnosing and treating heavy metal toxicity, cancer, inflammatory disease and immune disorders. 

Acute or chronic exposure to heavy metals such as lead, arsenic, mercury or cadmium leads
to a variety of diseases and disorders involving neuromuscular, CNS, cardiovascular, and
gastrointestinal effects. MTs may play a role in the prevention or alleviation of these conditions. In
addition, MTs are transcriptionally regulated by glucocorticoids, which suggests that MTs have a
direct role in the effects of glucocorticoids to treat inflammatory disease, immune disorders, and
cancer. It is therefore thought that MTG may have important applications in the treatment of
inflammatory disease, immune disorders, and cancer as well as in cytoprotection in a variety of
therapeutic applications.

In one preferred example, the MTG nucleic acids and protein may be used in a method of suppressing
the production of sunburn cells, which is applicable in various manners with minimal adverse side
effects, in a method of inducing metallothionein, in a method of treating skin diseases and in a
method of screening ultraviolet rays, as well as in cosmetic compositions and UV screening
compositions.

Conventionally, steroids and zinc oxide formulations have been topically used as medicines
for treating skin diseases such as dermatitis, sunburn, neurodermatitis, eczema and anogenital
pruritus. Steroids, however, have been difficult to administer in large quantities for a prolonged
period due to their strong adverse side effects. Zinc oxide formulations, which have local astringent
action, involve problems with respect to the manufacture of pharmaceuticals, since they are
insoluble in water and are not usually administered internally.

Zinc, one of the indispensable trace metals in the living body, is known to participate in the
development of sexual organs and the promotion of wound healing, to be a component of
a metalloenzyme and an accelerator for dehydrogenase, and to have various functions such as activating
the immune system. Zinc is further known to be an inducing factor of metallothionein (MT), a
metal-binding protein. It is reported that MT functions as a scavenger of free radicals which are
generated at the onset of inflammations ["Dermatologica", Hanada, K., et al., 179 (suppl. I) 143
(1989)].

As proposed in U.S. Patent No. 5,582,817 (Otsu et al), MTG may be useful in treatment of
dermatological inflammations caused by external irritative stimulants, such as sunburn or the like,
where MTG could act to quench the free radicals released from leukocytes, especially granulocytes
which gather at the inflamed region, and thereby exhibit an anti-oxidation action to diminish cell
damage, especially to normal lymphocytes, to activate the immune system and further to prevent the
accelerated aging of the skin. Formation of sunburn cells (SBCs) could be suppressed by
administering zinc to induce the presence of MTG, or to increase MTG, in the epidermal keratinous
layer. The anti-oxidation action of MTG can also be useful in the treatment of skin problems resulting
from radiation therapy by X rays, alpha rays, beta rays, gamma rays, neutron rays and accelerated
electron rays.

Otsu et al (supra) studied various zinc compounds with respect to their pharmacological
activities and reported that zinc salts or zinc complexes of a certain compound
have an excellent action of inducing metallothionein (MT) and suppressing sunburn cell (SBC)
production due to UV rays, and are thereby useful as components of cosmetic compositions or
medicines for purposes of ameliorating sunburn, preventing sunburn, ameliorating suffering from
skin diseases and ameliorating other radiation-induced disorders, leading to completion of their
invention.

There are two different types of dermatological reactions caused by sunlight: one is an acute
inflammatory change in the skin called sunburn, and the other is a subsequent melanin pigmentation
called suntan. Light having a wavelength in the range of 320 nm or less, called UVB, induces
sunburn and is responsible for erythematous change. The erythemic reaction caused by UV rays, as
opposed to a burn injury, does not occur immediately after the exposure to the sunlight, but rather
occurs after a latent period of several hours. When sunburned skin is histopathologically examined,
various degrees of inflammatory changes are recognized in the epidermis and dermis depending on
the dose of radiation. Among such changes, a notable one is the generation of so-called sunburn
cells (SBC) in the epidermis. A histologically stained tissue sample presents strongly and
acidophilically stained cells which have pyknotic nuclei. This phenomenon indicates the necrosis of
epidermal cells ("Fragrance Journal", 9, 15-20 (1991)). In order to prevent sunburn,
para-aminobenzoic acid derivatives, cinnamic acid derivatives or similar UV absorbers
are used, but their UV absorbing effects are not necessarily satisfactory. What is more, they raise
problems of cumbersome handling upon use, poor stability, low compatibility with other
components of the composition, and also involve unsolved problems in water-resistance and
oil-resistance.

In the field of medicines for the treatment of skin diseases, development of medicines
which have minimal adverse side effects, and which have novel functions obtainable by both
external and internal administrations has been desired. Also, in the field of the therapy and
prevention of radiation disorders, medicines which can suppress and cure the disorders caused by
oxidative reactions have been desired. Lastly, in the field of the manufacture of cosmetics,
cosmetics which overcome the above-mentioned problems such as handling upon use and stability
of the composition have been desired. Accordingly, the present invention encompasses providing
therapeutic agents for treating skin diseases having the above-mentioned characteristics, wherein
said agents are capable of inducing MTG for suppressing the formation of sunburn cells, and for use
in cosmetic compositions. Also encompassed are methods of screening for therapeutic agents for
treating skin diseases comprising bringing a test compound into contact with a cell, tissue or animal 
model of disease, and detecting induction of MTG expression or function. 

GENE EXPRESSION SYSTEMS 

The MTG nucleic acids and proteins of the invention may also be advantageously used in the
production of recombinant proteins as biopharmaceutical products at commercial scale.

Previously, genes have been extensively expressed in mammalian cell lines, particularly in
mutant Chinese Hamster Ovary (CHO) cells deficient in the dihydrofolate reductase gene (dhfr) as
devised by the method of Urlaub et al, PNAS U.S.A. 77, 4216-4220, 1980. A variety of expression
systems have been used. Many vectors for the expression of genes in such cells are therefore
available. Typically, the selection procedures used to isolate cells transformed with the expression
vectors rely on using methotrexate to select for transformants in which both the dhfr and the target
genes are coamplified. The dhfr gene, which enables cells to withstand methotrexate, is usually
incorporated in the vector with the gene whose expression is desired. Selection of cells under
increasing concentrations of methotrexate is then performed. This leads to amplification of the
number of dhfr genes present in each cell of the population, as cells with higher copy numbers
withstand greater concentrations of methotrexate. As the dhfr gene is amplified, the copy number of
the gene of interest increases concomitantly with the copy number of the dhfr gene, so that
increased expression of the gene of interest is achieved. Unfortunately, these amplified genes have
been reported to be variably unstable in the absence of continued selection (Schimke, J. Biol. Chem.
263, 5989-5992, 1988). This instability is inherent to the presently available expression systems of
CHO dhfr- cells. For many years, several promoters, such as the SV40 early promoter, the CMV early
promoter and the SR alpha promoter, have been used to drive the expression of the target genes.
The CMV and SR alpha promoters are claimed to be the strongest (Wenger et al, Anal.
Biochem. 221, 416-418, 1994).

In one report, the beta-interferon promoter has also been used to drive the expression of
the beta-interferon gene in the mutant CHO dhfr- cells (U.S. Pat. No. 5,376,567). In this
system, however, the selected CHO dhfr- cells had to be superinduced by the method of Tan et al
(Tan et al, PNAS U.S.A. 67, 464-471, 1970; Tan et al, U.S. Pat. No. 3,773,924) to effect a higher
level of beta-interferon production. In this system a significant percentage of the superinduced
beta-interferon produced by the CHO dhfr- cells was not glycosylated. The mouse metallothionein
gene (mMT1) promoter has also been used for the expression of beta-interferon genes in CHO cells,
BHK and LTK- mouse cells (Reiser et al, Drug Res. 37, 4, 482-485 (1987)). However, the
expression of beta-interferon with this promoter was not as good as with the SV40 early promoter in
CHO cells. Further, beta-interferon expression from these cells mediated by the mMT1 promoter
was inducible by heavy metals. Heavy metals are however extremely toxic to the cells and this
system was therefore abandoned. Instead, Reiser et al used the CHO dhfr- expression system in
conjunction with the SV40 early promoter (Reiser et al, Drug Res. 37, 4, 482-485 (1987) and
EP-A-0529300) to produce beta-interferon in CHO dhfr- cells as derived by the method of Urlaub et al
(1980).

As described in U.S. Patent No. 6,207,146 (Tan et al), beta-interferon was expressed in
wild-type CHO cells using a metallothionein based system. MTG may thus be used in similar
applications so as to provide a system for expression of recombinant proteins. Tan et al
demonstrate wild-type CHO cells transfected with a vector comprising a beta-interferon gene
under the control of a mouse sarcoma viral enhancer and mouse metallothionein promoter
(MSV-mMT1), a neo gene under the control of a promoter capable of driving expression of the neo gene in
both E. coli and mammalian cells, and a human metallothionein gene having its own promoter.
Transfected cells capable of expressing beta-interferon were selected by first exposing cells to
geneticin (antibiotic G418), thus eliminating cells lacking the neo gene, and then exposing the
surviving cells to increasing concentrations of a heavy metal ion.

The heavy metal ion enhanced the MSV-mMT1 promoter for the beta-interferon gene,
thus increasing beta-interferon expression. The heavy metal ion also induced the human
metallothionein gene promoter, causing expression of human metallothionein. The human
metallothionein protected the cells against the toxic effect of the heavy metal ion. The presence of
the heavy metal ion ensured that there was continual selection of cells which had the transfecting
vector, or at least the beta-interferon gene and the human metallothionein gene and their respective
promoters, integrated into their genome.

The selected cells that had been successfully transfected expressed beta-interferon.
Expression was surprisingly improved when the cells were cultured in the presence of Zn2+. The
beta-interferon had improved properties, in particular a higher bioavailability, than prior
beta-interferons.

These findings have general applicability and suggest that the MTG gene of the present
invention may be used accordingly in expression systems. Accordingly, the present invention
provides a nucleic acid vector comprising:

(i) a coding sequence which encodes a protein of interest and which is operably linked to a
promoter capable of directing expression of the coding sequence in a mammalian cell in the
presence of a heavy metal ion; (ii) a first selectable marker sequence which comprises an MTG
gene of the invention and which is operably linked to a promoter capable of directing expression of
the MTG gene in a mammalian cell in the presence of a heavy metal ion; and optionally (iii) a
second selectable marker sequence which comprises a neo gene and which is operably linked to a
promoter capable of directing expression of the neo gene in a mammalian cell.

CDPG (GLYCOSYL PHOSPHATIDYLINOSITOL-LINKED GLYCOPROTEIN) (Clone
ID: 1000902917)

SEQ ID NOs: 3 and 341 and clone FL11:1000902917_223-52-4-0-G3-F encode the
polypeptides of SEQ ID NOs: 172 and 458 respectively, a glycosyl phosphatidylinositol-linked
glycoprotein which is thought to be a signal transducing polypeptide expressed in lymphoid,
myeloid, and erythroid cells. Said polypeptide of the invention, which comprises a CD24 signal
transducing domain as well as a GPI-anchor, is also referred to herein as CDPG. CDPG is believed to be
highly glycosylated, and it is expected that CDPG molecular weight will vary among cell types and
cell developmental stages due to differences in glycosylation patterns, providing further specificity in
its use as a therapeutic target. It will be appreciated that all characteristics and uses of the
polynucleotides of SEQ ID NOs: 3 and 341 and polypeptides of SEQ ID NOs: 172 and 458
described throughout the present application also pertain to the human cDNA of clone 1000902917,
and the polypeptides encoded thereby.

It is suggested that CDPG may have a specific role to play in early thymocyte development.
The CDPG protein is thought to be extensively O-glycosylated and may be capable of modulating B-cell
activation responses. As a signal transducer, the CDPG polypeptide's signal transducing
function may in some embodiments be triggered by the binding of a lectin-like ligand to the CD24
domain carbohydrates, and the release of second messengers allowing signaling. The CDPG
polypeptide is thought to have important functions in regulating the differentiation and/or growth of
lymphoid, myeloid, and erythroid cells, including specifically promoting antigen dependent
proliferation of B-cells, and preventing their terminal differentiation into antibody-forming cells.

Additionally, based on a growing body of evidence characterizing the CD24 domain
function, it is proposed that CDPG may have a role as a potent stimulator of neurite outgrowth, and
thus may be useful in the treatment of central nervous system disorders.

Fragments of CDPG, e.g. the GPI-anchor domain, may also be useful, for example in the
development of soluble T-cell receptors (U.S. Patent No. 6,080,840) or in any suitable application
where a temporally controlled solubilization of a protein of interest is desired. For example, a
fragment comprising the CDPG GPI-anchor domain can be used in the production of soluble
molecules by replacing the transmembrane domains of the cDNA of a protein of interest with a
sequence comprising the CDPG glycosylphosphatidylinositol (GPI) linkage. These chimeric
cDNAs are then transferred into an expression vector containing a strong promoter and a mutant
(e.g. DHFR) gene allowing high levels of transcriptional expression and amplification of the gene.
These chimeric genes are then cotransfected into a selected cell type, preferably lacking the
endogenous protein of interest, and transfectants are selected and can also be screened with
antibodies for the protein of interest. These GPI-linked proteins of interest can then be solubilized
by cleavage with the enzyme phosphatidylinositol-specific phospholipase C (PI-PLC) and
purified/concentrated from the supernatant (e.g. by passage over a protein of interest-reactive
antibody affinity column).

THERAPEUTICS

CDPG or inhibitors of CDPG may be used in the treatment of any disorder where it is
desired to regulate B-cell proliferation or differentiation. CDPG or inhibitors thereof may be useful
in the treatment of B-cell neoplasms, a heterogeneous group of diseases characterized by different
maturation states of the B-cell, which are related to the aggressiveness of the disorder. Chronic
lymphocytic leukemia (CLL) is characterized by proliferation and accumulation of B-lymphocytes
that appear morphologically mature but are biologically immature. This disorder accounts for 30%
of leukemias in Western countries. The disorder is characterized by proliferation of biologically
immature lymphocytes, unable to produce immunoglobulins, which cause lymph node enlargement. As a
regulator of B-cell proliferation and differentiation, CDPG and/or inhibitors of CDPG may be
useful for inhibiting proliferation of leukemic B-cells in CLL patients.

CDPG may also be useful in the modulation of cell growth in the CNS. CD24 is known to
be highly expressed in neurons and has been demonstrated as capable of inhibiting neurite
outgrowth of dorsal root ganglion neurons while promoting neurite outgrowth of cerebellar neurons
via interaction with an L1 protein.

SELECTABLE CELL MARKERS 

In one aspect, the CDPG polypeptide may be used as a selectable cell marker, and the invention
relates to a method of using the selectable marker to identify a cell. Viruses such as recombinant
retroviruses have been used as a vehicle for gene transfer based on their potential for highly
efficient infection and non-toxic integration of their genome into a wide range of cell types. The
transfer of exogenous genes into mammalian cells may be used, for example, in gene therapy to
correct an inherited or acquired disorder through the synthesis of missing or defective gene
products in vivo. The expression of exogenous genes in cells may be useful in somatic gene therapy,
to correct hereditable disorders at the level of the gene. Hemopoietic stem cells are particularly
suited to somatic gene therapy as regenerative bone marrow cells may be readily isolated, modified
by gene transfer and transplanted into an immunocompromised host to reconstitute the host's
hemopoietic system.

Gene therapy involving bone marrow transplant with recombinant primary hemopoietic
stem cells requires efficient gene transfer into the stem cells. As a very small number of primary
stem cells can reconstitute the entire host hemopoietic system, it is important that the transferred
gene be efficiently expressed in the recombinant stem cells transferred. The transfer of foreign
genes into a reconstituted host hemopoietic system has been limited by the availability of a
selectable marker which permits the rapid and non-toxic selection of cells which are efficiently
expressing the transferred gene. Currently available selection markers may not be suitable for
primary hemopoietic stem cells since they may alter the proliferative ability or biological
characteristics of the cells. The transfer of foreign genes into a reconstituted host hemopoietic
system has also been limited by the availability of a viral vector capable of expression in
hemopoietic stem cells, especially where more than one transcriptional unit is present in the vector
(Botrell, D. R. L. et al., 1987, Mol. Biol. Med. 4:229).

U.S. Patent No. 5,804,177 (Humphries et al) has demonstrated that the cell surface protein
CD24 (also M1/69-J11d heat stable antigen) can be used as a dominant marker in a recombinant
viral vector. A nucleotide sequence encoding the cell surface protein CD24 in a recombinant viral
vector was used to infect hematopoietic stem cells, and cells infected with the recombinant viral
vector were rapidly and non-toxically selected for in vitro using fluorescence activated cell sorting
(FACS), demonstrating a good correlation between proviral copy number and expression of the
selectable marker.

CD24 is a signal transducing molecule found on the surface of most human B cells that can
modulate their responses to activation signals, and is structurally closely related to CDPG. The
CD24 cDNA (approximately 300 bp) has been cloned (Kay, R. et al., 1991, J. Immunol. 147:1412)
and encodes a mature peptide of only 31 to 35 amino acids that is extensively glycosylated and
attached to the outer surface of the plasma membrane by a glycosyl phosphatidylinositol lipid
anchor. M1/69-J11d heat stable antigen is a genetically similar homologous murine peptide widely
expressed on a variety of hemopoietic cell types (Kay, R. et al., 1990, J. Immunol. 145:1952).

It is thus proposed that a recombinant viral vector can be used to successfully transfer and
express the CDPG gene in primitive hemopoietic stem cells such that they are able to repopulate
lethally irradiated recipients. Preferably foreign CDPG antigen expression in repopulated animals
persists post transplantation such that the biological function of the repopulated hemopoietic cells is
not affected by the expression of the CDPG antigen. CDPG may subsequently be found to be
expressed in any or all of the hemopoietic lineages including granulocytes, macrophages,
pro-erythrocytes, erythrocytes and T and B lymphocytes. Therefore, the cell surface protein CDPG may
be particularly useful as a marker for hematopoietic stem cells capable of repopulation in vivo and
as a selectable marker in gene therapy. The recombinant viral vectors also have the advantage that
the nucleotide sequence encoding the marker is very small, leaving a large amount of space for the
insertion of additional genes of interest such as those coding for exogenous genes.

In a preferred embodiment of the invention a recombinant viral vector is used to introduce
the nucleotide sequence into the cell. Preferably, the CDPG nucleotide sequence is operatively
linked to one or more regulatory elements. The recombinant viral vector of the invention may be
used as a marker for an exogenous gene to be expressed in a host cell. The invention further
provides a method of identifying a cell and progeny thereof comprising: providing a cell; infecting
the cell with a recombinant viral vector of the invention under suitable conditions to allow
expression of the cell surface protein CDPG on the cell; and, identifying the cell and progeny
thereof by detecting expression of the cell surface protein CDPG on the cell or progeny thereof.
Cells infected with a recombinant viral vector of the invention and expressing the cell surface
protein may be transplanted into a host, and the cell and progeny thereof may be identified after
transplantation by removing biological samples from the host, and assaying for cells expressing the
cell surface protein. A recombinant viral vector of the invention may be directly introduced into a
host.

ENRICHING STEM CELL COMPOSITIONS 

As it is proposed that CDPG is involved in early thymocyte development and is found on
the cell surface due to its GPI-anchor, CDPG nucleic acids and polypeptides of the present
invention may also be used to obtain novel antibody compositions useful for preparing cell
preparations containing human hematopoietic cells.

There is a continued interest in developing stem cell purification techniques. Pure
populations of stem cells will facilitate studies of hematopoiesis. Transplantation of hematopoietic
cells from peripheral blood and/or bone marrow is also increasingly used in combination with
high-dose chemo- and/or radiotherapy for the treatment of a variety of disorders including malignant,
nonmalignant and genetic disorders. Very few cells in such transplants are capable of long-term
hematopoietic reconstitution, and thus there is a strong stimulus to develop techniques for
purification of hematopoietic stem cells. Furthermore, serious complications and indeed the success
of a transplant procedure are to a large degree dependent on the effectiveness of the procedures that
are used for the removal of cells in the transplant that pose a risk to the transplant recipient. Such
cells include T lymphocytes that are responsible for graft versus host disease (GVHD) in allogeneic
grafts, and tumour cells in autologous transplants that may cause recurrence of the malignant
growth.

Hematopoietic cells have been separated on the basis of physical characteristics such as
density and on the basis of susceptibility to certain pharmacological agents which kill cycling cells.
The advent of monoclonal antibodies against cell surface antigens has greatly expanded the
potential to distinguish and separate distinct cell types. There are two basic approaches to separating
cell populations from bone marrow and peripheral blood using monoclonal antibodies. They differ
in whether it is the desired or undesired cells which are distinguished/labeled with the antibody(s).
In positive selection techniques the desired cells are labeled with antibodies and removed from the
remaining unlabeled/unwanted cells. In negative selection, the unwanted cells are labeled and
removed. Antibody/complement treatment and the use of immunotoxins are negative selection
techniques, but FACS sorting and most batch-wise immunoadsorption techniques can be adapted to
both positive and negative selection. In immunoadsorption techniques cells are selected with
monoclonal antibodies and preferentially bound to a surface which can be removed from the
remainder of the cells, e.g. a column of beads, flasks or magnetic particles. Immunoadsorption
techniques have won favor clinically and in research because they maintain the high specificity of
targeting cells with monoclonal antibodies but, unlike FACS sorting, they can be scaled up to deal
directly with the large numbers of cells in a clinical harvest and they avoid the dangers of using
cytotoxic reagents such as immunotoxins and complement.

Current positive selection techniques for the purification of hematopoietic stem cells target
and isolate cells which express CD34. However, positive selection procedures suffer from many
disadvantages including the presence of materials such as antibodies and/or magnetic beads on the
CD34+ cells, and damage to the cells resulting from the removal of these materials.

Negative selection has been used to remove minor populations of cells from clinical grafts.
These cells are either T-cells or tumour cells that pose a risk to the transplant recipient. The
efficiency of these purges varies with the technique and depends on the type and number of
antibodies used. Typically, the end product is very similar to the start suspension, missing only the
tumor cells or T-cells.

As described in U.S. Patent No. 5,877,299, Thomas et al developed a negative selection
technique that uses an antibody composition containing antibodies specific for glycophorin A, CD3,
CD24, CD16, CD14 and optionally CD45RA, CD36, CD2, CD19, CD56, CD66a, and CD66b,
which reportedly gave a cell preparation highly enriched for human hematopoietic and progenitor
cells. Maximum enrichment of early progenitor and stem cells (CD34+, CD38- cells) was observed
when anti-CD45RA and anti-CD36 were included in the antibody composition. However, as CDPG
is proposed as acting in early thymocyte development, CDPG may be used advantageously to
develop more effective antibody compositions for selecting hematopoietic stem cells. Accordingly,
the invention encompasses antibodies specific for CDPG polypeptides of the invention and antibody
compositions comprising, consisting of or consisting essentially of antibodies specific for CDPG,
glycophorin A, CD3, CD24, CD16, CD14 and optionally CD45RA, CD36, CD2, CD19, CD56,
CD66a, and CD66b.

Use of the antibody composition comprising antibodies specific for CDPG in a negative selection
technique to prepare a cell preparation which is enriched for hematopoietic stem cells and progenitor
cells may offer significant advantages over conventional techniques. The antibody composition is
applied in one step to a sample of peripheral blood, bone marrow, cord blood or frozen bone marrow,
preferably without additional enrichment steps which could result in loss of, or damage to,
progenitor and stem cells.

PRSG (PROLINE-RICH CALCIUM-BINDING PROTEIN) and HSTG (BASIC HISTIDINE RICH
SALIVARY GLAND PEPTIDE) (Clone ID: 338112 and 1000839315 respectively)

SEQ ID NOs: 22 and 358 and clone FL11:1000839315_220-26-1-0-F3-F encode the
polypeptides of SEQ ID NOs: 191 and 475 respectively, a basic histidine rich salivary gland peptide
referred to herein as HSTG and expected to have potent antimicrobial properties. Preferably, the
amino acid sequence of HSTG (SEQ ID NO: 475) comprises a tyrosine at amino acid position 40.
The HSTG protein also comprises a protein kinase C phosphorylation site at amino
acid position 51 (SSK). It will be appreciated that all characteristics and uses of the polynucleotides
of SEQ ID NOs: 22 and 358 and polypeptides of SEQ ID NOs: 191 and 475 described throughout
the present application also pertain to the human cDNA of clone 1000839315, and the polypeptides
encoded thereby.

SEQ ID NOs: 8 and 345 and clone FL11:338112_174-1-1-0-A11-F encode the polypeptides
of SEQ ID NOs: 177 and 462 respectively, a proline-rich protein referred to herein as PRSG,
believed to be a component of saliva and a calcium binding protein also possessing potent
antimicrobial properties. Preferably, the amino acid sequence of PRSG (SEQ ID NO: 462)
comprises a proline residue at amino acid position 96, an arginine residue at amino acid position
100, a glutamine residue at amino acid position 102, and/or a glycine residue at amino acid position
103. The PRSG protein also comprises casein kinase II phosphorylation sites at amino acid
positions 15 (SAQD), 24 (SQED), and 59 (SAGD) and an N-myristoylation site at amino acid
position 52 (GGQQSQ). It will be appreciated that all characteristics and uses of the
polynucleotides of SEQ ID NOs: 8 and 345 and polypeptides of SEQ ID NOs: 177 and 462
described throughout the present application also pertain to the human cDNA of clone 338112, and
the polypeptides encoded thereby.

A study of saliva and its tooth-protective components reveals at least four important 
functions of saliva: (1) buffering ability, (2) a cleansing effect, (3) antibacterial action, and (4)
maintenance of a saliva supersaturated in calcium phosphate. Several salivary constituents serve one
or more of these functions. Research has yielded important information about organic and inorganic
secretory products. It is also clear that saliva as a unique biologic fluid has to be considered in its
entirety to account fully for its effects on teeth. Saliva is greater than the sum of its parts. One
reason for this is that salivary components display redundancy of function, each often having more 
than one function. This redundancy, however, does not imply that proteins that share functional
roles all contribute to the same degree. For instance, when comparing proteins that inhibit calcium
phosphate precipitation, statherin and acidic proline-rich proteins are most potent, whereas histatins, 
cystatins, and mucins appear to play lesser roles. The complex interaction between proteins is 
another major factor contributing to saliva's function. In this regard, heterotypic complexes of 
various proteins have been shown to form on hydroxyapatite. Mucin binding to other salivary
proteins, including proline-rich proteins, histatins, cystatins, and statherin, is well documented. The
complexes, whether adsorbed to the tooth surface or in saliva, have important implications for 
bacterial clearance, selective bacterial aggregation on the tooth surface, and control of 
mineralization and demineralization. Finally, proteolytic activity of saliva generates numerous 
products whose biologic activities are often different from their parent compounds. The ability of
saliva to deliver fluoride to the tooth surface constantly makes salivary fluoride an important player
in caries protection largely by promoting remineralization and reducing demineralization. Saliva is 
well adapted to protection against dental caries. Saliva's buffering capability; the ability of the saliva 
to wash the tooth surface, to clear bacteria, and to control demineralization and mineralization; 
saliva's antibacterial activities; and perhaps other mechanisms all contribute to its essential role in
the health of teeth. The fact that the protective function of saliva can be overwhelmed by bacterial
action indicates the importance of prevention and therapy as in other infectious diseases. With
knowledge of salivary components and their interactions, the use of modified oral molecules as
therapeutic agents may become an important contributor to oral health.

Proline-rich proteins are major components of parotid and submandibular saliva in humans
as well as other animals. They can be divided into acidic, basic and glycosylated proteins. The
proline-rich proteins are apparently synthesized in the acinar cells of the salivary glands and their
phenotypic expression is under complex genetic control. The acidic proline-rich proteins bind
calcium with a strength which indicates that they may be important in maintaining the concentration
of ionic calcium in saliva. Moreover, they can inhibit formation of hydroxyapatite, whereby growth
of hydroxyapatite crystals on the tooth surface in vivo may be avoided. Both of these activities as
well as the binding site for hydroxyapatite are located in the N-terminal proline-poor part of the 
protein. 

Basic histidine-rich salivary gland peptides such as the peptides of SEQ ID NOs: 191 and
475, also referred to as histatins, are a group of electrophoretically distinct histidine-rich
polypeptides with microbicidal activity found in human parotid and submandibular gland
secretions. Histatins 1, 3, and 5 are homologous proteins that consist of 38, 32, and 24 amino acid
residues, respectively, and that have been shown to kill the pathogenic yeast Candida albicans. More
recently, histatins 2, 4, 6, and 7-12 were isolated and characterized (Troxler RF et al, J Dent Res 1990
Jan;69(1):2-6). Histatin 2 was found to be identical to the carboxyl terminal 26 residues of histatin 1;
histatin 4 was found to be identical to the carboxyl terminal 20 residues of histatin 3; and histatin 6
was found to be identical to histatin 5, but contained an additional carboxyl terminal arginine
residue. The amino acid sequences of histatins 7-12 formally corresponded to residues 12-24, 13-24,
12-25, 13-25, 5-11, and 5-12, respectively, of histatin 3, but could also arise proteolytically from
histatin 5 or 6. Troxler et al provides further guidance on the structural elements and relationship of
histatins to one another in the context of their genetic origin, biosynthesis and secretion into the oral
cavity, and potential as reagents in anti-candidal studies. The HSTG polypeptide and fragments
thereof are therefore expected to have valuable properties and uses in antimicrobial applications,
particularly in antifungal applications. Supporting such uses is a considerable body of evidence,
including MacKay BJ et al, Infect Immun. 1984 Jun;44(3):695-701, Growth-inhibitory and
bactericidal effects of human parotid salivary histidine-rich polypeptides on Streptococcus mutans;
MacKay BJ et al, Infect Immun. 1984 Jun;44(3):688-94, Isolation of milligram quantities of a group
of histidine-rich polypeptides from human parotid saliva; Pollock JJ et al, Infect Immun. 1984
Jun;44(3):702-7, Fungistatic and fungicidal activity of human parotid salivary histidine-rich
polypeptides on Candida albicans; and Xu T et al, Infect Immun. 1991 Aug;59(8):2549-54,
Anticandidal activity of major human salivary histatins. Furthermore, tissue distribution of RNAs
for cystatins, histatins, statherin, and proline-rich salivary proteins in humans and macaques is
further discussed in Sabatini et al, J Dent Res 1989 Jul;68(7):1138-45.
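
The residue relationships among the histatins described above can be illustrated by deriving each fragment from a parent sequence using 1-based, inclusive residue ranges. The following is a sketch under stated assumptions: the 32-residue histatin 3 sequence used here is the mature sequence as commonly reported in public databases and is not recited in the present application, histatin 5 is assumed to correspond to residues 1-24 of histatin 3, and the helper name `residues` is hypothetical. The sequence should be verified against the sequence listing or a curated database before any use.

```python
# Assumption: mature human histatin 3 as commonly reported in public databases
# (32 residues); this sequence is NOT taken from the present text.
HISTATIN_3 = "DSHAKRHHGYKRKFHEKHHSHRGYRSNYLYDN"


def residues(sequence, start, end):
    """Return residues start..end of sequence (1-based, inclusive)."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("residue range outside of sequence")
    return sequence[start - 1:end]


# Assumption: histatin 5 corresponds to residues 1-24 of histatin 3.
HISTATIN_5 = residues(HISTATIN_3, 1, 24)
# From the text: histatin 6 is histatin 5 plus one carboxyl terminal arginine.
HISTATIN_6 = HISTATIN_5 + "R"
# From the text: histatin 4 is the carboxyl terminal 20 residues of histatin 3.
HISTATIN_4 = HISTATIN_3[-20:]

# From the text: residue ranges of histatins 7-12 within histatin 3.
FRAGMENT_RANGES = {7: (12, 24), 8: (13, 24), 9: (12, 25),
                   10: (13, 25), 11: (5, 11), 12: (5, 12)}
FRAGMENTS = {n: residues(HISTATIN_3, start, end)
             for n, (start, end) in FRAGMENT_RANGES.items()}

if __name__ == "__main__":
    for n, fragment in sorted(FRAGMENTS.items()):
        parents = [name for name, parent in (("histatin 5", HISTATIN_5),
                                             ("histatin 6", HISTATIN_6))
                   if fragment in parent]
        print(f"histatin {n}: {fragment} (also contained in: {', '.join(parents)})")
```

Under these assumptions, each of histatins 7-12 is contained in histatin 6, which is consistent with the statement that they could also arise proteolytically from histatin 5 or 6.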

In further embodiments, the skilled artisan will appreciate that fragments and analogues of
PRSG and HSTG may readily be generated and selected. Selection of preferred fragments and
analogues may be carried out by assaying for a desired antimicrobial activity. For example, synthetic
histatin analogues and methods for obtaining such analogues with broad-spectrum antimicrobial
activity are described in Helmerhorst EJ et al, Biochem J 1997 Aug 15;326 (Pt 1):39-45, where
histatin analogues inhibited the growth of the second most common yeast found in clinical isolates,
Torulopsis glabrata, of oral and non-oral pathogens such as Prevotella intermedia and
Streptococcus mutans, and of a methicillin-resistant Staphylococcus aureus.

Thus, in preferred embodiments, the PRSG and/or HSTG polypeptides or fragments thereof
may be used in oral, injectable, topical or edible compositions for the treatment of infection. PRSG
and/or HSTG polypeptides may also be used as antimicrobial/antifungal compositions for
disinfection of surfaces (e.g. in industrial settings).

In a preferred example further discussed below, the PRSG and/or HSTG polypeptides or
fragments thereof are used in oral, topical (e.g. mouthwash) or edible compositions optionally
containing additional salivary proteins to provide an anticaries effect. While there is an interest in
developing and marketing products which reduce caries without reliance on a high level of fluoride
ions (such as in fluoridated water and fluoride toothpastes), there have not been many reports of
such approaches meeting with success. While certain cysteine-rich proteins have been proposed
as useful in the treatment of dental caries (U.S. Patent No. 5,688,766, Revis et al), the present
invention provides PRSG and HSTG polypeptides which may provide higher potency, efficacy and
range of disinfection and protection.

PRSG and HSTG polypeptide compositions can be administered in a formulation
comprising a carrier. Preferred carrier compositions for the active(s) of this invention are oral
compositions. Such compositions include toothpastes, mouthrinses, liquid dentifrices, lozenges,
chewing gums or other vehicles suitable for use in the oral cavity. Toothpastes and mouthrinses are
the preferred systems. The abrasive polishing material contemplated for use in the toothpaste
compositions of the present invention can be any material which does not excessively abrade dentin.
These include, for example, silicas including gels and precipitates, calcium carbonate, dicalcium
orthophosphate dihydrate, calcium pyrophosphate, tricalcium phosphate, calcium
polymetaphosphate, insoluble sodium polymetaphosphate, hydrated alumina, and resinous abrasive
materials such as particulate condensation products of urea and formaldehyde, and others such as
those disclosed by Cooley et al. in U.S. Pat. No. 3,070,510, Dec. 25, 1962, incorporated herein by
reference. Mixtures of abrasives may also be used. Silica dental abrasives, of various types, can
provide the unique benefits of exceptional dental cleaning and polishing performance without
unduly abrading tooth enamel or dentin. For these reasons, they are preferred for use herein.
Flavoring agents can also be added to toothpaste compositions. Suitable flavoring agents include oil
of wintergreen, oil of peppermint, oil of spearmint, oil of sassafras, and oil of clove. Sweetening
agents which can be used include aspartame, acesulfame, saccharin, dextrose, levulose and sodium
cyclamate. Flavoring and sweetening agents are generally used in toothpastes at levels of from
about 0.005% to about 2% by weight. Toothpaste compositions can also contain emulsifying agents.
Suitable emulsifying agents are those which are reasonably stable and foam throughout a wide pH
range, including non-soap anionic, nonionic, cationic, zwitterionic and amphoteric organic synthetic
surfactants. Water is also present in the toothpastes of this invention. Water employed in the
preparation of commercially suitable toothpastes should preferably be deionized and free of organic
impurities. In preparing toothpastes, it is necessary to add some thickening material to provide a
desirable consistency. Preferred thickening agents are carboxyvinyl polymers, carrageenan,
hydroxyethyl cellulose and water soluble salts of cellulose ethers such as sodium carboxymethyl
cellulose and sodium carboxymethyl hydroxyethyl cellulose. Natural gums such as gum karaya,
xanthan gum, gum arabic, and gum tragacanth can also be used. Colloidal magnesium aluminum
silicate or finely divided silica can be used as part of the thickening agent to further improve
texture. Thickening agents in an amount from 0.2% to 5.0% by weight of the total composition can
be used. It is also desirable to include some humectant material in a toothpaste to keep it from
hardening. Suitable humectants include glycerin, sorbitol, and other edible polyhydric alcohols at a
level of from about 15% to about 70%.

Another preferred embodiment of the present invention is a mouthwash composition.
Conventional mouthwash composition components can comprise the carrier for the agents of the
present invention. Mouthwashes generally comprise a water/ethyl alcohol solution at a ratio of from
about 20:1 to about 2:1, or are alcohol free, and preferably comprise other ingredients such as flavor,
sweeteners, humectants and sudsing agents such as those mentioned above for dentifrices. The
humectants, such as glycerin and sorbitol, give a moist feel to the mouth. Generally, on a weight basis the
mouthwashes of the invention comprise 0% to 60% (preferably 5% to 20%) ethyl alcohol, 0% to
20% (preferably 5% to 20%) of a humectant, 0% to 2% (preferably 0.01% to 1.0%) emulsifying
agents, 0% to 0.5% (preferably 0.005% to 0.06%) sweetening agent such as saccharin or natural
sweeteners such as stevioside, 0% to 0.3% (preferably 0.03% to 0.3%) flavoring agent, and the
balance water.
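
The weight-percent ranges recited above for mouthwash carriers can be expressed as a simple range check. The following is a minimal sketch only: the ingredient keys, the function name `check_mouthwash`, and the example formulation in the usage note are hypothetical, and the level of the PRSG and/or HSTG active is not modeled because no level is recited above.

```python
# Broad and preferred ranges, in percent by weight, as recited in the text.
# The dictionary keys are hypothetical labels chosen for this sketch.
RANGES = {
    "ethyl_alcohol": ((0.0, 60.0), (5.0, 20.0)),
    "humectant": ((0.0, 20.0), (5.0, 20.0)),
    "emulsifying_agent": ((0.0, 2.0), (0.01, 1.0)),
    "sweetening_agent": ((0.0, 0.5), (0.005, 0.06)),
    "flavoring_agent": ((0.0, 0.3), (0.03, 0.3)),
}


def check_mouthwash(formulation):
    """Classify each ingredient level and compute the water balance.

    formulation maps ingredient name to weight percent, water excluded.
    Returns (water_percent, report), where report maps each ingredient to
    'preferred', 'allowed' or 'out of range'.
    """
    unknown = set(formulation) - set(RANGES)
    if unknown:
        raise ValueError(f"unknown ingredients: {sorted(unknown)}")
    report = {}
    for name, (broad, preferred) in RANGES.items():
        percent = formulation.get(name, 0.0)
        if not broad[0] <= percent <= broad[1]:
            report[name] = "out of range"
        elif preferred[0] <= percent <= preferred[1]:
            report[name] = "preferred"
        else:
            report[name] = "allowed"
    water = 100.0 - sum(formulation.values())
    if water < 0:
        raise ValueError("ingredients exceed 100% by weight")
    return water, report


if __name__ == "__main__":
    example = {"ethyl_alcohol": 10.0, "humectant": 10.0,
               "emulsifying_agent": 0.5, "sweetening_agent": 0.05,
               "flavoring_agent": 0.1}
    print(check_mouthwash(example))
```

An alcohol-free formulation is represented by omitting `ethyl_alcohol` or setting it to 0, which falls within the broad range but outside the preferred range.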

The pH of the present compositions and/or the pH in the mouth can be any pH which is safe
for the mouth's hard and soft tissues. Such pH's are generally from about 3 to about 10, preferably
from about 4 to about 8. Other acceptable oral carriers include gums, lozenges, as well as other
forms. Such suitable forms are disclosed in U.S. Pat. No. 4,083,955, Apr. 11, 1978 to Grabenstetter
et al., incorporated herein in its entirety by reference. Edible compositions are also suitable for use
as the carrier compositions herein. Edible compositions include many types of solid as well as
liquid compositions. Such compositions include, for example, soft drinks, citrus drinks, cookies,
cakes, breads among many others. Such compositions may contain sugar or another sweetener,
water, flour, shortening, other fibers such as wheat, corn, barley, rye, oats, psyllium and mixtures
thereof.

HSDG (HYDROXYSTEROID DEHYDROGENASE) (Clone ID: 495917)

SEQ ID NOS 54 and 381 and clone FL11:495917J60-22-4-0-D8-F encode the polypeptides
of SEQ ID NOS 223 and 498, respectively, a hydroxysteroid dehydrogenase referred to herein as
HSDG. As the HSDG polypeptide is implicated in steroid hormone regulation, preferably
glucocorticoid metabolism, HSDG may be useful in any applications where steroid hormone levels
are to be increased or inhibited. HSDG may be useful in the treatment of disease treatable by steroid
hormones. HSDG inhibitors may also be useful in systems for self-sufficient biosynthesis of steroid
hormones such as glucocorticoids, for example in engineered cells comprising elements of the synthesis
pathway. Inhibitors of endogenous HSDG activity may allow the recovery of higher
amounts of glucocorticoids and/or other synthesized steroid hormones from these cell systems.
It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID
NOs: 54 and 381 and polypeptides of SEQ ID NOs: 223 and 498, described throughout the present
application, also pertain to the human cDNA of clone 495917, and the polypeptides encoded
thereby.

The HSDG polypeptides of the invention comprise a leucine zipper pattern (Prosite ref.
PS00029) at amino acid positions 58-79 as well as an N-myristoylation site at amino acid position 36
(GANAGV) of SEQ ID NO 498. In one example, HSDG polypeptides may be used in the
production or therapeutic modulation of glucocorticoids. The skilled artisan will recognize that any
suitable HSDG polypeptides or variants or fragments thereof capable of metabolizing
glucocorticoids can readily be used.

In one embodiment, HSDG activity can be determined by detecting levels of glucocorticoids
or metabolites of glucocorticoids. Moreover, structural aspects of 11beta-hydroxysteroid
dehydrogenase have been documented in the art and may serve as guidelines for developing
suitable HSDG variants and fragments. The HSDG polypeptides of the invention may allow
modulation of glucocorticoid activity or identification (e.g. drug screening) of compounds capable
of specifically modulating glucocorticoid levels or activity. HSDG may be useful in allowing the
modulation of steroid hormone synthesis, or glucocorticoid synthesis, to be carried out in a tissue
specific manner, thereby offering improved methods for treating disease with decreased risk of side
effects.

Corticosteroids, also referred to as glucocorticoids, are steroid hormones, the most common
form of which is cortisol. Modulation of glucocorticoid activity is important in regulating
physiological processes in a wide range of tissues and organs. Glucocorticoids act within the gonads
to directly suppress testosterone production (Monder, C et al, (1994) Steroids 59, 69-73). High
levels of glucocorticoids may also result in excessive salt and water retention by the kidneys,
producing high blood pressure.

Glucocorticoid action is mediated via binding of the molecule to a receptor, defined 
hereinafter as either a mineralocorticoid receptor (MR) or a glucocorticoid receptor (GR). 
Krozowski, Z. S. et al, ((1983) Proc. Natl. Acad. Sci. USA 80, 6056-6060) and Beaumont, K. et al, 
((1983) Endocrinology 113, 2043-2049) showed that MRs of adrenalectomised rats have an equal
affinity for the mineralocorticoid aldosterone and glucocorticoids, for example corticosterone and
cortisol. Confirmatory evidence has been found for human MR (Arriza, J. L et al, (1988) Neuron 1,
887-900). In patients suffering from the congenital syndrome of Apparent Mineralocorticoid
Excess (AME), cortisol levels are reportedly elevated, and cortisol binds to and activates MRs normally
occupied by aldosterone, the steroid that regulates salt and water balance in the body. Salt and water
are retained in AME patients, causing severe hypertension.

Like HSDG, the enzyme 11β-hydroxysteroid dehydrogenase (11βHSD), also discussed in
U.S. Patent No. 5,965,372 (Funder et al), may be involved in converting glucocorticoids into
metabolites that are unable to bind to MRs (Edwards et al, (1988) Lancet 2: 986-9; Funder et al,
(1988) Science 242, 583-585), present in mineralocorticoid target tissues, for example kidney,
pancreas, small intestine, colon, as well as the hippocampus, placenta and gonads. For example, in
aldosterone target tissues 11βHSD inactivates glucocorticoid molecules, allowing the much lower
circulating levels of aldosterone to maintain renal homeostasis. When the 11βHSD enzyme is
inactivated, for example in AME patients or following administration of glycyrrhetinic acid, a
component of licorice, severe hypertension results. Further, placental 11βHSD activity may protect
the foetus from high circulating levels of glucocorticoid which may predispose to hypertension in
later life (Edwards et al., 1993). Biochemical characterisation of activity has indicated the presence
of at least two 11βHSD isoenzymes (11βHSD1 and 11βHSD2) with different cofactor requirements
and substrate affinities. The 11βHSD1 enzyme is a low affinity enzyme that prefers NADP+ as a
cofactor (Agarwal et al., 1989). The 11βHSD2 enzyme is a high affinity enzyme (Km for
glucocorticoid = 10 nM), requiring NAD+, not NADP+, as the preferred cofactor, belonging to a class
of glucocorticoid dehydrogenase enzymes hereinafter referred to as "NAD+ dependent
glucocorticoid dehydrogenase" enzymes.
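
To illustrate what a Km of 10 nM implies for a high affinity enzyme such as 11βHSD2, the fraction of maximal velocity reached at a given glucocorticoid concentration can be estimated. The following is a sketch only and assumes simple Michaelis-Menten kinetics, which is not stated above; the function name `fractional_velocity` and the substrate concentrations in the loop are hypothetical and are not measured values.

```python
def fractional_velocity(substrate_nm, km_nm):
    """Return v/Vmax = [S] / (Km + [S]) under Michaelis-Menten kinetics."""
    if substrate_nm < 0 or km_nm <= 0:
        raise ValueError("substrate must be >= 0 and Km must be > 0")
    return substrate_nm / (km_nm + substrate_nm)


# Km for glucocorticoid recited in the text for the high affinity isoenzyme.
KM_11BHSD2_NM = 10.0

if __name__ == "__main__":
    # Illustrative substrate concentrations only (assumption).
    for substrate_nm in (1.0, 10.0, 100.0):
        fraction = fractional_velocity(substrate_nm, KM_11BHSD2_NM)
        print(f"[S] = {substrate_nm:6.1f} nM -> v/Vmax = {fraction:.2f}")
```

By definition, the enzyme runs at half of its maximal velocity when the substrate concentration equals Km, so an enzyme with a Km of 10 nM approaches saturation at nanomolar glucocorticoid levels.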

An inverse correlation between 11βHSD enzyme activity in human granulosa-lutein cells and
the success of IVF has further been shown, suggesting that activity of this enzyme might be related
to the success of embryo attachment and implantation following IVF. The measurement of ovarian
11βHSD enzyme activity as a prognostic indicator for the outcome of assisted conception in all
species is the subject of UK Patent Application No 9305984. However, the disclosure of Michael et
al. ((1993) Lancet 342, 711-712), and corresponding UK Patent Application No 9305984, do not
identify, or even suggest, which isoenzyme in the ovary might be a predictive indicator of IVF
embryo transfer, or a means of distinguishing isoenzymes of 11βHSD in the prediction of IVF
embryo transfer outcomes. In fact, the enzyme assay procedure might detect all isoenzymes of
11βHSD activity in the cell, some of which may be hitherto uncharacterised.

Thus, the human HSDG hydroxysteroid dehydrogenase enzyme and the nucleic acids
encoding it provide novel means for the development of gene therapies and identification of
HSDG activators and inhibitors which alter the endogenous activity of this hydroxysteroid
dehydrogenase enzyme in a cell. The present invention also permits the screening, through genetic
or immunological means, of levels of expression of genes encoding the NAD+ dependent
glucocorticoid dehydrogenase enzyme in various tissue or organ types, including for example, skin,
colon, kidney, placenta, and gonads, amongst others.

S100G CALCIUM BINDING PROTEIN (Clone ID: 200895)

The polynucleotides of clone FL11:200895_116-055-1-0-H11-F and SEQ ID NOS 132 and
428 encode the S100 calcium binding protein referred to herein as S100G. SEQ ID NOS 301
and 542 provide the amino acid sequences corresponding to the nucleic acid sequences of SEQ ID
NOS 132 and 428, respectively. It will be appreciated that all characteristics and uses of the
polynucleotides of SEQ ID NOs: 132 and 428 and polypeptides of SEQ ID NOs: 301 and 542,
described throughout the present application, also pertain to the human cDNA of clone 200895, and
the polypeptides encoded thereby.

BACKGROUND 

In nearly all eukaryotic cells, calcium (Ca2+) functions as an intracellular signaling
molecule in diverse cellular processes including cell proliferation and differentiation,
neurotransmitter secretion, glycogen metabolism, and skeletal muscle contraction. Within a resting
cell, the concentration of Ca2+ in the cytosol is extremely low, <10^-7 M. However, when the cell is
stimulated by an external signal, such as a neural impulse or a growth factor, the cytosolic
concentration of Ca2+ increases by about 50-fold. This influx of Ca2+ is caused by the opening of
plasma membrane Ca2+ channels and the release of Ca2+ from intracellular stores such as the
endoplasmic reticulum. Ca2+ directly activates regulatory enzymes, such as protein kinase C,
which trigger signal transduction pathways.

The protein of SEQ ID NOS 301 and 542 is a calcium binding S100 protein typically found
in heart and muscle. S100 proteins are low-molecular weight calcium binding proteins that are
believed to play an important role in various cellular processes such as cytoskeletal/membrane
interactions, cell division and differentiation. The expression of S100 proteins has been evaluated
in a variety of disorders. For example, S100 proteins have been evaluated as markers of
inflammatory disease, including ulcerative colitis and Crohn's disease, and as serum markers for
subjects with infectious diseases including AIDS and malaria and for subjects with hematological
disease. Evidence has accumulated that indicates that S100 proteins can alter cellular invasion and
metastatic spread of cancer. For example, S100 protein is expressed in dendritic cells in human
transitional cell carcinoma of the bladder and the invasive potential of these tumors has been found
to correlate with the presence of S100 protein expressing cells.

THERAPEUTICS AND DIAGNOSTICS 

The S100G protein of SEQ ID NOS 301 and 542 disclosed herein provides new calcium
binding S100 protein compositions useful in the diagnosis, prevention, and treatment of cancer,
reproductive disorders, immune disorders, neuronal disorders, vesicle trafficking disorders and
developmental disorders.

Calcium binding proteins (CBPs) are implicated in a variety of disorders and several CBPs have
proven to be effective therapeutic targets for which small molecule inhibitors could be developed.
However, while several CBPs are targets for widely-used therapeutic treatments, it would be
advantageous to provide further CBPs allowing more selective therapeutic treatments for disease to
be developed. In one example, calcineurin is found in the cells of all eukaryotes ranging from yeast
to mammals. Calcineurin is a target for inhibition by the immunosuppressive agents cyclosporin A
and FK506, emphasizing its importance in immune disorders (Kissinger, C. R. et al. (1995) Nature
378:641-644). Calcineurin also plays a critical role in transcriptional regulation and growth control
in T-lymphocytes (Wang, M. G. et al. (1996) Cytogenet. Cell Genet. 72:236-241). However, inhibition
of calcineurin phosphatase activity has been implicated both in the mechanism of
immunosuppression and in the observed toxic side effects of FK506 in nonlymphoid cells,
suggesting that, through identification of new FK506 binding proteins (FKBPs) that can mediate
calcineurin inhibition and are restricted in their expression to T cells, new immunosuppressive drugs
may be identified that, by virtue of their specific interaction with the FKBP, would be targeted in
their site of action (Baughman G, et al Mol Cell Biol 1995 Aug;15(8):4395-402). In another CBP
example, levels of calmodulin (CaM) are increased several-fold in tumors and tumor-derived cell
lines for various types of cancer (Rasmussen, C. D. and Means, A. R. (1989) Trends in Neuroscience
12:433-438). Calcium binding S100β is another example of a CBP involved in a variety of
disorders. Like the S100G protein of the invention, S100β contains an EF-hand motif. S100β is
abundantly expressed in the nervous system. S100β levels are increased in the blood and
cerebrospinal fluid of patients with neurological injury resulting from cerebral infarction, transient
ischemic attacks, hemorrhagia, head trauma, and Down's syndrome. Furthermore, S100β and other
neural-specific CBPs may also protect against neurodegenerative disorders, such as Alzheimer's,
Parkinson's, and Huntington's diseases. S100β is produced and secreted by glial cells in the central
and peripheral nervous systems (Allore, R. J. et al. (1990) J. Biol. Chem. 265:15537-15543). The
accumulation of S100β in mature glial cells is associated with the microtubule network. S100β
promotes neuronal differentiation and survival but may be detrimental to cells if overexpressed. Its
selective overproduction has been implicated in the progression of the neuropathological changes in
Alzheimer's disease, which may involve mitotic protein kinases (Marshak, D. R. and Pena, L. A.
(1992) Prog. Clin. Biol. Res. 379:289-307). Adult T-cell leukaemia (ATL) is a mature T-cell
malignancy which is caused by human T lymphotropic virus type-1. Diminished surface
expression of the T-cell receptor alpha beta (TCRαβ+) complex is a specific feature of ATL cells.
S100β is not detectable in CD4+, TCRαβ+ ATL cells, but is expressed in CD4-, CD8-, TCRαβ+
leukaemic cells from four ATL patients. This suggested that increased levels of S100β may be
associated with the diminished surface expression of the TCRαβ+ complex in ATL (Suzushima, H.
et al. (1994) Leuk. Lymphoma 13:257-262). Elevated serum levels of S100β are associated with
disseminated malignant melanoma metastases, suggesting that serum S100β may be of value as a
clinical marker for progression of metastatic melanoma (Henze, G. et al. (1997) Dermatology
194:208-212). In yet another example, messenger RNA levels encoding human calgizzarin (an
S100-like protein), as well as those encoding phospholipase A2, are elevated in colorectal cancers
compared with those of normal colorectal mucosa (Tanaka, M. et al. (1995) Cancer Lett. 89:195-200).
Finally, an intracellular S100 calcium-binding protein has been isolated from rat peritoneum.
This protein, MRP14, is one of two migration inhibitory factor-related proteins that are expressed in
peritoneal macrophages in the arthritis-susceptible Lewis/N rat (Imamichi, T. et al. (1993) Biochem.
Biophys. Res. Comm. 194:819-825).

However, despite the many uses and therapeutics based on S100 proteins and other CBPs, it
would be advantageous to selectively target CBPs which are involved in a disorder and which are
found specifically in the targeted cells or tissues. Thus, in a first embodiment, the S100G protein of
the invention can be used for the development of selective inhibitors of calcium signaling, e.g.
preferably inhibitors of transcriptional regulation and cell growth control.

In one embodiment, an antagonist of S100G may be administered to a subject to prevent or
treat a neuronal disorder. Such disorders may include, but are not limited to, akathisia, Alzheimer's
disease, amnesia, amyotrophic lateral sclerosis, bipolar disorder, catatonia, cerebral neoplasms,
dementia, depression, Down's syndrome, tardive dyskinesia, dystonias, epilepsy, Huntington's
disease, multiple sclerosis, neurofibromatosis, Parkinson's disease, paranoid psychoses,
schizophrenia, and Tourette's disorder. In one aspect, an antibody which specifically binds S100G
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing
a pharmaceutical agent to cells or tissue which express S100G.

In one embodiment, an antagonist of S100G may be administered to a subject to prevent or
treat a vesicle trafficking disorder. Such disorders may include, but are not limited to, cystic
fibrosis, glucose-galactose malabsorption syndrome, hypercholesterolemia, diabetes mellitus,
diabetes insipidus, hyper- and hypoglycemia, Graves' disease, goiter, Cushing's disease, and
Addison's disease; gastrointestinal disorders including ulcerative colitis, gastric and duodenal
ulcers; other conditions associated with abnormal vesicle trafficking including AIDS; allergies
including hay fever, asthma, and urticaria (hives); autoimmune hemolytic anemia; proliferative
glomerulonephritis; inflammatory bowel disease; multiple sclerosis; myasthenia gravis; rheumatoid
and osteoarthritis; scleroderma; Chediak-Higashi and Sjogren's syndromes; systemic lupus
erythematosus; toxic shock syndrome; traumatic tissue damage; and viral, bacterial, fungal,
helminth, and protozoal infections. In one aspect, an antibody which specifically binds S100G may
be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a
pharmaceutical agent to cells or tissue which express S100G.

In one embodiment, an antagonist of S100G may be administered to a subject to prevent or
treat an immunological disorder. Such disorders may include, but are not limited to, AIDS,
Addison's disease, adult respiratory distress syndrome, allergies, anemia, asthma, atherosclerosis,
bronchitis, cholecystitis, Crohn's disease, ulcerative colitis, atopic dermatitis, dermatomyositis,
diabetes mellitus, emphysema, erythema nodosum, atrophic gastritis, glomerulonephritis, gout,
Graves' disease, hypereosinophilia, irritable bowel syndrome, lupus erythematosus, multiple
sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis,
pancreatitis, polymyositis, rheumatoid arthritis, scleroderma, Sjogren's syndrome, Werner
syndrome, and autoimmune thyroiditis; complications of cancer, hemodialysis, and extracorporeal
circulation; viral, bacterial, fungal, parasitic, protozoal, and helminthic infections; and trauma. In
one aspect, an antibody which specifically binds S100β may be used directly as an antagonist or
indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissue
which express S100G.

In one embodiment, an antagonist of S100G may be administered to a subject to prevent or
treat a neoplastic disorder. Such disorders may include, but are not limited to, adenocarcinoma,
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of
the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia,
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis,
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus. In one aspect, an antibody
which specifically binds S100β may be used directly as an antagonist or indirectly as a targeting or
delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express S100G.

In addition, the S100G nucleic acids and polypeptides of the present invention can be used
to identify compounds for the treatment of a subject experiencing negative side effects from the
administration of other pharmaceuticals, such as those drugs that disrupt the body's calcium
homeostasis. Co-administration of said compounds would be useful to counteract iatrogenically
caused dysfunction of calcium metabolism.

An antagonist of S100G may be produced using methods which are generally known in the
art. In particular, purified S100G may be used to produce antibodies or to screen libraries of
pharmaceutical agents to identify those which specifically bind S100G. In one example, a positive
screening for drugs that specifically inhibit the Ca2+-signaling activity was carried out on the basis
of the growth promoting effect on a yeast mutant with a peculiar phenotype (Shitamukai A et al.,
Biosci Biotechnol Biochem 2000 Sep;64(9):1942-6). An inappropriate activation of a signaling
pathway in yeast often has a deleterious physiological effect and causes various defects, including
growth defects. In a certain genetic background (Δzds1) of Saccharomyces cerevisiae, cell-cycle
progression in G2 is specifically blocked in medium containing CaCl2 by the hyperactivation of
the Ca2+-signaling pathways. Shitamukai et al. provide an example of a drug screening procedure
designed to detect the active compounds that specifically attenuate the Ca2+-signaling activity on
the basis of the ability to abrogate the growth defect of the cells suffering from the hyperactivated
Ca2+ signal. Screening conditions were established for the drugs that suppress the Ca2+-induced
growth inhibition using known calcineurin inhibitors as model compounds, and an indicator strain
with an increased drug sensitivity was constructed with a syr1/erg3 null mutation.

In another embodiment, a vector expressing the complement of the polynucleotide encoding
S100P may be administered to a subject to treat or prevent a neuronal disorder, immunological
disorder, neoplastic disorder or vesicle trafficking disorder including, but not limited to, those
described above. In other embodiments, any of the proteins, antagonists, antibodies, agonists,
complementary sequences or vectors of the invention may be administered in combination with
other appropriate therapeutic agents. Selection of the appropriate agents for use in combination
therapy may be made by one of ordinary skill in the art, according to conventional pharmaceutical
principles. The combination of therapeutic agents may act synergistically to effect the treatment or
prevention of the various disorders described above. Using this approach, one may be able to
achieve therapeutic efficacy with lower dosages of each agent, thus reducing the potential for
adverse side effects.

Expression of S100 proteins has been evaluated as a serum marker for melanoma (Henze et
al. Dermatology 194:208-212; Buer et al., 1997 Brit. J. Cancer 75:1373-1376; Sherbert et al. 1998;
Anticancer Res. 18:2415-2422) and more recently as a serum marker for cancer in general
(particularly breast, colon and lung cancers), and their detectability in serum can have
prognostic and/or therapeutic significance in cancer. Furthermore, auto-antibodies to S100 proteins
were found in cancer patients (International Patent Publication No. WO 00/26668). The S100G
protein of the invention may thus be used in the diagnosis and prevention of cancer, for
identification of subjects predisposed to cancer, and for monitoring patients undergoing treatment for
cancer, based on the increased level of S100G protein(s) in biological fluid samples of subjects.
Methods for diagnosis and prognosis of cancer in a subject may comprise

a) detecting an S100G protein in a biological fluid sample obtained from a subject, and

b) comparing the level of protein detected in the subject's sample to the level of protein
detected in a control sample,

wherein an increase in the level of S100G protein detected in the subject's sample as
compared to control samples is an indicator of a subject with cancer or at increased risk for cancer.
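
By way of illustration only, the comparison recited in steps a) and b) above might be sketched as follows. The function name, the decision rule (control mean plus two standard deviations) and the numeric values are assumptions introduced for this sketch; the disclosure itself does not specify a cutoff or units.

```python
from statistics import mean, stdev


def is_elevated(subject_level, control_levels, n_sd=2.0):
    """Return True if the subject's S100G level exceeds the control cutoff.

    The cutoff (control mean + n_sd * control standard deviation) is a
    hypothetical decision rule chosen for illustration only.
    """
    if len(control_levels) < 2:
        raise ValueError("need at least two control measurements")
    cutoff = mean(control_levels) + n_sd * stdev(control_levels)
    return subject_level > cutoff


# Hypothetical measurements in arbitrary units (not data from the disclosure).
controls = [1.0, 1.2, 0.9, 1.1, 1.0]
print(is_elevated(2.5, controls))   # level well above the control range
print(is_elevated(1.05, controls))  # level within the control range
```

In practice the cutoff would be chosen from validated control populations for the particular assay; the two-standard-deviation rule here is only a placeholder.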

The invention also comprises methods for diagnosis and prognosis of a subject with cancer
comprising:

a) contacting a serum sample derived from a subject with a sample containing S100G
protein antigens under conditions such that a specific antigen-antibody complex binding can occur;
and

b) detecting the presence of immunospecific binding of autoantibodies present in the
subject's serum samples to the S100G protein;

wherein the presence of immunospecific binding of autoantibodies indicates the presence of
cancer.

Assays for detection of S100G protein in a sample can be accomplished by any suitable
method, including immunoassays wherein S100G proteins are detected by their interaction with an
S100G specific antibody. In addition, reagents other than antibodies, such as, for example,
polypeptides that specifically bind S100G, may be used.

In yet further embodiments, the S100G protein of the invention may be useful for the
development of specific anti-S100 antibodies which are specific for S100 proteins other than
S100G. As several S100 proteins have been implicated in disease, it may be advantageous to
develop a panel of S100 specific antibodies to characterize disease, e.g., cancers. Thus, S100G
proteins and antibodies thereto may be advantageous to distinguish cancer types. S100G proteins
and antibodies may also be useful in the screening of S100 specific antibodies by determining the
selectivity of a given anti-S100 antibody for its target, and eliminating antibodies which are
cross-reactive with S100G proteins.

STRUCTURAL ASPECTS OF S100 PROTEINS OF THE INVENTION

The S100 proteins are a group of low molecular mass (approximately 10-12 kDa) acidic
Ca2+-binding proteins, so named after the solubility of the first isolated protein in 100% saturated
ammonium sulfate. The most striking conserved feature of these proteins is the presence of an
EF-hand. The S100 proteins have two Ca2+-binding domains. One of these domains is a basic
helix-loop-helix domain, the other domain is an acidic helix-loop-helix EF-hand (Kligman, D. and Hilt,
D. C. (1988) Trends Biochem. Sci. 13:437-442). The EF-hand domain also encompasses a part of a
region within S100 proteins which specifically identifies members of the S100 family of proteins
which have a low affinity for Ca2+ ions (S100/ICaBP; PROSITE PS00303, SWISSPROT, PFAM
PF01023). The EF-hand is characterized by a twelve amino acid residue-containing loop, flanked
by two alpha-helices, orientated approximately 90 degrees with respect to one another. Aspartate
(D) and glutamate (E) residues are usually found bordering the twelve amino acid loop. In addition,
a conserved glycine residue in the central portion of the loop is found in most Ca2+-binding
EF-hand domains. Oxygen ligands within this domain coordinate the Ca2+ ion (Kretsinger, R. H. and
Nockolds, C. E. (1973) J. Biol. Chem. 248:3313-3326). It will also be appreciated by the skilled
artisan that modifications of the S100G polypeptide may readily be made based on extensive
knowledge of CBP structure and Ca(2+) binding mechanisms, such as that of Sastry M et al. (Structure
1998;6:223-231), describing the three-dimensional structure of Ca(2+)-bound calcyclin and its
implications for Ca(2+)-signal transduction by S100 proteins.

Recently, a protein designated S100A13 has been discovered and characterized which
shares significant primary structure similarity with the S100G polypeptide of the invention (Wicki
et al. (1996) BBRC 227:594-599).

The binding of calcium induces a conformational change in the S100 proteins, and this may
then affect the secondary effector proteins. This mode of protein-protein interaction and modulation
of the activity of the secondary effector protein is similar to that seen with calmodulin, which also
contains EF-hands. The S100G protein of the invention comprises an ICaBP type calcium
binding domain at amino acid positions 9 to 52 and casein kinase II phosphorylation site
patterns at amino acid position 7 (TELE); 34 (SVNE); and 55 (SLDE) of SEQ ID NO 542.
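
The three motifs listed above (TELE, SVNE, SLDE) each conform to the PROSITE casein kinase II phosphorylation site consensus [ST]-x(2)-[DE] (PS00006). A minimal sketch of how such site patterns may be located in a sequence is given below. The function name and the toy peptide are assumptions for illustration; the toy peptide is not SEQ ID NO 542.

```python
import re

# PROSITE PS00006 consensus for casein kinase II sites: [ST]-x(2)-[DE].
# The zero-width lookahead allows overlapping sites to be reported.
CK2_SITE = re.compile(r"(?=([ST].{2}[DE]))")


def find_ck2_sites(sequence):
    """Return (1-based position, 4-residue motif) for each pattern match."""
    return [(m.start() + 1, m.group(1)) for m in CK2_SITE.finditer(sequence)]


# Hypothetical toy peptide built around two of the motifs named in the text.
toy = "MAAKGGTELEAAKSVNEGG"
print(find_ck2_sites(toy))  # [(7, 'TELE'), (14, 'SVNE')]
```

A pattern match of this kind indicates only a candidate site; whether a given residue is actually phosphorylated must be established experimentally.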

The S100G polypeptides of the invention may thus also be used in any situation in vivo or
in vitro where it is desired to modulate, preferably decrease, the level of free calcium, or in
applications involving sensing a change in calcium concentration. In one aspect, S100G nucleic
acids and polypeptides may be used to develop a calcium biosensor. Calcium biosensors may have
particular utility in detecting abnormalities in calcium transport in which uncompensated influx
into, or efflux from, the extracellular fluid results in hypercalcaemia or hypocalcaemia,
respectively. Such abnormalities in serum calcium concentration may have profound effects on
neurological, gastrointestinal, and renal function (Bushinsky DA et al. Lancet 1998 Jul 25;352
(9124):306-11). Calcium biosensors may be developed by using any suitable means known in the
art to detect a conformational change induced by the binding of calcium to the EF-hand domains of
the S100G polypeptides. Preferably, the conformational change is detected by detecting a change in
the ability of the S100G protein to bind a selected secondary effector protein.

As the distribution of particular S100 proteins is dependent on specific cell types, the S100
proteins may be involved in transducing the signal of an increase in intracellular calcium in a cell
type-specific fashion (Wu, T. et al. (1997) J. Biol. Chem. 272:17145-17153).
CaMLP (CALCIUM BINDING PROTEINS) (Clone ID: 500742698)

Calcium is one of the "second messengers" which relays chemical and electrical signals
within a cell. This signal transduction and, hence, the regulation of biological processes, involves
interaction of calcium ion with high-affinity calcium-binding proteins (CBPs). Disclosed herein in
SEQ ID NOS 184 and 469 is one such protein, encoded by the nucleic acid sequences of SEQ ID
NOS 15 and 352, respectively, and the clone FL11:500742698_204-61-4-0-B2-F, and further
referred to herein as CaMLP, which is thought to act as a Ca2+ sensing and binding protein
involved in diverse aspects of cell proliferation (for example, of hepatocytes, melanoma
cells, leukemic lymphocytes, and HUVEC (human umbilical vein endothelial cells)) and
differentiation. It will be appreciated that all characteristics and uses of the polynucleotides of SEQ
ID NOs: 15 and 352 and polypeptides of SEQ ID NO: 184 and 469, described throughout the present
application also pertain to the human cDNA of clone 500742698, and the polypeptides encoded
thereby. Notably, the CaMLP polypeptide contains EF hand calcium-binding domains (PROSITE
PS00018) at amino acid positions 81-93 and at positions 129-141 of SEQ ID NO 469.
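
For illustration, EF hand domains of the kind referred to above may be located by pattern matching. The sketch below uses a regular expression hand-translated from the PROSITE PS00018 pattern as it is commonly cited; the pattern string should be confirmed against the current PROSITE entry before use, and the function name and test peptide are assumptions of this sketch. The test peptide embeds the first calcium-binding loop of calmodulin and is not SEQ ID NO 469.

```python
import re

# PROSITE PS00018 (EF-hand) as commonly cited (assumed here, please verify):
# D-{W}-[DNS]-{ILVFYW}-[DENSTG]-[DNQGHRK]-{GP}-[LIVMC]-[DENQSTAGC]-x(2)-[DE]-[LIVMFYW]
EF_HAND = re.compile(
    r"(?=(D[^W][DNS][^ILVFYW][DENSTG][DNQGHRK][^GP][LIVMC]"
    r"[DENQSTAGC].{2}[DE][LIVMFYW]))"
)


def find_ef_hands(sequence):
    """Return (start, end) 1-based inclusive positions of each 13-residue match."""
    return [
        (m.start() + 1, m.start() + len(m.group(1)))
        for m in EF_HAND.finditer(sequence)
    ]


# Hypothetical test peptide: calmodulin's first Ca2+-binding loop in alanine padding.
toy = "AAAA" + "DKDGDGTITTKEL" + "AAAA"
print(find_ef_hands(toy))  # [(5, 17)]
```

Each match spans 13 residues, which is consistent with the 13-residue ranges (81-93 and 129-141) recited above.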

The cellular processes in which Ca2+ functions as an intracellular signaling molecule are
diverse, including cell proliferation and differentiation, neurotransmitter secretion, glycogen
metabolism, and skeletal muscle contraction. Within a resting cell, the concentration of Ca2+ in the
cytosol is extremely low, <10^-7 M. However, when the cell is stimulated by an external signal,
such as a neural impulse or a growth factor, the cytosolic concentration of Ca2+ increases by about
50-fold. This influx of Ca2+ is caused by the opening of plasma membrane Ca2+ channels and the
release of Ca2+ from intracellular stores such as the endoplasmic reticulum. Ca2+ directly activates
regulatory enzymes, such as protein kinase C, which trigger signal transduction pathways. Ca2+
also binds to specific Ca2+-binding proteins (CBPs) such as calbindins, troponin C, calmodulin,
and S-100 proteins which then activate multiple target proteins including enzymes, membrane
transport pumps, and ion channels. Calmodulin (CaM) is the most widely distributed and the most
common mediator of calcium effects and appears to be the primary sensor of Ca2+ changes in
eukaryotic cells. The binding of Ca2+ to CaM induces marked conformational changes in the
protein, permitting interaction with, and regulation of, over 100 different proteins. CBP interactions
are involved in a multitude of cellular processes including, but not limited to, gene regulation, DNA
synthesis, cell cycle progression, mitosis, cytokinesis, cytoskeletal organization, muscle contraction,
signal transduction, ion homeostasis, exocytosis, and metabolic regulation (Celio, M. R. et al.
(1996) Guidebook to Calcium-binding Proteins, Oxford University Press, Oxford, UK, pp. 15-20).

THERAPEUTICS 

The CaMLP protein of SEQ ID NOS 184 and 469 disclosed herein provides new calcium
binding protein compositions useful in the diagnosis, prevention, and treatment of cancer,
reproductive disorders, immune disorders, neuronal disorders and developmental disorders.

Calcium binding proteins (CBPs) are implicated in a variety of disorders and several CBPs
have proven to be effective therapeutic targets for which small molecule inhibitors could be
developed. However, while several CBPs are targets for widely-used therapeutic treatments, it
would be advantageous to provide further CBPs allowing more selective therapeutic treatments for
disease to be developed. Evidence has accumulated for a large number of CBPs suggesting
involvement in cell proliferative disorders. It is proposed that CaMLP may be useful as a tissue
specific calmodulin homologue allowing the development of specific inhibitors and activators
having increased selectivity and safety (decreased side effect profile). To date, calmodulin
antagonists are reportedly useful for the treatment of some malignant tumors, particularly those of
the central nervous system, as well as lung tumors. The antitumor activity of calmodulin
antagonists, as well as successful chemotherapy using the same, has been described, for example, in
Sculler et al. Cancer Res., 50:1645-1649 (1990) and Hait et al. Cancer Res., 50:6636-6640 (1990).
U.S. Pat. No. 5,340,565 additionally describes the use of calmodulin antagonists or inhibitors as
agents which enhance the effectiveness of a chemotherapeutic agent or radiation treatment.
Specifically, described therein is a method of inhibiting or killing a tumor or cancer cell in a human
patient undergoing radiation therapy or chemotherapy, for example with such chemotherapeutic
agents as cisplatin (Platinol®), by additionally administering a calmodulin binding agent which
inhibits calmodulin activity.

Calmodulin is also believed to play a pathogenic role in the tissue damage caused by burns
and frostbite (Beitner et al., Gen. Pharmac. 20: 641-646, 1989), as well as in dermatitis and other
conditions involving keratinocyte hyperproliferation. The methods of the present invention may be
applied to the treatment of these and other conditions wherein antagonism of calmodulin activity is
desirable.

Thus, as discussed above, the CaMLP protein of the invention shares structural similarity with
the ubiquitous intracellular receptor protein calmodulin, suggesting that CaMLP may be useful in the
development of selective CaMLP inhibitors. The nucleic acids and polypeptides of the invention
thus provide a novel therapeutic target, in particular for cell proliferative disorders. In one aspect,
said nucleic acids and protein may be used in drug screening processes to develop selective
calmodulin and other calcium binding protein antagonists which do not inhibit the polypeptide of
the invention. In another aspect, the nucleic acids and polypeptides of the invention may be used in
drug screening processes to identify selective modulators of the CaMLP without inhibiting
calmodulin, thereby identifying compounds less likely to cause unwanted side effects. In yet
another aspect, the nucleic acids and polypeptides of the invention may be used in drug screening
processes to identify selective modulators of both calmodulin and CaMLP, thereby identifying
compounds having increased potency.

Upon calcium binding, CaMLP may interact with a number of protein targets in a
calcium-dependent manner, thereby altering a number of complex biochemical pathways that can affect the
overall behavior of cells. The calcium-calmodulin complex, for example, controls the biological
activity of more than thirty different proteins including several enzymes, ion transporters, receptors,
motor proteins, transcription factors, and cytoskeletal components in eukaryotic cells.
Blondelle et al., in U.S. Patent No. 5,840,697, describe peptide inhibitors of
calmodulin. A number of other calmodulin targeted compounds are known and used for a variety of
therapeutic applications. For instance, chlorpromazine (Thorazine®) and related phenothiazine
derivatives, disclosed, for example, in U.S. Pat. No. 2,645,640, are calmodulin antagonists useful as
tranquilizers and sedatives. Naphthalenesulfonamides, also calmodulin antagonists, are known to
inhibit cell proliferation, as disclosed, for example, in Hidaka et al. ((1981), PNAS, 78:4354-4357)
and are useful as antitumor agents. In addition, the cyclic peptide cyclosporin A (Sandimmune®),
disclosed in U.S. Patent No. 4,117,118, is an immunosuppressive agent which is thought to work
by inhibiting calmodulin mediated responses in lymphoid cells.

Many existing calmodulin inhibitors have undesirable biological effects when administered
at concentrations sufficient to block calmodulin. These undesirable biological effects include
non-specific binding to other proteins or receptors, as described, for example, in Polak et al. ((1991), J.
Neurosci. 11:534-542). In addition, negative side effects such as toxicity can occur. A specific
example is the toxic side effects from cyclosporin A. Therefore, a need exists for calmodulin
targeted agents, and in particular calmodulin antagonists which inhibit calmodulin without having
additional, undesirable biological or side effects. In particular there is a need for inhibitors which
are specific to calmodulin and which do not have toxic side effects.

In addition, the CaMLP nucleic acids and polypeptides of the present invention can be used
to identify compounds for the treatment of a subject experiencing negative side effects from the
administration of other pharmaceuticals, such as those drugs that disrupt the body's calcium
homeostasis. Co-administration of said compounds would be useful to counteract iatrogenically
caused dysfunction of calcium metabolism. Such disorders include, but are not limited to, organ
damage, autoimmune disorders, psychotic disorders, tumors and drug induced dysfunction, such as
negative side effects subsequent to administration of pharmaceuticals. For example, organ or tissue
transplantation can result in autoimmune disorders, such as tissue graft (allograft) rejections.

It is well known that calmodulin-targeted compounds which are antagonists can be used as
immunosuppressive agents. In addition, also as described above, such compounds are widely used
as sedative or anti-psychotic agents. Furthermore, there is evidence that calmodulin (and hence
CaMLP) antagonists are useful for the treatment of some malignant tumors, particularly those of the
central nervous system, as well as lung tumors. The antitumor activity of calmodulin antagonists, as
well as successful chemotherapy using the same, has been described, for example, in Sculler et al.
Cancer Res., 50:1645-1649 (1990) and Hait et al. Cancer Res., 50:6636-6640 (1990), both of which
are incorporated herein by reference. U.S. Pat. No. 5,340,565, which is incorporated herein by
reference, additionally describes the use of calmodulin antagonists or inhibitors as agents which
enhance the effectiveness of a chemotherapeutic agent or radiation treatment. Specifically,
described therein is a method of inhibiting or killing a tumor or cancer cell in a human patient
undergoing radiation therapy or chemotherapy, for example with such chemotherapeutic agents as
cisplatin (Platinol), by additionally administering a calmodulin binding agent which inhibits
calmodulin activity.
It has also been found that extracellular calmodulin inhibits TNF release and facilitates
elastase release, providing further suggestion that CaMLP, CaMLP analogues and CaMLP receptor
agonists are useful agents for regulating the inflammatory process. CaMLP antagonists, which
include CaMLP receptor antagonists and CaMLP-binding molecules, may be used to block the
interaction of CaMLP with a receptor, thus providing the opposite effect from CaMLP, its
analogues and receptor agonists. CaMLP may serve as a potent modulator of self-directed
inflammation by assisting in the recognition of self vs. non-self, as prokaryotes (e.g., bacterial
pathogens) do not contain CaMLP. In some situations such as in tumor necrosis, release of
extracellular CaMLP may lead to an inappropriate host response and failure of the
immune/inflammatory systems to eradicate tumor cells. Further, a diagnostic test has been
developed which can discern patient variabilities in TNF inhibition by calmodulin and other
substances. This test can be utilized in monitoring individual patients for determining effective
therapies, and for predicting efficacy of therapy with extracellular CaMLP, CaMLP analogues or
CaMLP receptor agonists on the one hand and CaMLP antagonists on the other. A diagnostic test
for elastase has also been developed with similar utility.

Lysozyme C Protein of SEQ ID NO:196 (internal designation 482181) and related protein of SEQ
ID NO:479

The polypeptides of SEQ ID NO:196 and SEQ ID NO:479, encoded by the cDNA of SEQ
ID NO:27 and 362, respectively, belong to the widely conserved family of lysozyme C precursors
(Prager and Jolles, Lysozymes: model enzymes in biochemistry and biology, ed. Jolles, 9-321
(1996), Qasba and Kumar, Crit. Rev. Biochem. Mol. Biol. 32:255-306 (1997)), which disclosures
are hereby incorporated by reference in their entireties. The protein of SEQ ID NOs:196 and 479 or
part thereof plays a role in glycoprotein and/or peptidoglycan metabolism, probably as a glycosyl
hydrolase of family 22. Thus, the protein of the invention or part thereof is involved in immune and
inflammatory responses and has antiviral, antibacterial, anti-inflammatory and/or anti-histaminic
functions. Preferred polypeptides of the invention are polypeptides comprising the amino acids of
SEQ ID NO:196 from positions 19 to 100, or from positions 1 to 100. Other preferred polypeptides
of the invention are fragments of SEQ ID NO:196 having any of the biological activities described
herein. The glycolytic activity of the protein of the invention or part thereof may be assayed using
any of the assays known to those skilled in the art including those described in Gold and Schweiger,
M. Methods in Enzymology, Vol. XX, Part C pp. 537-542, Ed. Moldave, Academic Press, New
York and London, 1971 and in the US patent 4,255,517, which disclosures are hereby incorporated
by reference in their entireties.

Lysozymes, which are ubiquitous proteins found in most body secretions, are defined as
1,4-beta-N-acetylmuramidases which cleave the glycoside bond between the C-1 of
N-acetylmuramic acid and the C-4 of N-acetylglucosamine in the peptidoglycan of bacteria. They have
various therapeutic properties, such as antiviral, antibacterial, anti-inflammatory and antihistaminic
effects. The activity of lysozymes as an anti-bacterial agent appears to be based on both its direct
bacteriolytic activity and also on stimulatory effects in connection with phagocytosis of
polymorphonuclear leucocytes and macrophages (Biggar and Sturgess, J. M. Infect Immunol. 16:
974-982 (1977); Thacore and Willet, Am. Rev. Resp. Dis. 93: 786-790 (1966); Klockars and
Roberts, P. Acta Haematol 55: 289-292 (1976)), which disclosures are hereby incorporated by
reference in their entireties. Lysozymes have proven to be not only a selective factor but also an
effective factor against microorganisms of the mouth (Iacono et al., J. J. Infect. Immunol. 29:
623-632 (1980)), which disclosure is hereby incorporated by reference in its entirety. Lysozymes can
also kill pathogens by acting synergistically with other proteins such as complement or antibody to
lyse pathogenic cells. Lysozymes also inhibit chemotaxis of polymorphonuclear leukocytes and
limit the production of oxygen free radicals following an infection. This limits the degree of
inflammation, while at the same time enhancing phagocytosis by these cells. Other postulated
functions of lysozymes include immune stimulation (Jolles, P. Biomedicine 25: 275-276 (1976);
Ossermann, E. F. Adv. Pathobiol 4: 98-102 (1976)) and immunological and non-immunological
monitoring of host membranes for any neoplastic transformation (Jolles, P. Biomedicine 25:
275-276 (1976); Ossermann, E. F. Adv. Pathobiol 4: 98-102 (1976)), which disclosures are hereby
incorporated by reference in their entireties. Lysozymes may thus be used in a wide spectrum of
applications (see US patent 5,618,712, which disclosure is hereby incorporated by reference in its
entirety). Determination of the lysozymes from serum and/or urine is used to diagnose various
diseases or as an indicator for their development. In acute lymphoblastic leukaemia the lysozyme
serum level is significantly reduced, whereas in chronic myelotic leukaemia and in acute
monoblastic and myelomonocytic leukaemia the lysozyme concentration in the serum is greatly
increased. The therapeutically effective use of lysozyme is possible in the treatment of various
bacterial and virus infections (Zona, Herpes zoster), in colitis, various types of pain, in allergies,
inflammation and in pediatrics (the conversion of cow's milk into a form suitable for infants by the
addition of lysozyme).

The invention relates to methods and compositions using the protein of the invention or part
thereof to hydrolyze one or several substrates, alone or in combination with other substances,
preferably antiviral, antifungal and/or antibacterial substances including but not limited to
immunoglobulins, lactoferrin, betalysin, fibronectin, and complement components. Such substrates
are glycosylated compounds, preferably containing beta-1-4-glycoside bonds, more preferably
containing beta-1-4-glycoside bonds between N-acetylmuramic acid and N-acetylglucosamine.
For example, the protein of the invention or part thereof is added to a sample containing the
substrate(s) in conditions allowing hydrolysis, and allowed to catalyze the hydrolysis of the
substrate(s). In a preferred embodiment, the hydrolysis is carried out using a standard assay such as
those described by Gold and Schweiger, supra, and US patents 5,871,477 and 4,255,517, which
disclosures are hereby incorporated by reference in their entireties. In a preferred embodiment, the
protein of the invention or part thereof may be used to lyse recombinant bacteria in order to recover
the recombinant DNA, the recombinant protein of interest, or both using, for example, any of the
assays described in Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition,
Cold Spring Harbor Laboratory Press (1989), which disclosure is hereby incorporated by reference
in its entirety.

In an embodiment, the protein of the invention or part thereof is used to hydrolyze
contaminating substrates, preferably exogenous substrates from bacterial, fungal or viral origins, in
an aqueous sample or onto a material, preferably glassware and plasticware. In particular, the
protein of the invention or part thereof may be used as a disinfectant in dental rinse, in protection of
aqueous systems or in preparing material for medical applications using any of the methods and
compositions described in US patents 5,069,717, 4,355,022 and 5,001,062, which disclosures are
hereby incorporated by reference in their entireties. In a preferred embodiment, the protein of the
invention is used as a host resistance factor in infants' formulas to convert cow's milk into a form
more suitable for infants as described in US patent 6,020,015, which disclosure is hereby
incorporated by reference in its entirety. In another preferred embodiment, the protein of the
invention or part thereof may be used as a food preservative (see Hayashi et al., Agric. Biol. Chem.
(European Edition of Japanese Journal of Agriculture, Biochemistry and Chemistry), Vol. 53, pp.
3173-3177, 1989), which disclosure is hereby incorporated by reference in its entirety. In addition,
the protein of the invention or part thereof may be used to clarify xanthan gum fermented broth for
applications in food and in cosmetic industries using the method described in US patent 5,994,107,
which disclosure is hereby incorporated by reference in its entirety. In another preferred
embodiment, compositions comprising the protein of the present invention or part thereof are added
to samples or materials as a "cocktail" with other antimicrobial substances, preferably antibiotics or
hydrolytic enzymes such as those described in US patents 5,458,876 and 5,041,326, which
disclosures are hereby incorporated by reference in their entireties, to decontaminate the samples.
For example, the protein of the invention or part thereof may be used in place of or in combination
with antibiotics in cell cultures. The advantage of using a cocktail of hydrolytic enzymes is that one
is able to hydrolyze a wide range of substrates without knowing the specificity of any of the
enzymes. Using a cocktail of hydrolytic enzymes also protects a sample or material from a wide
range of future unknown contaminants from a vast number of sources. For example, the protein of
the invention or part thereof is added to samples where contaminating substrates, preferably
exogenous substrates from bacterial, fungal or viral origins, are undesirable, in an amount sufficient to
promote hydrolysis of said substrates. Alternatively, the protein of the invention or part thereof
may be bound to a chromatographic support, either alone or in combination with other hydrolytic
enzymes, using techniques well known in the art, to form an affinity chromatography column. A
sample containing the undesirable substrate is run through the column to remove the substrate.
Immobilizing the protein of the invention or part thereof on a support is particularly advantageous
for those embodiments in which the method is to be practiced on a commercial scale. This
immobilization facilitates the removal of the enzyme from the batch of product and subsequent
reuse of the enzyme. Immobilization of the protein of the invention or part thereof can be
accomplished, for example, by inserting a cellulose-binding domain in the protein. One of skill in
the art will understand that other methods of immobilization could also be used and are described in
the available literature. Alternatively, the same methods may be used to identify new substrates.

In addition, the protein of the invention or part thereof may be usefial to identify or quantify 
the amount of a given substrate in biological fluids, foods, water, air, solutions and the like. In a 
5 preferred embodiment, the protein of the invention or part thereof is used in assays and diagnostic 
kits for the identification and quantification of exogenous substrates in bodily fluids including 
blood, lymph, saliva or other tissue samples, in addition to bacterial, fungal, plant, yeast, viral or 
mammalian cell cultures. In a preferred embodiment, the protein of the invention or part thereof is 
used to detect, identify, and/or quantify eubacteria using reagents and assays described in US patent
5,935,804, which disclosure is hereby incorporated by reference in its entirety. Briefly, the protein
of the invention or part thereof is catalytically inactivated, i.e. made capable of binding but not cleaving a
peptidoglycan comprising NAc-muramic acid in the eubacteria, using any of the methods known to
those skilled in the art including those which produce a mutant enzyme, a recombinant enzyme, or a
chemically inactivated enzyme. The catalytically inactive protein of the invention is then incubated
with an aliquot of a biological sample under conditions suitable for binding of the inactive enzyme
to the peptidoglycan substrate. Then, the bound enzyme is detected to assess the presence or 
amount of the eubacteria in the biological sample. 

In another embodiment, the nucleic acid of the invention or part thereof may be used to 
increase disease resistance of plants to bacterial, fungal and/or viral infections. A polynucleotide
containing the nucleic acid of the invention or part thereof is introduced into the plant genome under
conditions allowing correct expression of the transgenic protein using any methods known to those 
skilled in the art including those disclosed in US patents 5,349,122 and 5,850,025, which 
disclosures are hereby incorporated by reference in their entireties. 

In another preferred embodiment, the protein of the invention or part thereof may be useful
to treat and/or prevent bacterial, fungal and viral infections in humans or in animals caused by
various agents including but not limited to Streptococcus, Veillonella alcalescens, Actinomyces,
Herpes simplex, Candida albicans, Micrococcus lysodeikticus and HIV by hydrolyzing the
glycosylated compounds contained in such micro-organisms. In still another preferred embodiment, the
protein of the invention or part thereof is used to prevent and/or treat bacterial, fungal and viral
infections in immunocompromised individuals who lack fully functional immune systems, such as
neonates or geriatric patients or HIV-infected individuals, or who suffer from a disease affecting the 
respiratory tract such as cystic fibrosis or the gastrointestinal tract such as ulcerative colitis or sprue. 

In still another embodiment, the protein of the invention or part thereof may be used as a 
growth factor for in vitro cell culture, preferably for T cells and T cell lines, using techniques and
methods taught in US patent 5,468,635, which disclosure is hereby incorporated by reference in its
entirety. 

In addition, the protein of the invention or part thereof may be used to identify inhibitors for 
mechanistic and clinical applications. Such inhibitors may then be used to identify or quantify the 
protein of the invention in a sample, and to diagnose, treat or prevent any of the disorders where the
protein's hydrolytic, immunostimulatory and/or inflammatory activities are undesirable and/or
deleterious including but not limited to amyloidosis, colitis, lysosomal diseases, inflammatory and
immune disorders including allergies and leukaemia. The protein of the invention may also be used
to monitor host cell membranes for neoplastic transformation.

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID
NOs:27 and 362 and polypeptides of SEQ ID NOs:196 and 479, described throughout the present
application also pertain to the human cDNA of clone 482181, and the polypeptides encoded
thereby.

Angiogenin Protein of SEQ ID NO:176 (internal designation 114180) and related protein of SEQ
ID NO:461

The polypeptides of SEQ ID NO:176 and SEQ ID NO:461 encoded by the extended cDNA
SEQ ID NO:7 and SEQ ID NO:344, respectively, are ribonucleases that belong to the pancreatic
ribonuclease family (see reviews from Beintema, (1998) Cell. Mol. Life Sci. 54:763-5; Beintema
and Kleineidam, (1998) Cell. Mol. Life Sci. 54:825-32, which disclosures are hereby incorporated
by reference in their entireties). It will be appreciated that all characteristics and uses of the
polynucleotides of SEQ ID NO:7 and SEQ ID NO:344 and polypeptides of SEQ ID NO:176 and
SEQ ID NO:461, described throughout the present application also pertain to the human cDNA of
clone 114180, and the polypeptides encoded thereby. In addition, the protein of the invention plays
a role in angiogenesis as an angiogenin variant protein (see review from Badet, (1999) Pathol. Biol.
74:345-51, which disclosure is hereby incorporated by reference in its entirety). Preferred
polypeptides of the invention are polypeptides comprising the amino acids of SEQ ID NO:176
from positions 19 to 75, from positions 26 to 75, or from positions 63 to 69. Other preferred
polypeptides of the invention are fragments of SEQ ID NO:176 having any of the biological
activities described herein. The ribonuclease activity of the protein of the invention or part thereof
may be assayed using any of the assays known to those skilled in the art including those described
in US patent 5,866,119, which disclosure is hereby incorporated by reference in its entirety. The
angiogenic activity of the protein of the invention or part thereof may be assayed using any of the
assays known to those skilled in the art including those described by Fett et al. (1985) Biochem. 24,
5480-5486, which disclosure is hereby incorporated by reference in its entirety.
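
The preferred fragments above are given as residue positions, which are conventionally 1-based and inclusive. As a minimal sketch of how such ranges map onto a sequence string, the following assumes that numbering convention; the function name `fragment` and the placeholder sequence are hypothetical and are not the sequence of SEQ ID NO:176.

```python
def fragment(sequence: str, start: int, end: int) -> str:
    """Return residues start..end of a protein sequence (1-based, inclusive)."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("positions must satisfy 1 <= start <= end <= len(sequence)")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


# Hypothetical 80-residue placeholder, used only to exercise the function.
PLACEHOLDER = "ACDEFGHIKLMNPQRSTVWY" * 4

# Position ranges named in the text for the preferred polypeptides.
PREFERRED_RANGES = [(19, 75), (26, 75), (63, 69)]

fragments = {r: fragment(PLACEHOLDER, *r) for r in PREFERRED_RANGES}
```

With the real sequence substituted for the placeholder, `fragments[(63, 69)]` would hold the seven-residue fragment spanning positions 63 to 69.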

Ribonucleases are proteins which catalyze the hydrolysis of phosphodiester bonds in RNA
chains. Pancreatic ribonucleases are pyrimidine-specific ribonucleases present in high quantity in the
pancreas of a number of mammalian taxa and of a few reptiles. In addition to their function in
hydrolysis of RNA, ribonucleases have evolved to support a variety of other physiological
activities. Such activities include anti-parasite, anti-bacterial, anti-viral, and anti-neoplastic activities,
neurotoxicity, and angiogenesis. For example, bovine seminal ribonuclease is anti-neoplastic
(Laccetti, et al. (1992) Cancer Res. 52: 4582-4586, which disclosure is hereby incorporated by
reference in its entirety). Some frog ribonucleases display both anti-viral and anti-neoplastic activity
(Youle, et al. (1994) Proc. Natl. Acad. Sci. USA 91: 6012-6016; Mikulski, et al. (1990) J. Natl.
Cancer Inst. 82: 151-152; and Wu, et al. (1993) J. Biol. Chem. 268: 10686-10693), which
disclosures are hereby incorporated by reference in their entireties. Eosinophil-derived neurotoxin
(EDN) and eosinophil cationic protein (ECP) are related ribonucleases which possess neurotoxicity
(Beintema, et al. (1988) Biochemistry 27: 4530-4538; Ackerman, (1993) In Makino, and Fukuda,
Eosinophils: Biological and Clinical Aspects. CRC Press, Boca Raton, Fla., pp 33-74), which
disclosures are hereby incorporated by reference in their entireties. In addition, ECP exhibits
cytotoxic, anti-parasitic, and anti-bacterial activities. An EDN-related ribonuclease, named RNase
k6, has been shown to be expressed in normal human monocytes and neutrophils, suggesting a role for this
ribonuclease in host defense (Rosenberg, and Dyer, (1996) Nuc. Acid. Res. 24: 3507-3513), which
disclosure is hereby incorporated by reference in its entirety.

Angiogenin is a tRNA-specific ribonuclease which binds protein partners on the surface of
endothelial cells for endocytosis. Potential partners of angiogenin include heparin, plasminogen,
elastase, angiostatin, actin, and a 170 kDa receptor on the surface of endothelial cells [Strydom,
(1998) Cell. Mol. Life Sci. 54, 811-824, which disclosure is hereby incorporated by reference in its
entirety]. Endocytosed angiogenin is translocated to the nucleus where it promotes endothelial
invasiveness required for blood vessel formation (Moroianu, and Riordan, (1994) Proc. Natl. Acad.
Sci. USA 91: 1217-1221, which disclosure is hereby incorporated by reference in its entirety).

Although originally isolated from medium conditioned by human colon cancer cells (Fett et
al. (1985), supra), and subsequently shown to be produced by several other histological types of
human tumors [Rybak, et al. (1987) Biochem. Biophys. Res. Commun. 146, 1240-1248; Olson, et
al., (1995) Proc. Natl. Acad. Sci. U.S.A. 92, 442-446, which disclosures are hereby incorporated by
reference in their entireties], angiogenin also is a constituent of human plasma and normally
circulates at a concentration of 250-360 ng/ml [Shimoyama, et al. (1996) Cancer Res. 56, 2703-2706;
Blaser, et al. (1993) Eur. J. Clin. Chem. Clin. Biochem. 31, 513-516, which disclosures are
hereby incorporated by reference in their entireties]. It has also been shown that recurrent gastric
cancer patients had a much higher serum concentration of angiogenin than primary gastric cancer
patients [Shimoyama, and Kaminishi, (2000) J. Cancer Res. Clin. Oncol. 126, 468-474, which
disclosure is hereby incorporated by reference in its entirety].

Angiogenin is a potent inducer of angiogenesis [Fett, et al. supra]. Angiogenesis is a
complex process of blood vessel formation comprising several separate but interconnected steps
at the cellular and biochemical level including: (i) activation of endothelial cells by the action of an
angiogenic stimulus, (ii) adhesion and invasion of activated endothelial cells into the surrounding
tissues and migration toward the source of the angiogenic stimulus, and (iii) proliferation and
differentiation of endothelial cells to form a new microvasculature [Folkman, and Shing, (1992) J.
Biol. Chem. 267, 10931-10934; Moscatelli, and Rifkin, (1988) Biochim. Biophys. Acta 948, 67-85,
which disclosures are hereby incorporated by reference in their entireties]. While angiogenesis is a
tightly-controlled process under usual physiological conditions, abnormal angiogenesis can have 
devastating consequences in pathological conditions such as arthritis, diabetic retinopathy and
tumor growth. It is now well-established that the growth of virtually all solid tumors is angiogenesis
dependent [Folkman, (1989) J. Natl. Cancer Inst. 82, 4-6, which disclosure is hereby incorporated 
by reference in its entirety]. Angiogenesis is also a prerequisite for the development of metastasis, 
since it provides the means whereby tumor cells disseminate from the original primary tumor and 
establish at distant sites [Mahadevan, and Hart, (1990) Rev. Oncol. 3, 97-103; Blood, and Zetter
(1990) Biochim. Biophys. Acta 1032, 89-118, which disclosures are hereby incorporated by
reference in their entireties]. Therefore, interference with the process of tumor-induced 
angiogenesis can be an effective therapy for both primary and metastatic cancers. Indeed, several 
anti-angiogenic agents have been produced and are currently in the clinical trial stage. 

The invention relates to methods and compositions using the protein of the invention or part
thereof to hydrolyze one or several substrates, preferably nucleic acids, more preferably RNA, alone
or in combination with other substances. For example, the protein of the invention or part thereof is
added to a sample containing the substrate(s) under conditions allowing hydrolysis, and allowed to
catalyze the hydrolysis of the substrate(s). Hydrolysis conditions as described in US patent
5,866,119 may be used, which disclosure is hereby incorporated by reference in its entirety.

In a preferred embodiment, the protein of the invention or part thereof may be used to
remove contaminating RNA in a biological sample, alone or in combination with other nucleases.
In a more preferred embodiment, the protein of the invention or part thereof may be used to purify
DNA preparations from contaminating RNA, or to remove RNA templates prior to second strand
synthesis or prior to analysis of in vitro translation products. Compositions comprising the protein
of the present invention or part thereof are added to biological samples as a "cocktail" with other
nucleases. The advantage of using a cocktail of hydrolytic enzymes is that one is able to hydrolyze
a wide range of substrates without knowing the specificity of any of the enzymes. Such cocktails of
nucleases are commonly used in molecular biology assays, for example to remove unbound RNA in
RNase protection assays. Using a cocktail of hydrolytic enzymes also protects a sample from a
wide range of future unknown RNA contaminants from a vast number of sources. For example, the
protein of the invention or part thereof is added to samples where contaminating substrates are
undesirable. Alternatively, the protein of the invention or part thereof may be bound to a
chromatographic support, either alone or in combination with other hydrolytic enzymes, using
techniques well known in the art, to form an affinity chromatography column. A sample containing
the undesirable substrate is run through the column to remove the substrate. Immobilizing the
protein of the invention or part thereof on a support is particularly advantageous for those
embodiments in which the method is to be practiced on a commercial scale. This immobilization
facilitates the removal of the enzyme from the batch of product and subsequent reuse of the enzyme.
Immobilization of the protein of the invention or part thereof can be accomplished, for example, by
inserting a cellulose-binding domain in the protein. One of skill in the art will understand that other
methods of immobilization could also be used and are described in the available literature.
Alternatively, the same methods may be used to identify new substrates.

In another embodiment, the protein of the invention or part thereof may be used to
decontaminate or disinfect samples infected by undesirable parasites, bacteria and/or viruses using
any of the methods known to those skilled in the art including those described in Youle et al.
(1994), supra; Mikulski et al. (1990), supra; and Wu et al. (1993), supra.

In another embodiment, the present invention relates to compositions and methods using the
protein of the invention or part thereof to selectively kill cells. The protein of the invention or part
thereof is linked to a recognition moiety capable of binding to a chosen cell, such as lectins,
receptors or antibodies, thus generating cytotoxic reagents using methods and techniques described
in US patent 5,955,073, which disclosure is hereby incorporated by reference in its entirety.

In still another embodiment, the invention relates to compositions and methods using the
protein of the invention or part thereof to stimulate cell proliferation both in vitro and in vivo,
especially endothelial cell growth. For example, soluble forms of the protein of the invention or
part thereof may be added to cell culture medium in an amount effective to stimulate cell
proliferation.

In still another embodiment, the protein of the invention or part thereof may be used in the
diagnosis, prevention and/or treatment of disorders associated with excessive angiogenesis such as
tumor growth, arthritis or diabetic retinopathy. 

In a preferred embodiment, the protein of the invention may be used as a diagnostic marker
to evaluate the risk of a given individual to develop a tumor, to evaluate the risk of recurrence of
tumors or to evaluate the degree of cancer aggressiveness based on the facts that the level of
circulating angiogenin is lower in normal individuals than in patients bearing tumors, and is lower
in patients with primary cancers compared to patients with recurrent tumors, as stated above.
Thus, quantitative immunoassays can be used for the detection of abnormal levels of either the
protein of SEQ ID NO:176 or the mRNA encoding such protein as the polynucleotide of SEQ ID
NO:7, thereby identifying those individuals at risk for the development of tumors or the recurrence
of tumors. Detection of abnormal levels of the protein of the invention may be performed using any
techniques known to those skilled in the art including those described elsewhere in the application.
For example, antibodies binding specifically to the protein of the invention, or fragments thereof,
may be used in routine immunoassays to screen for the presence or absence of the protein of the
invention, or fragments thereof. Alternatively, the nucleic acids which encode the protein of the
invention, or fragments thereof, may be used in hybridization assays to detect and/or quantify the
expression of said protein.
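
The comparison of a measured level against a reference range can be sketched as follows. This is an illustration only: the bounds reuse the plasma range of 250-360 ng/ml quoted earlier in this section as a placeholder, and the function name `classify_level` is hypothetical. A working assay would need its own validated reference values.

```python
# Placeholder reference range in ng/ml, taken from the plasma concentration
# quoted in the text; not a validated diagnostic cut-off.
REFERENCE_RANGE_NG_PER_ML = (250.0, 360.0)


def classify_level(level_ng_per_ml: float,
                   reference=REFERENCE_RANGE_NG_PER_ML) -> str:
    """Report where a measured concentration falls relative to a reference range."""
    low, high = reference
    if level_ng_per_ml < 0:
        raise ValueError("concentration cannot be negative")
    if level_ng_per_ml < low:
        return "below reference range"
    if level_ng_per_ml > high:
        return "above reference range"
    return "within reference range"
```

A value reported as above the reference range would correspond to the "abnormal level" that the immunoassay is meant to detect.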

Another aspect of the invention provides for molecules which inhibit, or reduce, the
biological activity or expression of SEQ ID NO: 276. Such molecules may be administered to
patients to prevent vascularization, especially tumor vascularization, thereby limiting tumor growth.
Such antagonists and/or inhibitors may be antibodies specific for the protein of the invention that
can be used directly as an antagonist, or indirectly as a targeting or delivery mechanism for bringing
a pharmaceutical agent to cells or tissue which express the protein of the invention. Neutralizing
antibodies (i.e., those which inhibit protein-protein interactions) are especially preferred for
therapeutic use. Alternatively, such molecules may be mutated forms of the protein of the invention
or truncated forms which will be able to bind to the partners of the protein of the invention and
compete with it for partners but without eliciting any of its biological activities. Other methods to
inhibit the expression of the protein of the invention include antisense and triple helix strategies as
described herein. Other antagonists or inhibitors of the protein of the invention may be produced
using methods which are generally known in the art, including the screening of libraries of
pharmaceutical agents to identify those which specifically bind the protein of the invention. The
protein of the invention, or part thereof, preferably its functional or immunogenic fragments, or
oligopeptides related thereto, can be used for screening libraries of compounds in any of a variety of
drug screening techniques including those described herein.

Protease Inhibitor Protein of SEQ ID NO:181 (internal designation 1000771934) and related
protein of SEQ ID NO:466

The protein of SEQ ID NOs:181 and 466 encoded by the extended cDNA SEQ ID NOs:12
and 349, respectively, is a protease inhibitor belonging to the WAP-type disulfide core family. It
will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID NOs:12 and
349 and polypeptides of SEQ ID NOs:181 and 466, described throughout the present application
also pertain to the human cDNA of clone 1000771934, and the polypeptides encoded thereby.
Preferred polypeptides of the invention are polypeptides comprising the amino acid fragments of
SEQ ID NO:181 from positions 32 to 73, 49 to 62, 76 to 122, 97 to 110 or any combination thereof.
Other preferred polypeptides of the invention are fragments of SEQ ID NO:181 having any of the
biological activities described herein. The protease inhibitor activity of the protein of the invention
or part thereof may be assessed using any techniques known to those skilled in the art. Possible
substrates for the protein of the invention include, but are not limited to, trypsin, chymotrypsin,
leukoproteinase, elastase, subtilisin, type IV collagenase and other serine proteases.

Proteases, which cleave proteins, are widely used in industry, including in food processing,
brewing, and alcohol production. Proteases are important components of laundry detergents and
other products. Within biological research, proteases are used in purification processes to degrade
unwanted proteins. It is often desirable to employ proteases of low specificity or mixtures of more
specific proteases to obtain the necessary degree of degradation.

Proteases are also key components of a broad range of biological pathways, including blood
coagulation and digestion. For example, the absence or insufficiency of a protease can result in a
pathological condition that can be treated by replacement or augmentation therapy. Such therapies
include the treatment of hemophilia with clotting factors VIII, IX, and VIIa. In another application,
the proteolytic enzyme tissue plasminogen activator (t-PA) is used to activate the body's clot lysing
mechanism, thereby reducing morbidity resulting from myocardial infarction. The protease thrombin
is used to initiate the clotting of fibrinogen-based tissue adhesives during surgery. Neutrophils
produce several antibacterial serine proteases (Gabay, Ciba Found. Symp. 186:237-247, 1994;
Scocchi et al., Eur. J. Biochem. 209:589-595, 1992, which disclosures are hereby incorporated by
reference in their entireties). Proteases also regulate cellular processes through receptor-mediated
pathways by proteolytic activation of the cognate receptor (Vu et al., Cell 64:1057-1068, 1991;
Blackhart et al., J. Biol. Chem. 271:16466-16471, 1996, which disclosures are hereby incorporated
by reference in their entireties).

Overproduction or lack of regulation of proteases can also have pathological consequences. 
Elastase, released within the lung in response to the presence of foreign particles, can damage lung 
tissue if its activity is not tightly regulated. Emphysema in smokers is believed to arise from an 
imbalance between elastase and its inhibitor, alpha-1-antitrypsin. This balance may be restored by
administration of exogenous alpha-1-antitrypsin.

In addition, protease inhibitors have been shown to inhibit the growth of microorganisms 
including human pathogenic bacteria such as strains of group A streptococci, including antibiotic-resistant
strains (Merigan, T. et al (1996) Ann Intern Med 124:1039-1050; Stoka, V. (1995) FEBS
Lett 370:101-104; Vonderfecht, S. et al (1988) J Clin Invest 82:2011-2016; Collins, A. et al (1991)
Antimicrob Agents Chemother 35:2444-2446, which disclosures are hereby incorporated by
reference in their entireties). 

In view of the growing use of proteases in industry, research, and medicine, there is an 
ongoing need in the art for new enzymes and new enzyme inhibitors. The present invention 
addresses these needs. 

The invention relates to compositions and methods using the protein of the invention or part
thereof to inhibit proteases, either in vitro or in vivo. Since proteases play an important role in the
regulation of many biological processes in virtually all living organisms as well as a major role in 
diseases, inhibitors of proteases are useful in a wide variety of applications. 

In one embodiment, the protein of the invention or part thereof may be useful to quantify
the amount of a given protease in a biological sample, and thus used in assays and diagnostic kits
for the quantification of proteases in bodily fluids or other tissue samples, in addition to bacterial,
fungal, plant, yeast, viral or mammalian cell cultures. In a preferred embodiment, the sample is
assayed using a standard protease substrate. A known concentration of protease inhibitor is added,
and allowed to bind to a particular protease present. The protease assay is then rerun, and the loss
of activity is correlated to the protease inhibitor activity using techniques well known to those
skilled in the art. 
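
The arithmetic behind the "loss of activity" in the rerun assay can be sketched as a percent-inhibition calculation. This is a minimal illustration assuming both runs report activity in the same arbitrary units; the function name `percent_inhibition` is hypothetical and the text does not prescribe any particular formula.

```python
def percent_inhibition(activity_before: float, activity_after: float) -> float:
    """Percent loss of protease activity after adding a known amount of inhibitor.

    activity_before: activity measured with the standard substrate alone.
    activity_after:  activity measured when the assay is rerun with inhibitor.
    Both values must be in the same units.
    """
    if activity_before <= 0:
        raise ValueError("activity_before must be positive")
    if activity_after < 0:
        raise ValueError("activity_after cannot be negative")
    return 100.0 * (activity_before - activity_after) / activity_before


# Example: activity drops from 200 units to 50 units after inhibitor is added.
loss = percent_inhibition(200.0, 50.0)
```

Because the inhibitor concentration is known, the fraction of activity lost in this way can then be related to the amount of the targeted protease in the sample.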

In addition, the protein of the invention or part thereof may be used to remove, identify or
inhibit contaminating proteases in a sample. Compositions comprising the polypeptides of the
present invention may be added to biological samples as a "cocktail" with other protease inhibitors
to prevent degradation of protein samples. The advantage of using a cocktail of protease inhibitors
is that one is able to inhibit a wide range of proteases without knowing the specificity of any of the
proteases. Using a cocktail of protease inhibitors also protects a protein sample from a wide range
of future unknown proteases which may contaminate a protein sample from a vast number of
sources. For example, the protein of the invention or part thereof is added to samples where
proteolytic degradation by contaminating proteases is undesirable. Such protease inhibitor cocktails
(see for example the ready to use cocktails sold by Sigma) are widely used in research laboratory
assays to inhibit proteases capable of degrading a protein of interest for which the assay is to be
performed. Alternatively, the protein of the invention or part thereof may be bound to a
chromatographic support, either alone or in combination with other protease inhibitors, using
techniques well known in the art, to form an affinity chromatography column. A sample containing
the undesirable protease is run through the column to remove the protease. Alternatively, the same
methods may be used to identify new proteases.

In a preferred embodiment, the protein of the invention or part thereof may be used to
inhibit proteases implicated in a number of diseases where cellular proteolysis occurs such as
diseases characterized by tissue degradation including but not limited to arthritis, muscular 
dystrophy, inflammation, tumor invasion, glomerulonephritis, parasite-borne infections, 
Alzheimer's disease, periodontal disease, and cancer metastasis. 

In another preferred embodiment, the protein of the invention or part thereof may be useful
to inhibit exogenous proteases, both in vivo and in vitro, implicated in a number of infectious
diseases including but not limited to gingivitis, malaria, leishmaniasis, filariasis, osteoporosis and
osteoarthritis, and other bacterial, parasite-borne or viral infections. In particular, the protein of
the invention or part thereof may offer applications in viral diseases where the proteolysis of 
primary polypeptide precursors is essential to the replication of the virus, as for HIV and HCV. 

In another embodiment, the protease inhibitors of the present invention may be used as
antibacterial agents to retard or inhibit the growth of certain bacteria either in vitro or in vivo.
Particularly, the polypeptides of the present invention may be used to inhibit the growth of group A
streptococci on non-living matter such as surgical instruments, laboratory glassware and
plasticware, and in cultures of living plant, fungal, and animal cells.

Furthermore, the protease inhibitors of the present invention find use in drug potentiation
applications. For example, therapeutic agents such as antibiotics or antitumor drugs can be
inactivated through proteolysis by endogenous proteases, thus rendering the administered drug less
effective or inactive. Accordingly, the protease inhibitors of the invention may be administered to a
patient in conjunction with a therapeutic agent in order to potentiate or increase the activity of the
drug. This co-administration may be by simultaneous administration, such as a mixture of the
protease inhibitor and the drug, or by separate simultaneous or sequential administration. 

Serpin Protein of SEQ ID NO:179 (internal designation 784093) and related protein of SEQ ID
NO:464

The protein of SEQ ID NOs:179 and 464, encoded by the extended cDNA SEQ ID
NOs:10 and 347, respectively, is a serine protease inhibitor. It will be appreciated that all
characteristics and uses of the polynucleotides of SEQ ID NOs:10 and 347 and polypeptides of SEQ
ID NOs:179 and 464, described throughout the present application also pertain to the human cDNA
of clone 784093, and the polypeptides encoded thereby. Preferred polypeptides of the invention are
polypeptides comprising the amino acid fragments of SEQ ID NO:179 from positions 47 to 139.
Other preferred polypeptides of the invention are fragments of SEQ ID NO:179 having any of the
biological activities described herein. The protease inhibitor activity of the protein of the invention
or part thereof may be assessed using any techniques known to those skilled in the art including
those disclosed in US patent 5,955,284. Possible substrates for the protein of the invention
include, but are not limited to, serine proteases such as elastase, trypsin, chymotrypsin, thrombin III,
plasmin, heparin, complement II, plasminogen activator, protein C, interleukin-1β converting
enzyme, preferably trypsin, elastase, and chymotrypsin.

Proteases are key components of a broad range of biological pathways, including blood
coagulation and digestion. For example, the absence or insufficiency of a protease can result in a
pathological condition that can be treated by replacement or augmentation therapy. Such therapies
include the treatment of hemophilia with clotting factors VIII, IX, and VIIa. In another application,
the proteolytic enzyme tissue plasminogen activator (t-PA) is used to activate the body's clot lysing
mechanism, thereby reducing morbidity resulting from myocardial infarction. The protease thrombin
is used to initiate the clotting of fibrinogen-based tissue adhesives during surgery. Neutrophils
produce several antibacterial serine proteases (Gabay, Ciba Found. Symp. 186:237-247, 1994;
Scocchi et al., Eur. J. Biochem. 209:589-595, 1992, which disclosures are hereby incorporated by
reference in their entireties). Proteases also regulate cellular processes through receptor-mediated
pathways by proteolytic activation of the cognate receptor (Vu et al., Cell 64:1057-1068, 1991;
Blackhart et al., J. Biol. Chem. 271:16466-16471, 1996, which disclosures are hereby incorporated
by reference in their entireties).

Overproduction or lack of regulation of proteases can also have pathological consequences. 
Elastase, released within the lung in response to the presence of foreign particles, can damage lung
tissue if its activity is not tightly regulated. Emphysema in smokers is believed to arise from an
imbalance between elastase and its inhibitor, alpha-1-antitrypsin. This balance may be restored by
administration of exogenous alpha-1-antitrypsin.

The serine proteases (SP) are a large family of proteolytic enzymes that include the
digestive enzymes, trypsin and chymotrypsin, components of the complement cascade and of the
blood-clotting cascade, and enzymes that control the degradation and turnover of macromolecules
of the extracellular matrix. SP are so named because of the presence of a serine residue in the active
catalytic site for protein cleavage. They are characterized by a catalytic triad of serine, histidine, and
aspartic acid residues. SP have a wide range of substrate specificities and can be subdivided into
subfamilies on the basis of these specificities. The main sub-families are trypases (cleavage after
arginine or lysine), aspases (cleavage after aspartate), chymases (cleavage after phenylalanine or
leucine), metases (cleavage after methionine), and serases (cleavage after serine).
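
The sub-family grouping listed above amounts to a lookup from the residue preceding the cleaved bond to a sub-family name. The sketch below encodes only the pairs named in the text, using standard one-letter amino acid codes; the names `SUBFAMILY_BY_RESIDUE` and `subfamily_for_cleavage` are hypothetical.

```python
from typing import Optional

# One-letter amino acid code of the residue after which cleavage occurs,
# mapped to the sub-family named in the text.
SUBFAMILY_BY_RESIDUE = {
    "R": "trypases",   # arginine
    "K": "trypases",   # lysine
    "D": "aspases",    # aspartate
    "F": "chymases",   # phenylalanine
    "L": "chymases",   # leucine
    "M": "metases",    # methionine
    "S": "serases",    # serine
}


def subfamily_for_cleavage(residue: str) -> Optional[str]:
    """Return the sub-family that cleaves after the given residue, or None."""
    return SUBFAMILY_BY_RESIDUE.get(residue.upper())
```

Residues not listed in the text, such as tryptophan, return `None` rather than a guessed sub-family.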

Serine proteases are used for a variety of industrial purposes. For example, the serine 
protease subtilisin is used in laundry detergents to aid in the removal of proteinaceous stains (e.g., 
Crabb, ACS Symposium Series 460:82-94, 1991, which disclosure is hereby incorporated by 
reference in its entirety). In the food processing industry, serine proteases are used to produce 
protein-rich concentrates from fish and livestock, and in the preparation of dairy products (Kida et 
al., Journal of Fermentation and Bioengineering 80:478-484, 1995; Haard and Simpson, in Martin, 
A. M., ed., Fisheries Processing: Biotechnological Applications, Chapman and Hall, London, 1994, 
132-154; Bos et al., European Patent Office Publication 494 149 A1, which disclosures are hereby 
incorporated by reference in their entireties). 

Serpins are irreversible serine protease inhibitors which are principally located 
extracellularly. Proteins which have been assigned to the serpin family include the following: 
alpha-1 protease inhibitor, alpha-1-antichymotrypsin, antithrombin III, alpha-2-antiplasmin, 
heparin cofactor II, complement C1 inhibitor, plasminogen activator inhibitors 1 and 2, glia-derived 
nexin, protein C inhibitor, rat hepatocyte inhibitors, crmA (a viral serpin which inhibits interleukin 
1-beta cleavage enzyme), human squamous cell carcinoma antigen which may modulate the host 
immune response against tumor cells, human maspin which seems to function as a tumor 
suppressor, lepidopteran protease inhibitor, leukocyte elastase inhibitor (the only known 
intracellular serpin), and products from three orthopoxviruses (these products may be involved in 
the regulation of the blood clotting cascade and/or of the complement cascade in the mammalian 
host). 

In view of the growing use of proteases in industry, research, and medicine, there is an 
ongoing need in the art for new enzymes and new enzyme inhibitors. The present invention 
addresses these needs. 

In one embodiment, the protein of the invention or part thereof may be useful to quantify 
the amount of a given protease in a biological sample, and thus used in assays and diagnostic kits 
for the quantification of proteases in bodily fluids or other tissue samples, in addition to bacterial, 
fungal, plant, yeast, viral or mammalian cell cultures. In a preferred embodiment, the sample is 
assayed using a standard protease substrate. A known concentration of protease inhibitor is added, 
and allowed to bind to a particular protease present. The protease assay is then rerun, and the loss 
of activity is correlated to the protease inhibitor activity using techniques well known to those 
skilled in the art. Preferred proteases in this embodiment are serine proteases, more preferably 
elastase, trypsin and chymotrypsin. 
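
The correlation between loss of protease activity and inhibitor activity described above can be illustrated numerically. The following Python sketch is illustrative only: the function name, the simple percent-inhibition formula, and the example readings are assumptions introduced here and are not taken from the present disclosure or from any particular assay kit.

```python
def percent_inhibition(activity_before: float, activity_after: float) -> float:
    """Return the percentage of protease activity lost after inhibitor addition.

    activity_before: substrate turnover measured in the first assay run
                     (hypothetical units, e.g. absorbance change per minute).
    activity_after:  turnover measured when the assay is rerun with inhibitor.
    """
    if activity_before <= 0:
        raise ValueError("activity_before must be positive")
    if activity_after < 0:
        raise ValueError("activity_after must not be negative")
    # Fraction of the original activity that disappeared, expressed in percent.
    return 100.0 * (activity_before - activity_after) / activity_before


if __name__ == "__main__":
    # Hypothetical readings: 200 units before, 50 units after inhibitor addition.
    print(percent_inhibition(200.0, 50.0))
```

In such a sketch, a larger percentage for a fixed inhibitor concentration would suggest that more of the measured activity is attributable to a protease sensitive to that inhibitor.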

In addition, the protein of the invention or part thereof may be used to remove, identify or 
inhibit contaminating proteases in a sample. Compositions comprising the polypeptides of the 
present invention may be added to biological samples as a "cocktail" with other protease inhibitors 
to prevent degradation of protein samples. The advantage of using a cocktail of protease inhibitors 
is that one is able to inhibit a wide range of proteases without knowing the specificity of any of the 
proteases. Using a cocktail of protease inhibitors also protects a protein sample from a wide range 
of future unknown proteases which may contaminate a protein sample from a vast number of 
sources. For example, the protein of the invention or part thereof may be added to samples where 
proteolytic degradation by contaminating proteases is undesirable. Such protease inhibitor cocktails 
(see for example the ready to use cocktails sold by Sigma) are widely used in research laboratory 
assays to inhibit proteases capable of degrading a protein of interest for which the assay is to be 
performed. Alternatively, the protein of the invention or part thereof may be bound to a 
chromatographic support, either alone or in combination with other protease inhibitors, using 
techniques well known in the art, to form an affinity chromatography column. A sample containing 
the undesirable protease is run through the column to remove the protease. Alternatively, the same 
methods may be used to identify new proteases. 

In a preferred embodiment, the protein of the invention or part thereof may be used to 
inhibit proteases implicated in a number of diseases where cellular proteolysis occurs, such as 
diseases characterized by tissue degradation including but not limited to arthritis, muscular 
dystrophy, inflammation, tumor invasion, glomerulonephritis, parasite-borne infections, 
Alzheimer's disease, periodontal disease, and cancer metastasis. In a more preferred embodiment, 
the invention relates to compositions and methods to use the protein of the invention or part thereof 
in diseases characterized by abnormally elevated levels of trypsin, chymotrypsin or elastase, 
including but not limited to chronic emphysema of the lungs, cirrhosis, liver failure, cystic fibrosis, 
and alpha-1-antitrypsin deficiency-associated disorders such as aneurysm or toxic shock. For prevention 
and/or treatment purposes, the protein of the invention may be used according to any of the gene therapy 
methods described herein or known to those skilled in the art. 

In another preferred embodiment, the protein of the invention or part thereof may be useful 
to inhibit exogenous proteases, both in vivo and in vitro, implicated in a number of infectious 
diseases including but not limited to gingivitis, malaria, leishmaniasis, filariasis, osteoporosis and 
osteoarthritis, and other bacterial, parasite-borne or viral infections. In particular, the protein of 
the invention or part thereof may offer applications in viral diseases where the proteolysis of 
primary polypeptide precursors is essential to the replication of the virus, as for HIV and HCV. 

In another embodiment, the protease inhibitors of the present invention may be used as 
antibacterial agents to retard or inhibit the growth of certain bacteria either in vitro or in vivo. 
Particularly, an amount of the polypeptides of the present invention effective to inhibit proliferation 
may be used to inhibit the growth of group A streptococci on non-living matter such as surgical 
instruments, laboratory glassware and plasticware, and in cultures of living plant, fungal, and animal 
cells. 

Furthermore, the protease inhibitors of the present invention find use in drug potentiation 
applications. For example, therapeutic agents such as antibiotics or antitumor drugs can be 
inactivated through proteolysis by endogenous proteases, thus rendering the administered drug less 
effective or inactive. Accordingly, the protease inhibitors of the invention may be administered to a 
patient in conjunction with a therapeutic agent in order to potentiate or increase the activity of the 
drug. This co-administration may be by simultaneous administration, such as a mixture of the 
protease inhibitor and the drug, or by separate simultaneous or sequential administration. 

ATPKf Protein sequence of SEQ ID NO:253 (internal designation 1000867870) 



The protein of SEQ ID NO:253 encoded by the extended cDNA SEQ ID NO:84 and related 
polynucleotides of SEQ ID NO:404 is a variant of the human mitochondrial ATP synthase f subunit 
or ATPK (E.C. 3.6.1.34) and, as such, plays a role in cellular respiration. Preferred polypeptides of 
the invention are polypeptides comprising the amino acids of SEQ ID NO: 253 from positions 5 
to 88. Other preferred polypeptides of the invention are fragments of SEQ ID NO: 253 having any 
of the biological activities described herein. It will be appreciated that all characteristics and uses of 
the polynucleotides of SEQ ID NOs:84 and 404 and polypeptides of SEQ ID NO: 253 described 
throughout the present application also pertain to the human cDNA of clone 1000867870, and the 
polypeptides encoded thereby. 
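
A fragment defined by residue positions, such as positions 5 to 88 above, can be extracted from a full-length sequence as sketched below. This Python sketch is illustrative only: it assumes that positions are 1-based and inclusive, as is conventional in sequence listings, and both the function name and the placeholder sequence are hypothetical. The actual sequence of SEQ ID NO:253 is not reproduced here.

```python
def fragment(sequence: str, start: int, end: int) -> str:
    """Return residues start..end of a sequence, using 1-based inclusive positions."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("positions must satisfy 1 <= start <= end <= len(sequence)")
    # Python slices are 0-based and end-exclusive, hence the start - 1 offset.
    return sequence[start - 1:end]


if __name__ == "__main__":
    # Placeholder 100-residue sequence, NOT the sequence of SEQ ID NO:253.
    placeholder = "M" + "A" * 99
    piece = fragment(placeholder, 5, 88)
    print(len(piece))  # number of residues spanned by positions 5 to 88
```

Under this convention a fragment from position 5 to position 88 contains 88 - 5 + 1 = 84 residues.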

The mitochondrial electron transport (or respiratory) chain is a series of enzyme complexes 
in the mitochondrial membrane that is responsible for the transport of electrons from NADH to 
oxygen and the coupling of this oxidation to the synthesis of ATP (oxidative phosphorylation). ATP 
then provides the primary source of energy for driving a cell's many energy-requiring reactions. 
ATP synthase (F0F1 ATPase) is the enzyme complex at the terminus of this chain and serves as a 
reversible coupling device that interconverts the energies of an electrochemical proton gradient 
across the mitochondrial membrane into either the synthesis or hydrolysis of ATP. This gradient is 
produced by other enzymes of the respiratory chain in the course of electron transport from NADH 
to oxygen. When the cell's energy demands are high, electron transport from NADH to oxygen 
generates an electrochemical gradient across the mitochondrial membrane. Proton translocation 
from the outer to the inner side of the membrane drives the synthesis of ATP. Under conditions of 
low energy requirements and when there is an excess of ATP present, this electrochemical gradient 
is reversed and ATP synthase hydrolyzes ATP. The energy of hydrolysis is used to pump protons 
out of the mitochondrial matrix. ATP synthase is, therefore, a dual complex, the F0 portion of 
which is a transmembrane proton carrier or pump, and the F1 portion of which is catalytic and 
synthesizes or hydrolyzes ATP. Mammalian ATP synthase complex consists of sixteen different 
polypeptides (Walker, J. E. and Collinson, I. R. (1994) FEBS Lett. 346:39-43, which disclosure is 
hereby incorporated by reference in its entirety). Six of these polypeptides (subunits alpha, beta, 
gamma, delta, epsilon, and an ATPase inhibitor protein IF1) comprise the globular catalytic F1 
ATPase portion of the complex, which lies outside of the mitochondrial membrane. The remaining 
ten polypeptides (subunits a, b, c, d, e, f, g, F6, OSCP, and A6L) comprise the proton-translocating, 
membrane spanning F0 portion of the complex. Like other members of the respiratory chain, all but 
two of the polypeptide subunits of ATP synthase are nuclear gene products that are imported into 
the mitochondria. Enzyme complexes similar to mammalian ATP synthase are found in all cell 
types and in chloroplast and bacterial membranes. This universality indicates the central importance 
of this enzyme to ATP metabolism. Transcriptional regulation of these nuclear encoded genes 
appears to be the predominant means for controlling the biogenesis of ATP synthase. Multiple 
mitochondrial pathologies exist because of the essential role of mitochondrial oxidative 
phosphorylation in cellular energy production, in the generation of reactive oxygen species and in 
the initiation of apoptosis (Wallace, Science, 283:1482-1488, 1999, which disclosure is hereby 
incorporated by reference in its entirety). It is now clear that mitochondrial diseases encompass an 
assemblage of clinical problems commonly involving tissues that have high energy requirements 
such as heart, muscle and the renal and endocrine systems. Over the past 11 years, a considerable 
body of evidence has accumulated implicating defects in the mitochondrial energy-generating 
pathway, oxidative phosphorylation, in a wide variety of degenerative diseases including myopathy 
and cardiomyopathy. Most classes of pathogenic mitochondrial DNA mutations affect the heart, in 
association with a variety of other clinical manifestations that can include skeletal muscle, the 
central nervous system (including eye), the endocrine system, and the renal system. Nuclear 
mutations causing mitochondrial disorders have been described. They are often found in highly 
conserved subunits. Mitochondrial disorders with nuclear mutations include: myopathies (PEO, 
MNGIE, congenital muscular dystrophy, carnitine disorders), encephalopathies (Leigh, Infantile, 
Wilson's disease, Deafness-Dystonia syndrome), other systemic disorders and cardiomyopathies. 

The discovery of a new ATP synthase subunit, and polynucleotides encoding it, satisfies a 
need in the art by providing new compositions which are useful for the diagnosis, prevention, and 
treatment of cancer, myopathies, immune disorders, and neurological disorders. 

An object of the present invention relates to compositions and methods of targeting 
heterologous compounds, either polypeptides or polynucleotides to mitochondria by recombinantly 
or chemically fusing a fragment of the protein of the invention to a heterologous polypeptide or 
polynucleotide. Preferred fragments are signal peptides, amphiphilic alpha helices and/or any other 
fragments of the protein of the invention, or part thereof, that may contain targeting signals for 
mitochondria including but not limited to matrix targeting signals as defined in Herrmann and 
Neupert, Curr. Opinion Microbiol. 3:210-4 (2000); Bhagwat et al. J. Biol. Chem. 274:24014-22 
(1999); Murphy Trends Biotechnol. 15:326-30 (1997); Glaser et al. Plant Mol Biol 38:311-38 
(1998); Ciminale et al. Oncogene 18:4505-14 (1999), which disclosures are hereby incorporated by 
reference in their entireties. Such heterologous compounds may be used to modulate 
mitochondrial activities. For example, they may be used to induce and/or prevent mitochondrial-induced 
apoptosis or necrosis. In addition, heterologous polynucleotides may be used for 
mitochondrial gene therapy to replace a defective mitochondrial gene and/or to inhibit the 
deleterious expression of a mitochondrial gene. 

The invention further relates to methods and compositions using the protein of the invention 
or part thereof to diagnose, prevent and/or treat several disorders in which the mitochondrial respiratory 
electron transport chain is impaired, including but not limited to mitochondriocytopathies, necrosis, 
aging, myopathies, cancer and neurodegenerative diseases such as Alzheimer's disease, 
Huntington's disease, Parkinson's disease, epilepsy, Down's syndrome, dementia, multiple sclerosis, 
and amyotrophic lateral sclerosis. For diagnostic purposes, the expression of the protein of the 
invention could be investigated using any of the Northern blotting, RT-PCR or immunoblotting 
methods described herein and compared to the expression in control individuals. For prevention 
and/or treatment purposes, the protein of the invention may be used to enhance electron transport 
and increase energy delivery using any of the gene therapy methods described herein or known to 
those skilled in the art. 

In another embodiment, the invention further relates to methods and compositions using the 
protein of the invention or part thereof to diagnose, prevent and/or treat several disorders in which 
the mitochondrial respiratory electron transport chain needs to be impaired, including but not limited to 
Sjogren's syndrome, Addison's disease, bronchitis, dermatomyositis, polymyositis, 
glomerulonephritis, diabetes mellitus, emphysema, Graves' disease, atrophic gastritis, lupus 
erythematosus, myasthenia gravis, multiple sclerosis, autoimmune thyroiditis, ulcerative colitis, 
anemia, pancreatitis, scleroderma, rheumatoid and osteoarthritis, asthma, allergic rhinitis, atopic 
dermatitis, dermatomyositis, polymyositis, and gout, using any techniques known to those skilled in 
the art including the antisense or triple helix strategies described herein. 

Moreover, antibodies to the protein of the invention or part thereof may be used for 
detection of mitochondria organelles and/or mitochondrial membranes using any techniques known 
to those skilled in the art. 

Oligomerization Protein sequence of SEQ ID NO:310 (internal designation 0150568) 

The protein of SEQ ID NO:310 encoded by the cDNA of SEQ ID NOs:141 and 435 is 
able to form homo-oligomers. Preferred polypeptides of the invention are polypeptides comprising 
the amino acids of SEQ ID NO:310 from positions 1 to 109. Other preferred polypeptides of the 
invention are fragments of SEQ ID NO:310 having any of the biological activities described herein. 

Multivalency is a prerequisite for a variety of macromolecular interactions such as binding 
of antibodies or lectins to specific targets, ligand recognition, activation or inhibition of receptors 
and cell adhesion. Dimerization and oligomerization of proteins are thus general biological control 
mechanisms that contribute to the activation of cell membrane receptors, transcription factors, 
vesicle fusion proteins, and other classes of intra- and extracellular proteins. 

Multimerization domains have been shown to be useful tools in several areas of 
biotechnology, especially in protein engineering. For example, Tso et al. have used leucine zippers 
for producing bispecific antibody heterodimers (US patent 5,932,448); methods of preparing 
soluble oligomeric proteins using leucine zippers have been described by Conrad et al. (US patent 
5,965,712), Ciardelli et al. (US patent 5,837,816), and Spriggs et al. (WO9410308); and leucine zipper 
forming sequences have been used by Pelletier et al. in protein fragment complementation assays to 
detect biomolecular interactions (WO9834120), which disclosures are hereby incorporated by 
reference in their entireties. Because of their usefulness in biotechnology, it is thus of considerable 
interest to isolate new multimerization domains. 

The multimerization activity of the protein of the invention or part thereof may be assayed 
using any of the assays known to those skilled in the art including circular dichroism spectroscopy, gel 
filtration chromatography and thermal melting analyses. 

In one embodiment, the invention relates to compositions and methods of using the protein 
of the invention or part thereof for preparing soluble multimeric proteins, which consist of 
multimers of fusion proteins containing a multimerization domain fused to a protein of interest, 
using any technique known to those skilled in the art including those described in international 
patent application WO9410308, which disclosure is hereby incorporated by reference in its entirety. 

In another embodiment, the protein of the invention or part thereof or derivative thereof is 
used for detection and determination of an analyte in a biological liquid using the teachings of US 
patent 5,643,731, which disclosure is hereby incorporated by reference in its entirety. Briefly, a 
first multimerization domain is immobilized on a solid support and the second multimerization 
domain is coupled to a specific binding partner for an analyte in a biological fluid. The two 
peptides are then brought into contact thereby immobilizing the binding partner on the solid phase. 
The biological sample is then contacted with the immobilized binding partner and the amount of 
analyte in the sample bound to the binding partner is determined. 

In still another embodiment, the protein of the invention or part thereof may be used to 
construct multimerization devices comprising hybrid molecules with a functional domain fused to a 
multimerization domain in order to yield multimeric complexes with improved pharmacokinetic and 
pharmacological properties as described in WO0102440, which disclosure is hereby incorporated 
by reference in its entirety. In a preferred embodiment, the protein of the invention or part thereof 
may be used to construct different fusion proteins with different functional domains such as enzyme 
moieties or cytotoxic moieties. Vectors encoding these different proteins may then be transfected in 
the same host cell in conditions allowing for multimerization, thus yielding multimeric 
multifunctional complexes. 

Chaperone Protein of SEQ ID NO:303 (internal designation D637548) 

The protein of SEQ ID NO:303 encoded by the cDNA of SEQ ID NO:134 is a chaperonin. 
Accordingly, the protein of SEQ ID NO:303 plays a role in protein synthesis/folding, cellular 
trafficking, and the cellular stress response. In addition, the protein of SEQ ID NO:303 has 
immunosuppressant and growth factor properties. It is able to depress delayed type hypersensitivity 
reactions. It is a product of primary and neoplastic cell proliferation and under these conditions acts 
as a growth factor. It is also a product of platelet activation and may play a part in wound healing 
and skin repair. Preferred polypeptides of the invention are polypeptides comprising the amino 
acids of SEQ ID NO:303 from positions 9 to 33, or from positions 7 to 101. Other preferred 
polypeptides of the invention are fragments of SEQ ID NO:303 having any of the biological 
activities described herein. The different activities of the protein of the invention or part thereof 
may be assayed using any of the assays described in US 6,117,421 or any of the assays referred to in 
US 6,117,421, which disclosures are hereby incorporated by reference in their entireties. 

Chaperonins belong to a wider class of molecular chaperones, molecules involved in post-translational 
folding, targeting and assembly of other proteins, but which do not themselves form 
part of the final assembled structure as discussed by Ellis et al., 1991, Annu. Rev. Biochem. 60 321-347, 
which disclosure is hereby incorporated by reference in its entirety. Most molecular 
chaperones are "heat shock" or "stress" proteins (hsp); i.e. their production is induced or increased 
by a variety of cellular insults (such as metabolic disruption, oxygen radicals, inflammation, 
infection and transformation), heat being only one of the better studied stresses as reviewed by 
Lindquist et al., 1988, Annu. Rev. Genet. 22 631-677, which disclosure is hereby incorporated by 
reference in its entirety. As well as these quantitative changes in specific protein levels, stress can 
induce the movement of constitutively produced stress proteins to different cellular compartments 
as referred to in the Lindquist reference mentioned above. The heat shock response is one of the 
most highly conserved genetic systems known and the various heat shock protein families are among 
the most evolutionarily stable proteins in existence. The major stress proteins accumulate to very 
high levels in stressed cells but occur at low to moderate levels in cells that have not been stressed. 
As well as enabling cells to cope under adverse conditions, members of these families perform 
essential functions in normal cells. 

Chaperones are also involved in a number of disorders, especially autoimmune diseases 
such as type 1 diabetes, rheumatoid arthritis, systemic lupus erythematosus, Sjogren syndrome, and 
mixed connective tissue disease (Feige et al. EXS 1996; 77:359-73; Feili-Hariri et al. J Autoimmun 
2000; 14:133-42, which disclosures are hereby incorporated by reference in their entireties). 
Chaperones are also involved in various disorders including tuberculosis and leprosy (Zugel et al. 
Clin Microbiol Rev 1999; 12:19-39), neurodegenerative disorders such as Alzheimer and Parkinson 
diseases (Yoo et al. J Neural Transm Suppl 1999; 57:315-22), and malignant disorders (Csermely et 
al. Pharmacol Ther 1998; 79:129-68), which disclosures are hereby incorporated by reference in 
their entireties. 

In one embodiment, the protein of the invention or part thereof may be used to detect a 
potential pregnancy, preferably within 6-24 hours of fertilization using the teaching of Morton et al., 
1976, Proc. R. Soc. B. 193 413-41 and US 6,117,421, which disclosures are hereby incorporated by 
reference in their entireties. Detection of the expression or activity of the protein of the invention 
may be performed using any techniques known to those skilled in the art including those described 
elsewhere in the application. 

In another embodiment, molecules able to block the expression or activity of the protein of 
the invention, such as antibodies, antisense or triple helix oligonucleotides, dominant negative 
forms of the protein, polypeptides or small molecule inhibitors of the expression or activity of the 
proteins, may be used to induce abortion as described in US patent 6,117,421. 

In still another embodiment, the protein of the invention or part thereof may be used to treat 
and/or prevent infertility and miscarriage using the simple administration of the protein of the 
invention or part thereof or using any of the gene therapy methods described elsewhere in the 
application and the teaching of the US patent 6,117,421. 

In another embodiment, the present invention provides methods of using the present proteins 
to identify specific cell types in vitro and in vivo. For example, as chaperone proteins are often 
upregulated in response to cellular stress, the detection of cells expressing elevated levels of the 
proteins provides a tool for detecting cells under stress. As cellular stress has been implicated in a 
number of disorders, such as cardiovascular disorders, neurodegenerative disorders, and cancer, the 
ability to detect such stress thus provides a diagnostic or screening tool for such conditions. 

In addition, the present polypeptides and polynucleotides can be used to develop diagnostic 
and screening assays for diseases characterized by an abnormal level or activity of the protein of 
SEQ ID NO: 303 such as malignant disorders of various types, and autoimmune diseases including 
type 1 diabetes, rheumatoid arthritis, systemic lupus erythematosus, Sjogren syndrome, Graves 
disease, multiple sclerosis, and mixed connective tissue disease. Such assays can be performed 
using any biological sample, such as serum or plasma. 

In another embodiment, various disorders can be treated, attenuated and/or prevented by a 
protein of SEQ ID NO: 303, or part thereof, or any other compound that can affect the level or 
activity of the proteins such as nucleic acids, antibodies, or chemical substances. In a preferred 
embodiment, proteins or other compounds directed to the proteins of the invention can be used to 
treat or prevent disorders in which the activity or level of the protein of SEQ ID NO: 303 is 
unbalanced. Such diseases include, but are not limited to, infectious diseases, neurodegenerative 
disorders such as Alzheimer and Parkinson diseases, schizophrenia, alopecia, aging, atherosclerosis, 
malignant disorders of various types, and autoimmune diseases including type 1 diabetes, 
rheumatoid arthritis, systemic lupus erythematosus, Sjogren syndrome, and mixed connective tissue 
disease, as well as any other malignant, autoimmune or neurodegenerative disorder. In another 
embodiment, the proteins of SEQ ID NO: 303 or part thereof can be used as vaccines for various 
disorders including, but not limited to, cancer (Wang et al. Immunol Invest 2000;29:131-7), 
tuberculosis (Silva et al. Microbes Infect 1999;1:429-35), diabetes (Int Immunol 1999;11:957-66), 
and atherosclerosis (Xu et al. Arterioscler Thromb 1992;12:789-99), which disclosures are hereby 
incorporated by reference in their entireties. 

One embodiment of the present invention relates to methods and compositions using the 
protein of SEQ ID NO:303 or fragments thereof as a stabilizing adjuvant to slow down protein 
degradation, boost the yields of recombinant proteins, prevent the aggregation of proteins or 
regenerate denatured proteins. In a preferred embodiment, the protein of SEQ ID NO:303 or a 
fragment thereof is mixed with a composition comprising the protein for which it is desired to slow 
down degradation, boost yield, or regenerate denatured proteins under conditions which facilitate 
the desired result. For example, numerous commercial assay kits commonly used by those skilled 
in the arts of molecular biology and biochemistry depend on the biological properties of proteins 
(mostly enzymes) which can be very short-lived in vitro due to the low stability of those proteins. 
An example is described in Eur. Patent DE4124286, the disclosure of which is incorporated herein 
by reference in its entirety, wherein the low intrinsic stability of test solutions used in optical tests is 
increased by addition of chaperone proteins, thus making the test more sensitive. Another example 
is given in US patent 6,013,488, which disclosure is hereby incorporated by reference in its entirety, 
wherein a heat-labile reverse transcriptase is able to perform cDNA synthesis at high temperatures 
in the presence of a chaperone. 

The protein of SEQ ID NO:303 may also be used to increase the yield or activity of 
recombinant proteins, preferably secreted proteins. In recombinant DNA technology, a major 
unsolved problem is the solubility and biological activity of the recombinantly overexpressed 
protein in a host, especially a bacterial or yeast host. Many eukaryotic proteins, especially the 
secreted ones, require for correct folding a specific cellular machinery which is lacking in bacterial 
hosts such as E. coli or becomes insufficient in mammalian/yeast cells due to high expression of the 
protein. The ability of the protein of SEQ ID NO:303 or fragments thereof to ensure proper folding 
of recombinant proteins may be utilized as follows. The protein of SEQ ID NO:303, or fragment 
thereof, may be coexpressed with the recombinant protein in bacterial or eukaryotic hosts to cause 
the hosts to express the heterologous proteins or polypeptides in a form having increased solubility 
and/or biological activity. For example, the protein of SEQ ID NO:303 or fragments thereof may 
be used in the methods described in US patent 5,773,245, the disclosure of which is incorporated 
herein by reference in its entirety. Therefore the invention relates to a method for the correct 
folding, deaggregation or prevention of aggregation of a monomeric protein in vivo comprising: (a) 
constructing a host cell transformed with (i) a first DNA encoding a polypeptide having the amino 
acid sequence of a bioactive protein or a precursor thereof, wherein said polypeptide or precursor 
can aggregate within the cell to result in a multimeric, non-bioactive protein or precursor thereof 
and (ii) a second DNA which enables the cell to co-express the protein of the invention or part 
thereof with the said polypeptide or precursor; (b) growing said host cell for sufficient time under 
conditions wherein said first DNA and said second DNA express said bioactive protein and said 
protein of the invention, respectively; and (c) obtaining monomeric protein that is a bioactive 
protein. Alternatively, the protein of SEQ ID NO:303 or fragments thereof may be exogenously 
added to the cell cultures as described in PCT application WO 00/08135, the disclosure of which is 
incorporated herein by reference in its entirety. 

The protein of SEQ ID NO:303 or fragments thereof, may further be used to regenerate 
denatured proteins. Recombinantly expressed proteins with poor biological activity are routinely 
denatured with a potent denaturing agent, such as guanidine hydrochloride, followed by refolding 
by dilution with a large amount of a diluent to reduce the concentration of the denaturing agent. 
However, this method often results in a poor refolding rate which may be significantly increased by 
addition of a cocktail of chaperone proteins in a fashion similar to that described in Eur. Patent 
EP0650975, the disclosure of which is incorporated herein by reference in its entirety. The 
advantage of using a cocktail of chaperone proteins is to accommodate differences in binding 
specificity of the different Hsp families and the different members within each family. 

In another embodiment of the present invention, the protein of SEQ ID NO:303 may be
used to promote tissue repair and/or increase cell survival in stress conditions such as hypoxia,
oxidative stress, genotoxic agents and more generally harmful conditions leading to programmed
cell death. Those conditions include but are not limited to infarction, heart surgery, stroke,
neurodegenerative diseases, epilepsy, trauma, atherosclerosis, restenosis after angioplasty, and
nerve damage.

In addition, the invention relates to compositions and methods for promoting cell growth
both in vitro and in vivo using any of the techniques known to those skilled in the art including
those described in US patent 6,117,421. For example, soluble forms of the protein of the
invention or part thereof may be added to cell culture medium in an amount effective to stimulate
cell proliferation. Alternatively, any of the gene therapy methods described herein may be used to
overexpress the protein of the invention or part thereof in vivo. Alternatively, the protein of the
invention or part thereof may be directly administered to a subject in an amount effective to promote
cell growth in said subject. These applications are particularly important in individuals suffering from
wounds or tissue damage to enhance tissue repair, in individuals to which organ or skin grafts have
been applied, and in individuals suffering from an inflammatory condition or an allergic disease.

In addition, the invention relates to compositions and methods for promoting
immunosuppression in a subject using any of the techniques known to those skilled in the art
including those described in US patent 6,117,421. For example, the protein of the invention or
part thereof may be directly administered in an amount effective to achieve immunosuppression in
said subject. Alternatively, any of the gene therapy methods described herein may be used to
overexpress the protein of the invention or part thereof in vivo. These applications are particularly
important in cases in which immunosuppression is desired, such as in individuals suffering from
autoimmune disease including any of the diseases cited above and in individuals that have received
a heterologous graft that they could reject.

Ion transport Protein of SEQ ID NO: 276 (internal designation D538694) 

The protein of SEQ ID NO:276 encoded by the cDNA of SEQ ID NO:107 belongs to the
FXYD family of small ion transport regulators or channels (Sweadner and Rael (2000) Genomics
68:41-56, which disclosure is hereby incorporated by reference in its entirety). The protein of SEQ
ID NO:276 or part thereof plays a role in the control of ion transport. Preferred polypeptides of the
invention are polypeptides comprising the amino acids of SEQ ID NO:276 from positions 9 to 63,
or from positions 16 to 29. Other preferred polypeptides of the invention are fragments of SEQ ID
NO:276 having any of the biological activities described herein. The activity of the protein of the
invention or part thereof may be assayed using any of the assays known to those skilled in the art
including those described in .

Osmoregulation occurs in all organisms, though the mechanisms differ according to the
organism's environment. Fresh water inhabitants need to retain salts, whereas ocean inhabitants
need to retain water. Terrestrial inhabitants need to conserve both water and salts. Organisms must
balance these needs with a requirement to eliminate metabolic waste, such as nitrogenous waste,
and generate secreted body fluids, such as saliva for digestion and sweat for thermoregulation.

In mammals, sweat glands, salivary glands, and the kidney all produce a primary secretion
that is essentially isosmotic with blood and extracellular fluids. Modification of this primary 
secretion then occurs as much of the sodium chloride and water are reabsorbed as they pass through 
the excretory ducts of the glands and kidney, whereas potassium and bicarbonate ions are secreted. 
This modification of the primary secretion is important in the sweat glands to conserve sodium
chloride in hot environments, and in the salivary glands to conserve sodium chloride when
excessive quantities of saliva are lost. This modification is critical in the kidney to maintain proper
sodium and water balance in the extracellular fluids, a balance which also regulates arterial
pressure. Loss of this modification activity by the duct cells causes a large loss of sodium and water,
resulting in severe dehydration and low blood volume, and ultimately in circulatory collapse.

Sodium absorption by the intestines, especially in the colon, is necessary to prevent loss of
sodium in the stools. The loss of sodium absorption produces a failure to absorb anions and water as
well. The unabsorbed sodium chloride and water then lead to diarrhea, with further loss of sodium
chloride from the body. Other body fluids may be under regulation similar to that seen in the
systems described above. For example, cerebrospinal fluid is produced by active sodium ion
transport from the capillaries across the epithelium of the choroid plexus, which in turn attracts
chloride ions and water. A counter flow of potassium and bicarbonate ions moves out of the
cerebrospinal fluid into the capillaries. A dysfunction in osmoregulation is associated with several
disease states, including hyponatremia, renal failure, and hypernatremia (Strange, K. (1992) J. Am.
Soc. Nephrol. 3:12-27, which disclosure is hereby incorporated by reference in its entirety).

In one embodiment, the protein of the invention may be useful in the diagnosis, prevention
and/or treatment of osmoregulatory disorders including but not limited to diabetes insipidus,
diarrhea, peritonitis, chronic renal failure, Addison's disease, syndrome of inappropriate antidiuretic
hormone (SIADH), hypoaldosteronism, hyponatremia, adrenal insufficiency, hypothyroidism,
hypernatremia, hypokalemia, Bartter's syndrome, Cushing's syndrome, metabolic acidosis, metabolic
alkalosis, encephalopathy, edema, hypotension, and hypertension. For example, any of the gene
therapy methods described herein may be used to overexpress the protein of the invention or part
thereof in vivo. Alternatively, the protein of the invention or part thereof may be directly
administered in an amount effective to promote cell growth in said subject. For diagnostic
purposes, the expression of the protein of the invention could be investigated using any of the
Northern blotting, RT-PCR or immunoblotting methods described herein and compared to the
expression in control individuals. For prevention and/or treatment purposes, the protein of the
invention may be used to enhance ion transport and prevent or treat osmoregulatory disorders using
any of the gene therapy methods described herein.

Uses of antibodies

Antibodies of the present invention have uses that include, but are not limited to, methods
known in the art to purify, detect, and target the polypeptides of the present invention including
both in vitro and in vivo diagnostic and therapeutic methods. An example of such use using
immunoaffinity chromatography is given below. The antibodies of the present invention may be
used either alone or in combination with other compositions. For example, the antibodies have use
in immunoassays for qualitatively and quantitatively measuring levels of antigen-bearing substances,
including the polypeptides of the present invention, in biological samples (See, e.g., Harlow et al.,
1988, incorporated by reference in its entirety). The antibodies may also be used in therapeutic
compositions for killing cells expressing the protein or reducing the levels of the protein in the body.

The invention further relates to antibodies that act as agonists or antagonists of the
polypeptides of the present invention. For example, the present invention includes antibodies that
disrupt the receptor/ligand interactions with the polypeptides of the invention either partially or
fully. Included are both receptor-specific antibodies and ligand-specific antibodies. Included are
receptor-specific antibodies which do not prevent ligand binding but prevent receptor activation.
Receptor activation (i.e., signaling) may be determined by techniques described herein or otherwise
known in the art. Also included are receptor-specific antibodies which prevent both ligand binding
and receptor activation. Likewise, included are neutralizing antibodies that bind the ligand and
prevent binding of the ligand to the receptor, as well as antibodies that bind the ligand, thereby
preventing receptor activation, but do not prevent the ligand from binding the receptor. Further
included are antibodies that activate the receptor. These antibodies may act as agonists for either all
or less than all of the biological activities affected by ligand-mediated receptor activation. The
antibodies may be specified as agonists or antagonists for biological activities comprising specific
activities disclosed herein. The above antibody agonists can be made using methods known in the
art. See, e.g., WO 96/40281; US Patent 5,811,097; Deng et al. (1998); Chen et al. (1998); Harrop et
al. (1998); Zhu et al. (1998); Yoon et al. (1998); Prat et al. (1998); Pitard et al. (1997); Liautard et
al. (1997); Carlson et al. (1997); Taryman et al. (1995); Muller et al. (1998); Bartunek et al. (1996)
(said references incorporated by reference in their entireties).

As discussed above, antibodies to the polypeptides of the invention can, in turn, be utilized
to generate anti-idiotypic antibodies that "mimic" polypeptides of the invention using techniques
well known to those skilled in the art (See, e.g., Greenspan and Bona (1989) and Nissinoff (1991),
which disclosures are hereby incorporated by reference in their entireties). For example, antibodies
which bind to and competitively inhibit polypeptide multimerization or binding of a polypeptide of
the invention to ligand can be used to generate anti-idiotypes that "mimic" the polypeptide
multimerization or binding domain and, as a consequence, bind to and neutralize polypeptide or its
ligand. Such neutralizing anti-idiotypic antibodies can be used to bind a polypeptide of the
invention or to bind its ligands/receptors, and thereby block its biological activity.

Immunoaffinity Chromatography

Antibodies prepared as described herein are coupled to a support. Preferably, the antibodies
are monoclonal antibodies, but polyclonal antibodies may also be used. The support may be any of
those typically employed in immunoaffinity chromatography, including Sepharose CL-4B
(Pharmacia, Piscataway, NJ), Sepharose CL-2B (Pharmacia, Piscataway, NJ), Affi-gel 10 (Biorad,
Richmond, CA), or glass beads.

The antibodies may be coupled to the support using any of the coupling reagents typically
used in immunoaffinity chromatography, including cyanogen bromide. After coupling the antibody
to the support, the support is contacted with a sample which contains a target polypeptide whose
isolation, purification or enrichment is desired. The target polypeptide may be a polypeptide
selected from the group consisting of sequences of SEQ ID NOs:170-338, 456-560, 785-918 and
polypeptides encoded by the clone inserts of the deposited clone pool, variants and fragments
thereof, or a fusion protein comprising said selected polypeptide or a fragment thereof.

Preferably, the sample is placed in contact with the support for a sufficient amount of time
and under appropriate conditions to allow at least 50% of the target polypeptide to specifically bind
to the antibody coupled to the support.

Thereafter, the support is washed with an appropriate wash solution to remove polypeptides
which have non-specifically adhered to the support. The wash solution may be any of those
typically employed in immunoaffinity chromatography, including PBS, Tris-lithium chloride buffer
(0.1 M lysine base and 0.5 M lithium chloride, pH 8.0), Tris-hydrochloride buffer (0.05 M
Tris-hydrochloride, pH 8.0), or Tris/Triton/NaCl buffer (50 mM Tris-Cl, pH 8.0 or 9.0, 0.1%
Triton X-100, and 0.5 M NaCl).

After washing, the specifically bound target polypeptide is eluted from the support using
the high pH or low pH elution solutions typically employed in immunoaffinity chromatography. In
particular, the elution solutions may contain an eluant such as triethanolamine, diethylamine,
calcium chloride, sodium thiocyanate, potassium bromide, acetic acid, or glycine. In some
embodiments, the elution solution may also contain a detergent such as Triton X-100 or
octyl-beta-D-glucoside.

Expression of Genset gene products 

Spatial expression of the GENSET genes of the invention

Tissue expression of the cDNAs of the present invention was examined. Tables III and IV
list the number of hits for the cDNAs in Genset's libraries of tissues and cell types as well as in
public databases. The tissues and cell types examined for polynucleotide expression were, for
Table III: Brain; Fetal brain; Fetal kidney; Fetal liver; Pituitary gland; Liver; Placenta; Prostate;
Salivary gland; Stomach/Intestine; and Testis. For each cDNA referred to by its corresponding
sequence identification number from the priority application (see Table I for corresponding SEQ ID
NO in present application), the number of proprietary 5'ESTs (i.e. cDNA fragments) expressed in a
particular tissue referred to by its name is indicated in parentheses (second column). In addition, the
bias in the spatial distribution of the polynucleotide sequences of the present invention was
examined by comparing the relative proportions of the biological polynucleotides of a given tissue
using the following statistical analysis. The under- or over-representation of a polynucleotide of a
given cluster in a given tissue was assessed using the normal approximation of the binomial
distribution. When the observed proportion of a polynucleotide of a given tissue in a given
consensus had less than 1% chance to occur randomly according to the chi2 test, the frequency bias
was reported as "preferred". The results are given in Table V as follows. For each polynucleotide
showing a bias in tissue distribution as referred to by its sequence identification number in the first
column, the list of tissues where the polynucleotides are under-represented is given in the second
column entitled "low expression" and the list of tissues where the polynucleotides are
over-represented is given in the third column entitled "high expression".
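
The statistical analysis described above can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions: the specification does not say how the expected proportion was derived, so the example assumes it equals the share of all ESTs that come from the tissue library, and the function and parameter names are hypothetical. The squared z statistic of the normal approximation is a chi-square statistic with one degree of freedom, so the two-sided p-value below agrees with the corresponding chi2 test.

```python
import math


def tissue_bias(hits_in_tissue, hits_total, tissue_ests, total_ests, alpha=0.01):
    """Classify the tissue bias of one cluster (hypothetical helper).

    hits_in_tissue: ESTs of the cluster found in the tissue library
    hits_total:     ESTs of the cluster found in all libraries
    tissue_ests:    size of the tissue library (assumed basis for the null)
    total_ests:     combined size of all libraries
    alpha:          significance threshold (1% as in the text)
    """
    if hits_total <= 0 or not 0 < tissue_ests < total_ests:
        raise ValueError("counts must be positive and tissue_ests < total_ests")

    # Null hypothesis: the cluster is sampled from the tissue library in
    # proportion to the size of that library.
    p0 = tissue_ests / total_ests
    expected = hits_total * p0
    std_dev = math.sqrt(hits_total * p0 * (1.0 - p0))

    # Normal approximation of the binomial distribution.
    z = (hits_in_tissue - expected) / std_dev

    # Two-sided p-value; identical to a 1-degree-of-freedom chi-square on z**2.
    p_value = math.erfc(abs(z) / math.sqrt(2.0))

    if p_value >= alpha:
        return None  # no significant bias
    return "high expression" if z > 0 else "low expression"
```

For example, a cluster with 30 of its 40 ESTs in a library that holds 10% of all ESTs would be classified as "high expression", whereas 4 of 40 matches the expected proportion and yields no bias call. The normal approximation is only reliable when the expected counts are not very small.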

Evaluation of Expression Levels and Patterns of GENSET polypeptide-encoding mRNAs 

The spatial and temporal expression patterns of GENSET polypeptide-encoding mRNAs, as
well as their expression levels, may also be further determined as follows.

Expression levels and patterns of GENSET polypeptide-encoding mRNAs may be analyzed
by solution hybridization with long probes as described in International Patent Application No. WO
97/05277, the entire contents of which are hereby incorporated by reference. Briefly, a GENSET
polynucleotide, or fragment thereof, corresponding to the gene encoding the mRNA to be
characterized is inserted at a cloning site immediately downstream of a bacteriophage (T3, T7 or
SP6) RNA polymerase promoter to produce antisense RNA. Preferably, the GENSET
polynucleotide is at least 100 nucleotides in length. The plasmid is linearized and transcribed in
the presence of ribonucleotides comprising modified ribonucleotides (i.e. biotin-UTP and
DIG-UTP). An excess of this doubly labeled RNA is hybridized in solution with mRNA isolated from
cells or tissues of interest. The hybridizations are performed under standard stringent conditions
(40-50°C for 16 hours in an 80% formamide, 0.4 M NaCl buffer, pH 7-8). The unhybridized probe
is removed by digestion with ribonucleases specific for single-stranded RNA (i.e. RNases CL3, T1,
Phy M, U2 or A). The presence of the biotin-UTP modification enables capture of the hybrid on a
microtitration plate coated with streptavidin. The presence of the DIG modification enables the
hybrid to be detected and quantified by ELISA using an anti-DIG antibody coupled to alkaline
phosphatase.

The GENSET polypeptide-encoding cDNAs, or fragments thereof, may also be tagged with
nucleotide sequences for the serial analysis of gene expression (SAGE) as disclosed in UK Patent
Application No. 2 305 241 A, the entire contents of which are incorporated by reference. In this
method, cDNAs are prepared from a cell, tissue, organism or other source of nucleic acid for which
it is desired to determine gene expression patterns. The resulting cDNAs are separated into two
pools. The cDNAs in each pool are cleaved with a first restriction endonuclease, called an
"anchoring enzyme," having a recognition site which is likely to be present at least once in most
cDNAs. The fragments which contain the 5' or 3' most region of the cleaved cDNA are isolated by
binding to a capture medium such as streptavidin coated beads. A first oligonucleotide linker
having a first sequence for hybridization of an amplification primer and an internal restriction site
for a "tagging endonuclease" is ligated to the digested cDNAs in the first pool. Digestion with the
second endonuclease produces short "tag" fragments from the cDNAs. A second oligonucleotide
having a second sequence for hybridization of an amplification primer and an internal restriction
site is ligated to the digested cDNAs in the second pool. The cDNA fragments in the second pool
are also digested with the "tagging endonuclease" to generate short "tag" fragments derived from
the cDNAs in the second pool. The "tags" resulting from digestion of the first and second pools
with the anchoring enzyme and the tagging endonuclease are ligated to one another to produce
"ditags." In some embodiments, the ditags are concatamerized to produce ligation products
containing from 2 to 200 ditags. The tag sequences are then determined and compared to the
sequences of the GENSET polypeptide-encoding cDNAs to determine which genes are expressed in
the cell, tissue, organism, or other source of nucleic acids from which the tags were derived. In this
way, the expression pattern of a GENSET polypeptide-encoding gene in the cell, tissue, organism,
or other source of nucleic acids is obtained.
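
The final comparison step, in which sequenced tags are matched against the cDNA sequences, might be sketched in Python as below. The sketch assumes the tags have already been separated from their ditags and oriented to the sense strand, that the 3'-most anchoring site of each cDNA is used, and that the anchoring site is CATG with a 10-base tag. None of these values is given in the text above; they and all function names are illustrative assumptions only.

```python
from collections import Counter

ANCHOR_SITE = "CATG"  # assumed anchoring-enzyme recognition site
TAG_LENGTH = 10       # assumed tag length in bases


def reference_tag(cdna, anchor=ANCHOR_SITE, tag_length=TAG_LENGTH):
    """Return the tag adjacent to the 3'-most anchoring site, or None."""
    pos = cdna.rfind(anchor)
    if pos == -1:
        return None  # cDNA lacks the anchoring site
    start = pos + len(anchor)
    tag = cdna[start:start + tag_length]
    # Discard tags that run off the end of the sequence.
    return tag if len(tag) == tag_length else None


def count_expressed(observed_tags, cdnas):
    """Count observed tags per cDNA name.

    observed_tags: iterable of sequenced tag strings
    cdnas:         dict mapping a cDNA name to its sense-strand sequence
    """
    tag_to_name = {}
    for name, sequence in cdnas.items():
        tag = reference_tag(sequence.upper())
        if tag is not None:
            tag_to_name[tag] = name

    counts = Counter()
    for tag in observed_tags:
        name = tag_to_name.get(tag.upper())
        if name is not None:  # tags matching no known cDNA are ignored
            counts[name] += 1
    return counts
```

The resulting per-cDNA counts give a relative measure of expression in the sampled source. A fuller treatment would also have to handle tags shared by several cDNAs and sequencing errors, which this sketch ignores.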

Quantitative analysis of GENSET gene expression may also be performed using arrays.
For example, quantitative analysis of gene expression may be performed with GENSET
polynucleotides, or fragments thereof, in a complementary DNA microarray as described by Schena
et al. (1995 and 1996), which disclosures are hereby incorporated by reference in their entireties.
GENSET polypeptide-encoding cDNAs or fragments thereof are amplified by PCR and arrayed
from 96-well microtiter plates onto silylated microscope slides using high-speed robotics. Printed
arrays are incubated in a humid chamber to allow rehydration of the array elements and rinsed once
in 0.2% SDS for 1 min, twice in water for 1 min and once for 5 min in sodium borohydride solution.
The arrays are submerged in water for 2 min at 95°C, transferred into 0.2% SDS for 1 min, rinsed
twice with water, air dried and stored in the dark at 25°C. Cell or tissue mRNA is isolated or
commercially obtained and probes are prepared by a single round of reverse transcription. Probes
are hybridized to 1 cm² microarrays under a 14 x 14 mm glass coverslip for 6-12 hours at 60°C.
Arrays are washed for 5 min at 25°C in low stringency wash buffer (1X SSC/0.2% SDS), then for
10 min at room temperature in high stringency wash buffer (0.1X SSC/0.2% SDS). Arrays are
scanned in 0.1X SSC using a fluorescence laser scanning device fitted with a custom filter set.
Accurate differential expression measurements are obtained by taking the average of the ratios of
two independent hybridizations.
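
The averaging step mentioned above can be illustrated with a short Python sketch. It assumes that each hybridization yields a background-corrected signal for the test sample and for a reference sample on the same array element; the function name and input layout are hypothetical and are not taken from Schena et al.

```python
def mean_expression_ratio(hybridizations):
    """Average the test/reference signal ratios of independent hybridizations.

    hybridizations: list of (test_signal, reference_signal) pairs, one pair
                    per independent hybridization of the same array element.
    """
    if not hybridizations:
        raise ValueError("at least one hybridization is required")

    ratios = []
    for test_signal, reference_signal in hybridizations:
        if reference_signal <= 0:
            raise ValueError("reference signal must be positive")
        ratios.append(test_signal / reference_signal)

    # Arithmetic mean of the per-hybridization ratios.
    return sum(ratios) / len(ratios)
```

With two hybridizations giving ratios of 2.0 and 3.0, the reported differential expression would be 2.5. Averaging log ratios is a common alternative, but the text refers only to the average of the ratios.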

Quantitative analysis of the expression of genes may also be performed with GENSET
polypeptide-encoding cDNAs or fragments thereof in complementary DNA arrays as described by
Pietu et al. (1996), which disclosure is hereby incorporated by reference in its entirety. The
GENSET polynucleotides of the invention or fragments thereof are PCR amplified and spotted on
membranes. Then, mRNAs originating from various tissues or cells are labeled with radioactive
nucleotides. After hybridization and washing in controlled conditions, the hybridized mRNAs are
detected by phospho-imaging or autoradiography. Duplicate experiments are performed and a
quantitative analysis of differentially expressed mRNAs is then performed.

Alternatively, expression analysis of GENSET genes can be done through high density
nucleotide arrays as described by Lockhart et al. (1996) and Sosnowski et al. (1997), which
disclosures are hereby incorporated by reference in their entireties. Oligonucleotides of 15-50
nucleotides corresponding to sequences of a GENSET polynucleotide or fragments thereof are
synthesized directly on the chip (Lockhart et al., supra) or synthesized and then addressed to the
chip (Sosnowski et al., supra). Preferably, the oligonucleotides are about 20 nucleotides in length.
cDNA probes labeled with an appropriate compound, such as biotin, digoxigenin or fluorescent dye,
are synthesized from the appropriate mRNA population and then randomly fragmented to an
average size of 50 to 100 nucleotides. The said probes are then hybridized to the chip. After
washing as described in Lockhart et al. (supra) and application of different electric fields
(Sosnowski et al., supra), the dyes or labeling compounds are detected and quantified. Duplicate
hybridizations are performed. Comparative analysis of the intensity of the signal originating from
cDNA probes on the same target oligonucleotide in different cDNA samples indicates a differential
expression of the GENSET polypeptide-encoding mRNA.

Uses of GENSET gene expression data 

Once the expression levels and patterns of a GENSET polypeptide-encoding mRNA have
been determined using any technique known to those skilled in the art, in particular those described
in the section entitled "Evaluation of Expression Levels and Patterns of GENSET
polypeptide-encoding mRNAs", or using the instant disclosure, this information may be used to design
GENSET gene specific markers for detection, identification, screening and diagnosis purposes as
well as to design DNA constructs with an expression pattern similar to a GENSET gene expression
pattern.

Detection of GENSET polypeptide expression and/or biological activity

The invention further relates to methods of detection of GENSET polypeptide expression
and/or biological activity in a biological sample using the polynucleotide and polypeptide sequences
described herein. Such methods can be used, for example, as a screen for normal or abnormal
GENSET polypeptide expression and/or biological activity and, thus, can be used diagnostically.
The biological sample for use in the methods of the present invention includes a suitable sample
from, for example, a mammal, particularly a human. For example, the sample can be obtained from
tissues or cell lines having the same origin as tissues or cell lines in which the polypeptide is known
to be expressed, e.g. using data from Tables III, IV, or V.

Detection of GENSET polypeptides 

The invention further relates to methods of detection of GENSET polypeptide or encoding
polynucleotides in a sample using the sequences described herein and any techniques known to
those skilled in the art. For example, a labeled polynucleotide probe having all or a functional
portion of the nucleotide sequence of a GENSET polypeptide-encoding polynucleotide can be used
in a method to detect a GENSET polypeptide-encoding polynucleotide in a sample. In one
embodiment, the sample is treated to render the polynucleotides in the sample available for
hybridization to a polynucleotide probe, which can be DNA or RNA. The resulting treated sample
is combined with a labeled polynucleotide probe having all or a portion of the nucleotide sequence
of the GENSET polypeptide-encoding cDNA or genomic sequence, under conditions appropriate
for hybridization of complementary sequences to occur. Detection of hybridization of
polynucleotides from the sample with the labeled nucleic acid probe indicates the presence of GENSET
polypeptide-encoding polynucleotides in a sample. The presence of GENSET
polypeptide-encoding mRNA is indicative of GENSET polypeptide-encoding gene expression.

Consequently, the invention comprises methods for detecting the presence of a
polynucleotide comprising a nucleotide sequence selected from a group consisting of the sequences
of SEQ ID NOs:1-169, 339-455, 561-784, the sequences of clone inserts of the deposited clone
pool, sequences fully complementary thereto, fragments and variants thereof in a sample. In a first
embodiment, said method comprises the following steps:

a) bringing into contact said sample and a nucleic acid probe or a plurality of nucleic acid
probes which hybridize to said selected nucleotide sequence; and

b) detecting the hybrid complex formed between said probe or said plurality of probes and
said polynucleotide.

In a preferred embodiment of the above detection method, said nucleic acid probe or said
plurality of nucleic acid probes is labeled with a detectable molecule. In another preferred
embodiment of the above detection method, said nucleic acid probe or said plurality of nucleic acid
probes has been immobilized on a substrate. In still another preferred embodiment, said nucleic
acid probe or said plurality of nucleic acid probes has a sequence comprised in a sequence
complementary to said selected sequence.

In a second embodiment, said method comprises the steps of: 

a) contacting said sample with amplification reaction reagents comprising a pair of
amplification primers located on either side of the region of said nucleotide sequence to be
amplified;

b) performing an amplification reaction to synthesize amplification products containing said 
region of said selected nucleotide sequence; and 

c) detecting said amplification products. 
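
The requirement in step a) that the two primers lie on either side of the region to be amplified can be checked in silico with a sketch such as the following. It assumes exact primer matches on a single-stranded template given in the 5' to 3' direction; the function names are hypothetical, and real primer design would also consider mismatches, melting temperature and secondary structure.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence."""
    return sequence.upper().translate(COMPLEMENT)[::-1]


def predicted_amplicon(template, forward_primer, reverse_primer):
    """Return the region bounded by the primer pair, or None if it cannot form.

    template:       sense-strand sequence, 5' to 3'
    forward_primer: primer identical to the sense strand, 5' to 3'
    reverse_primer: primer complementary to the sense strand, 5' to 3'
    """
    template = template.upper()
    forward_primer = forward_primer.upper()

    start = template.find(forward_primer)
    if start == -1:
        return None  # forward primer does not anneal

    # The reverse primer anneals where its reverse complement appears on the
    # sense strand, downstream of the forward primer.
    reverse_site = reverse_complement(reverse_primer)
    end = template.find(reverse_site, start + len(forward_primer))
    if end == -1:
        return None  # primers do not flank a common region

    return template[start:end + len(reverse_site)]
```

A non-empty result indicates that the primer pair flanks a region of the selected sequence, and its length gives the expected size of the amplification product to be detected in step c).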

In a preferred embodiment of the above detection method, when the polynucleotide to be
amplified is an RNA molecule, preliminary reverse transcription and synthesis of a second cDNA
strand are necessary to provide a DNA template to be amplified. In another preferred embodiment
of the above detection method, the amplification product is detected by hybridization with a labeled
probe having a sequence which is complementary to the amplified region. In still another preferred
embodiment, at least one of said amplification primers has a sequence comprised in said selected
sequence or in the sequence complementary to said selected sequence.

Alternatively, a method of detecting GENSET polypeptide expression in a test sample can
be accomplished using any product which binds to a GENSET polypeptide of the present invention
or a portion of a GENSET polypeptide. Such products may be antibodies, binding fragments of
antibodies, polypeptides able to bind specifically to GENSET polypeptides or fragments thereof,
including GENSET polypeptide agonists and antagonists. Detection of specific binding to the
antibody indicates the presence of a GENSET polypeptide in the sample (e.g., ELISA).

Consequently, the invention is also directed to a method for detecting specifically the
presence of a GENSET polypeptide according to the invention in a biological sample, said method
comprising the steps of:

a) bringing into contact said biological sample with a product able to bind to a polypeptide
of the invention or fragments thereof;

b) allowing said product to bind to said polypeptide to form a complex; and

c) detecting said complex.

In a preferred embodiment of the above detection method, the product is an antibody. In a 
more preferred embodiment, said antibody is labeled with a detectable molecule. In another more 
preferred embodiment of the above detection method, said antibody has been immobilized on a 
substrate. 

In addition, the invention also relates to methods of determining whether a GENSET gene
product (e.g. a polynucleotide or polypeptide) is present or absent in a biological sample, said
methods comprising the steps of:

a) obtaining said biological sample from a human or non-human animal, preferably a
mammal;

b) contacting said biological sample with a product able to bind to a GENSET polypeptide
or encoding polynucleotide of the invention; and

c) determining the presence or absence of said GENSET polypeptide-encoding gene 
product in said biological sample. 

The present invention also relates to kits that can be used in the detection of GENSET
polypeptide-encoding gene expression products. The kit can comprise a compound that specifically
binds a GENSET polypeptide (e.g. binding proteins, antibodies or binding fragments thereof (e.g.
F(ab')2 fragments)) or a GENSET polypeptide-encoding mRNA (e.g. a complementary probe or
primer), for example, disposed within a container means. The kit can further comprise ancillary
reagents, including buffers and the like.

Detection of GENSET polypeptide biological activity

The invention further includes methods of detecting specifically a GENSET polypeptide
biological activity, and of identifying compounds capable of modulating the activity of a GENSET
polypeptide. Assessing the GENSET polypeptide biological activity may be performed by the
detection of a change in any cellular property associated with the GENSET polypeptide, using a
variety of techniques, including those described herein. To identify modulators of the polypeptides,
a control is preferably used. For example, a control sample includes all of the same reagents but
lacks the compound or agent being assessed; it is treated in the same manner as the test sample. A
number of potentially assayable biological activities for many of the herein-described proteins are
described supra, under the heading, "Uses of polypeptides of the invention."

The present invention also relates to kits that can be used in the detection of GENSET
polypeptide biological activity. The kit can comprise, e.g. substrates for GENSET polypeptides,
GENSET-binding compounds, antibodies to GENSET polypeptides, etc., for example, disposed
within a container means. The kit can further comprise ancillary reagents, including buffers and the
like.

Identification of a specific context of GENSET polypeptide-encoding gene expression

When the expression pattern of a GENSET polypeptide-encoding mRNA shows that a
GENSET polypeptide-encoding gene is specifically expressed in a given context, probes and
primers specific for this gene as well as antibodies binding to the GENSET polypeptide-encoding
polynucleotide may then be used as markers for the specific context. Examples of specific contexts
are: specific expression in a given tissue/cell or tissue/cell type (see, e.g., Tables III-V), expression
at a given stage of development of a process such as embryo development or disease development,
or specific expression in a given organelle. Such primers, probes, and antibodies are useful
commercially to identify tissues/cells/organelles of unknown origin, for example, forensic samples or
differentiated tumor tissue that has metastasized to foreign bodily sites, or to differentiate different
tissue types in a tissue cross-section using any technique known to those skilled in the art including
in situ PCR or immunochemistry, for example.

For example, the cDNAs and proteins of the sequence listing, and fragments thereof, may be
used to distinguish human tissues/cells from non-human tissues/cells and to distinguish between
human tissues/cells/organelles that do and do not express the polynucleotides comprising the
cDNAs. By knowing the expression pattern of a given GENSET polypeptide, either through
routine experimentation or by using the instant disclosure, the polynucleotides and polypeptides of
the present invention may be used in methods of determining the identity of an unknown tissue/cell
sample/organelle. As part of determining the identity of an unknown tissue/cell sample/organelle,
the polynucleotides and polypeptides of the present invention may be used to determine what the
unknown tissue/cell sample is and what the unknown sample is not. For example, if a cDNA is
expressed in a particular tissue/cell type/organelle, and the unknown tissue/cell sample/organelle
does not express the cDNA, it may be inferred that the unknown tissue/cells are either not human or
not the same human tissue/cell type/organelle as that which expresses the cDNA. These methods of
determining tissue/cell/organelle identity are based on methods which detect the presence or
absence of the mRNA (or corresponding cDNA) in a tissue/cell sample using methods well known in
the art (e.g., hybridization, PCR-based methods, immunoassays, immunochemistry, ELISA).
Examples of such techniques are described in more detail below. Therefore, the invention
encompasses uses of the polynucleotides and polypeptides of the invention as tissue markers. In a
preferred embodiment, polynucleotides preferentially expressed in given tissues as indicated in
Tables III-V and polypeptides encoded by such polynucleotides are used for this purpose. The
invention also encompasses uses of polypeptides of the invention as organelle markers.
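
The inference described above, in which the presence or absence of a marker rules tissue types
in or out, can be sketched as a simple set computation. The Python fragment below is only an
illustration under stated assumptions: the marker names, the tissue names and the function
candidate_tissues are hypothetical and are not taken from Tables III-V, and each detection is
assumed to give a clean yes/no result.

```python
def candidate_tissues(expression, observed):
    """Narrow down the possible identity of an unknown sample.

    expression: dict mapping a marker name to the set of tissues known to express it
    observed:   dict mapping a marker name to True (detected) or False (not detected)
    """
    all_tissues = set().union(*expression.values())
    candidates = set(all_tissues)
    for marker, detected in observed.items():
        expressing = expression[marker]
        if detected:
            candidates &= expressing   # sample must be one of the expressing tissues
        else:
            candidates -= expressing   # sample cannot be one of the expressing tissues
    return candidates


# Hypothetical expression patterns, for illustration only.
expression = {
    "marker_1": {"brain", "liver"},
    "marker_2": {"liver", "kidney"},
}
result = candidate_tissues(expression, {"marker_1": True, "marker_2": False})
```

With these hypothetical patterns, a sample that expresses marker_1 but not marker_2 is narrowed
to a single candidate tissue. An empty result would suggest that the sample is not any of the
tissues covered by the markers used.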

Consequently, the present invention encompasses methods of identification of a tissue/cell
type/subcellular compartment, wherein said method includes the steps of:

a) contacting a biological sample whose identity is to be assayed with a product able to
bind a GENSET gene product; and
b) determining whether a GENSET gene product is expressed in said biological
sample.

Products that are able to bind specifically to a GENSET gene product, namely a GENSET
polypeptide or a GENSET polypeptide-encoding mRNA, include GENSET polypeptide binding
proteins, antibodies or binding fragments thereof (e.g. F(ab')2 fragments), as well as GENSET
polynucleotide complementary probes and primers.

Step b) may be performed using any detection method known to those skilled in the art 
including those disclosed herein, especially in the section entitled "Detection of GENSET 
polypeptide expression and/or biological activity". 

Identification of Tissue Types or Cell Species by Means of Labeled Tissue Specific Antibodies 

Identification of specific tissues is accomplished by the visualization of tissue specific
antigens by means of antibody preparations which are conjugated, directly (e.g., green fluorescent
protein) or indirectly, to a detectable marker. Selected labeled antibody species bind to their specific
antigen binding partner in tissue sections, cell suspensions, or in extracts of soluble proteins from a
tissue sample to provide a pattern for qualitative or semi-qualitative interpretation.

Antisera for these procedures must have a potency exceeding that of the native preparation,
and for that reason, antibodies are concentrated to a mg/ml level by isolation of the gamma globulin
fraction, for example, by ion-exchange chromatography or by ammonium sulfate fractionation.
Also, to provide the most specific antisera, unwanted antibodies, for example to common proteins,
must be removed from the gamma globulin fraction, for example by means of insoluble
immunoabsorbents, before the antibodies are labeled with the marker. Either monoclonal antibodies
or heterologous antisera are suitable for either procedure.

A. Immunohistochemical Techniques 

Purified, high-titer antibodies, prepared as described above, are conjugated to a detectable
marker, as described, for example, by Fudenberg (1980) or Rose et al. (1980), which disclosures
are hereby incorporated by reference in their entireties.

A fluorescent marker, either fluorescein or rhodamine, is preferred, but antibodies can also
be labeled with an enzyme that supports a color-producing reaction with a substrate, such as
horseradish peroxidase. Markers can be added to tissue-bound antibody in a second step, as
described below. Alternatively, the specific anti-tissue antibodies can be labeled with ferritin or
other electron-dense particles, and localization of the ferritin-coupled antigen-antibody complexes
achieved by means of an electron microscope. In yet another approach, the antibodies are
radiolabeled with, for example, 125I, and detected by overlaying the antibody-treated preparation
with photographic emulsion. Preparations to carry out the procedures can comprise monoclonal or
polyclonal antibodies to a single protein or peptide identified as specific to a tissue type, for
example, brain tissue, or antibody preparations to several antigenically distinct tissue specific
antigens can be used in panels, independently or in mixtures, as required. Tissue sections and cell
suspensions are prepared for immunohistochemical examination according to common histological
techniques. Multiple cryostat sections (about 4 µm, unfixed) of the unknown tissue and known
control are mounted, and each slide is covered with different dilutions of the antibody preparation.
Sections of known and unknown tissues should also be treated with preparations to provide a
positive control, a negative control, for example, pre-immune sera, and a control for non-specific
staining, for example, buffer. Treated sections are incubated in a humid chamber for 30 min at
room temperature, rinsed, then washed in buffer for 30-45 min. Excess fluid is blotted away, and
the marker developed. If the tissue specific antibody was not labeled in the first incubation, it can
be labeled at this time in a second antibody-antibody reaction, for example, by adding fluorescein-
or enzyme-conjugated antibody against the immunoglobulin class of the antiserum-producing
species, for example, fluorescein labeled antibody to mouse IgG. Such labeled sera are
commercially available. The antigen found in the tissues by the above procedure can be quantified
by measuring the intensity of color or fluorescence on the tissue section, and calibrating that signal
using appropriate standards.
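
The calibration step mentioned above can be illustrated by a minimal sketch. It assumes, purely
for illustration, that the standards are available as pairs of known antigen amount and measured
signal and that linear interpolation between neighbouring standards is adequate; the function
name quantify_antigen and the numerical values are hypothetical.

```python
def quantify_antigen(signal, standards):
    """Estimate an antigen amount from a measured signal by linear interpolation.

    standards: list of (known_amount, measured_signal) pairs from calibration samples
    """
    points = sorted(standards, key=lambda p: p[1])   # order by measured signal
    for (a0, s0), (a1, s1) in zip(points, points[1:]):
        if s0 <= signal <= s1:
            if s1 == s0:                             # two standards with identical signal
                return (a0 + a1) / 2
            return a0 + (a1 - a0) * (signal - s0) / (s1 - s0)
    raise ValueError("signal lies outside the calibrated range")


# Hypothetical standards: (amount, fluorescence intensity), arbitrary units.
standards = [(0.0, 0.0), (10.0, 100.0), (20.0, 300.0)]
estimate = quantify_antigen(200.0, standards)
```

Signals outside the range spanned by the standards are rejected rather than extrapolated, since
the calibration says nothing about the response beyond that range.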

B. Identification of Tissue Specific Soluble Proteins 

The visualization of tissue specific proteins and identification of unknown tissues from that
procedure is carried out using the labeled antibody reagents and detection strategy as described for
immunohistochemistry; however, the sample is prepared according to an electrophoretic technique
to distribute the proteins extracted from the tissue in an orderly array on the basis of molecular
weight for detection. A tissue sample is homogenized using a Virtis apparatus; cell suspensions are
disrupted by Dounce homogenization or osmotic lysis, using detergents in either case as required to
disrupt cell membranes, as is the practice in the art. Insoluble cell components such as nuclei,
microsomes, and membrane fragments are removed by ultracentrifugation, and the soluble
protein-containing fraction concentrated if necessary and reserved for analysis. A sample of the soluble
protein solution is resolved into individual protein species by conventional SDS polyacrylamide
electrophoresis as described, for example, by Davis et al., Section 19-2 (1986), using a range of
amounts of polyacrylamide in a set of gels to resolve the entire molecular weight range of proteins
to be detected in the sample. A size marker is run in parallel for purposes of estimating molecular
weights of the constituent proteins. Sample size for analysis is a convenient volume of from 5 to 55
µl, containing from about 1 to 100 µg protein. An aliquot of each of the resolved proteins is
transferred by blotting to a nitrocellulose filter paper, a process that maintains the pattern of
resolution. Multiple copies are prepared. The procedure, known as Western Blot Analysis, is well
described in Davis et al. (1986), Section 19-3. One set of nitrocellulose blots is stained with
Coomassie Blue dye to visualize the entire set of proteins for comparison with the antibody bound
proteins. The remaining nitrocellulose filters are then incubated with a solution of one or more
specific antisera to tissue specific proteins prepared as described herein. In this procedure, as in
procedure A above, appropriate positive and negative sample and reagent controls are run.
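
The use of a size marker to estimate molecular weights can be sketched as follows. The sketch
relies on the common approximation, which is an assumption not stated in the text above, that
the logarithm of molecular weight varies linearly with migration distance in an SDS gel. The
function names and the marker values are hypothetical.

```python
import math


def fit_size_marker(markers):
    """Least-squares fit of log10(molecular weight) against migration distance.

    markers: list of (migration_distance, molecular_weight) pairs for the size marker
    Returns (slope, intercept) of the fitted line.
    """
    xs = [distance for distance, _ in markers]
    ys = [math.log10(weight) for _, weight in markers]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_weight(distance, slope, intercept):
    """Estimate the molecular weight of a band from its migration distance."""
    return 10 ** (intercept + slope * distance)


# Hypothetical marker bands: (distance in cm, molecular weight in kDa).
markers = [(1.0, 100.0), (2.0, 50.0), (3.0, 25.0)]
slope, intercept = fit_size_marker(markers)
unknown_band = estimate_weight(2.5, slope, intercept)
```

Estimates are only meaningful for bands that migrate within the range covered by the marker,
which is one reason for using a set of gels of different polyacrylamide content.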

In either procedure A or B, a detectable label can be attached to the primary tissue
antigen-primary antibody complex according to various strategies and permutations thereof. In a
straightforward approach, the primary specific antibody can be labeled; alternatively, the unlabeled
complex can be bound by a labeled secondary anti-IgG antibody. In other approaches, either the
primary or secondary antibody is conjugated to a biotin molecule, which can, in a subsequent step,
bind an avidin conjugated marker. According to yet another strategy, enzyme labeled or radioactive
protein A, which has the property of binding to any IgG, is bound in a final step to either the
primary or secondary antibody. The visualization of tissue specific antigen binding at levels above
those seen in control tissues to one or more tissue specific antibodies, prepared from the gene
sequences identified from cDNA sequences, can identify tissues of unknown origin, for example,
forensic samples, or differentiated tumor tissue that has metastasized to foreign bodily sites.

Screening and diagnosis of abnormal GENSET polypeptide expression and/or biological activity

Moreover, antibodies and/or primers specific for GENSET polypeptide expression may also
be used to identify abnormal GENSET polypeptide expression and/or biological activity, and
subsequently to screen and/or diagnose disorders associated with abnormal GENSET polypeptide
expression. For example, a particular disease may result from lack of expression, overexpression,
or underexpression of a GENSET polypeptide-encoding mRNA. By comparing mRNA expression
patterns and quantities in samples taken from healthy individuals with those from individuals
suffering from a particular disorder, genes responsible for this disorder may be identified. Primers,
probes and antibodies specific for this GENSET polypeptide may then be used to develop kits for
screening and diagnosis of a disorder in which the gene of interest is specifically expressed or in
which its expression is specifically dysregulated, i.e. underexpressed or overexpressed.

Screening for specific disorders 

The present invention also relates to methods and uses of GENSET polypeptides for
identifying individuals having elevated or reduced levels of GENSET polypeptides, which
individuals are likely to benefit from therapies to suppress or enhance GENSET
polypeptide-encoding gene expression, respectively. One example of such methods and uses comprises the
steps of:

a) obtaining from a mammal a biological sample; 




b) detecting the presence in said sample of a GENSET polypeptide-encoding gene product 
(mRNA or protein); 

c) comparing the amount of said GENSET polypeptide-encoding gene product present in
said sample with that of a control sample; and

d) determining whether said mammal has a reduced or elevated level of
GENSET gene expression compared to the control sample.
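
Steps c) and d) amount to comparing a measured amount against a control and deciding whether
the level is reduced or elevated. A minimal sketch of that comparison follows. The two-fold
threshold is an arbitrary assumption chosen for illustration and is not specified in the present
description; the function name classify_expression is likewise hypothetical.

```python
def classify_expression(sample_level, control_level, fold_threshold=2.0):
    """Classify a sample's gene product level relative to a control sample.

    sample_level, control_level: amounts of gene product (mRNA or protein), same units
    fold_threshold: assumed fold change required to call a level elevated or reduced
    """
    if sample_level < 0 or control_level <= 0:
        raise ValueError("levels must be non-negative and the control must be positive")
    ratio = sample_level / control_level
    if ratio >= fold_threshold:
        return "elevated"
    if ratio <= 1.0 / fold_threshold:
        return "reduced"
    return "unchanged"


status = classify_expression(sample_level=10.0, control_level=4.0)
```

In practice the threshold would have to be chosen from the variability observed in the control
population rather than fixed in advance.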

A biological sample from a subject affected by, or at risk of developing, any disease or
condition associated with a GENSET polypeptide can be screened for the presence of increased or
decreased levels of GENSET gene product, relative to a normal population (standard or control),
with an increased or decreased level of the GENSET polypeptide relative to the normal population
being indicative of predisposition to or a present indication of the disease or condition, or any
symptom associated with the disease or condition. Such individuals would be candidates for
therapies, e.g., treatment with pharmaceutical compositions comprising the GENSET polypeptide, a
polynucleotide encoding the GENSET polypeptide, or any other compound that affects the
expression or activity of the GENSET polypeptide. Generally, the identification of elevated levels
of the GENSET polypeptide in a patient would be indicative of an individual that would benefit
from treatment with agents that suppress GENSET polypeptide expression or activity, and the
identification of low levels of the GENSET polypeptide in a patient would be indicative of an
individual that would benefit from agents that induce GENSET expression or activity.

Biological samples suitable for use in this method include any biological fluids, including,
but not limited to, blood, saliva, milk, and urine. Tissue samples (e.g. biopsies) can also be used in
the method of the invention, including samples derived from any tissue associated with GENSET
gene expression (see, e.g. Tables III-V). Cell cultures or cell extracts derived, for example, from
tissue biopsies can also be used. The detection step of the present method can be performed using
standard protocols for protein/mRNA detection. Examples of suitable protocols include Northern
blot analysis, immunoassays (e.g. RIA, Western blots, immunohistochemical analyses), and PCR.

Thus, the present invention further relates to methods and uses of GENSET polypeptides
for identifying individuals or non-human animals at increased risk of developing, or presently
having, certain diseases/disorders associated with abnormal GENSET polypeptide expression or
biological activity. One example of such methods comprises the steps of:

a) obtaining from a human or non-human mammal a biological sample; 

b) detecting the presence in said sample of a GENSET gene product (mRNA or protein); 

c) comparing the amount of said GENSET gene product present in said sample with that of 
a control sample; and 

d) determining whether said human or non-human mammal is at increased risk of
developing, or presently has, a disease or disorder.

In preferred embodiments, the biological sample is taken from animals presenting any
symptom associated with any disease or condition associated with a GENSET gene product. In
accordance with this method, the presence in the sample of altered (e.g. increased or decreased)
levels of the GENSET product indicates that the subject is predisposed to the disease or condition.
Biological samples suitable for use in this method include biological fluids including, but not
limited to, blood, saliva, milk, and urine. Tissue samples (e.g. biopsies) can also be used in the
method of the invention, including samples derived from any of the tissues listed in Tables III-V.
Cell cultures or cell extracts derived, for example, from tissue biopsies can also be used.

The diagnostic methodologies described herein are applicable to both humans and 
non-human mammals. 

Detection of GENSET gene mutations 

The invention also encompasses methods and uses of GENSET polynucleotides to detect
mutations in GENSET polynucleotides of the invention. Such methods may advantageously be
used to detect mutations occurring in GENSET genes and preferably in their regulatory regions.
When a mutation has been proven to be associated with a disease, the detection of such a mutation
may be used for screening and diagnosis purposes.

In one embodiment of the oligonucleotide arrays of the invention, an oligonucleotide probe
matrix may advantageously be used to detect mutations occurring in GENSET genes and preferably
in their regulatory regions. For this particular purpose, probes are specifically designed to have a
nucleotide sequence allowing their hybridization to the genes that carry known mutations (either by
deletion, insertion or substitution of one or several nucleotides). By known mutations is meant
mutations in the GENSET genes that have been identified according to, for example, the technique
used by Huang et al. (1996) or Samson et al. (1996), which disclosures are hereby incorporated by
reference in their entireties.

Another technique that is used to detect mutations in GENSET genes is the use of a
high-density DNA array. Each oligonucleotide probe constituting a unit element of the high-density
DNA array is designed to match a specific subsequence of a GENSET genomic DNA or cDNA.
Thus, an array consisting of oligonucleotides complementary to subsequences of the target gene
sequence is used to determine the identity of the target sequence with the wild-type gene sequence,
measure its amount, and detect differences between the target sequence and the reference wild-type
gene sequence of the GENSET gene. One such design, termed a 4L tiled array, implements, for each
position of the target, a set of four probes (A, C, G, T), preferably 15-nucleotide oligomers. In each
set of four probes, the perfect complement will hybridize more strongly than mismatched probes.
Consequently, a nucleic acid target of length L is scanned for mutations with a tiled array
containing 4L probes, the whole probe set containing all the possible mutations in the known
wild-type reference sequence. The hybridization signals of the 15-mer probe set tiled array are
perturbed by a single base change in the target sequence. As a consequence, there is a
characteristic loss of signal or a "footprint" for the probes flanking a mutation position. This
technique was described by Chee et al. in 1996, which disclosure
is hereby incorporated by reference in its entirety.
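
The 4L tiling principle can be illustrated with a short sketch. It is a simplified model under
stated assumptions and not the method of Chee et al.: hybridization is reduced to exact
complementarity (a real array gives graded signal intensities), only positions with full flanking
sequence are tiled, and the reference sequence and function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def build_tiled_array(reference, probe_len=15):
    """Return {position: {base: probe}} with four probes per interrogated position.

    Each probe is complementary to the reference window centered on the position,
    with the central base replaced by A, C, G or T in turn.
    """
    half = probe_len // 2
    array = {}
    for pos in range(half, len(reference) - half):
        window = reference[pos - half:pos + half + 1]
        array[pos] = {
            base: reverse_complement(window[:half] + base + window[half + 1:])
            for base in "ACGT"
        }
    return array


def call_base(target, array, pos, probe_len=15):
    """Return the base whose probe perfectly complements the target at pos, or None."""
    half = probe_len // 2
    window = target[pos - half:pos + half + 1]
    for base, probe in array[pos].items():
        if reverse_complement(probe) == window:
            return base
    return None  # no perfect match: a nearby mismatch, i.e. the "footprint"


reference = "ACGTACGTACGTACGTACGT"               # hypothetical wild-type sequence
target = reference[:10] + "T" + reference[11:]   # single base change at position 10
array = build_tiled_array(reference)
mutant_call = call_base(target, array, 10)
```

In this model the mutated position is called by the probe carrying the substituted base, while
the neighbouring positions find no perfectly matching probe, which corresponds to the loss of
signal flanking a mutation.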

Construction of DNA constructs with a GENSET gene expression pattern



In addition, characterization of the spatial and temporal expression patterns and expression
levels of GENSET polypeptide-encoding mRNAs is also useful for constructing expression vectors
capable of producing a desired level of gene product in a desired spatial or temporal manner, as
discussed below.

DNA Constructs That Direct Temporal And Spatial GENSET Gene Expression In Recombinant Cell 
Hosts And In Transgenic Animals. 

In order to study the physiological and phenotypic consequences of a lack of synthesis of a
GENSET polypeptide, both at the cellular level and at the multicellular organism level, the
invention also encompasses DNA constructs and recombinant vectors enabling a conditional
expression of a specific allele of a GENSET polypeptide-encoding genomic sequence or cDNA and
also of a copy of this genomic sequence or cDNA harboring substitutions, deletions, or additions of
one or more bases with respect to a nucleotide sequence selected from the group consisting of
sequences of SEQ ID NOs:1-169, 339-455, 561-784 and sequences of clone inserts of the deposited
clone pool, or a fragment thereof, these base substitutions, deletions or additions being located
either in an exon, an intron or a regulatory sequence, but preferably in the 5'-regulatory sequence or
in an exon of the GENSET polypeptide-encoding genomic sequence or within the GENSET
polypeptide-encoding cDNA.

A first preferred DNA construct is based on the tetracycline resistance operon tet from the
E. coli transposon Tn10 for controlling the GENSET gene expression, such as described by Gossen et
al. (1992, 1995) and Furth et al. (1994), which disclosures are hereby incorporated by reference in
their entireties. Such a DNA construct contains seven tet operator sequences from Tn10 (tetop) that
are fused to either a minimal promoter or a 5'-regulatory sequence of the GENSET gene, said
minimal promoter or said GENSET polynucleotide regulatory sequence being operably linked to a
polynucleotide of interest that codes either for a sense or an antisense oligonucleotide or for a
polypeptide, including a GENSET polypeptide, or a peptide fragment thereof. This DNA construct
is functional as a conditional expression system for the nucleotide sequence of interest when the
same cell also comprises a nucleotide sequence coding for either the wild type (tTA) or the mutant
(rTA) repressor fused to the activating domain of viral protein VP16 of herpes simplex virus, placed
under the control of a promoter, such as the HCMVIE1 enhancer/promoter or the MMTV-LTR.
Indeed, a preferred DNA construct of the invention comprises both the polynucleotide containing the
tet operator sequences and the polynucleotide containing a sequence coding for the tTA or the rTA
repressor. In a specific embodiment, the conditional expression DNA construct contains the
sequence encoding the mutant tetracycline repressor rTA; in this case, the expression of the
polynucleotide of interest is silent in the absence of tetracycline and induced in its presence.
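
The conditional behavior of this expression system can be summarized as a small truth table.
The sketch below is illustrative only. The rTA behavior is taken from the paragraph above; the
tTA behavior (expression in the absence of tetracycline, silencing in its presence) is not stated
in the present text and is assumed here from the general description of the system in the cited
Gossen et al. references. The function name is hypothetical.

```python
def polynucleotide_expressed(transactivator, tetracycline_present):
    """Return True if the polynucleotide of interest is expected to be expressed.

    transactivator: "rTA" (mutant repressor fusion) or "tTA" (wild type repressor fusion)
    tetracycline_present: whether tetracycline is present in the cell's environment
    """
    if transactivator == "rTA":
        return tetracycline_present        # silent without tetracycline, induced with it
    if transactivator == "tTA":
        return not tetracycline_present    # assumed: active without tetracycline
    raise ValueError("transactivator must be 'rTA' or 'tTA'")


states = {
    (name, tet): polynucleotide_expressed(name, tet)
    for name in ("rTA", "tTA")
    for tet in (False, True)
}
```

The choice between the two fusions therefore determines whether tetracycline is used to switch
the gene of interest on or off.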

DNA Constructs Allowing Homologous Recombination: Replacement Vectors

A second preferred DNA construct will comprise, from 5'-end to 3'-end: (a) a first
nucleotide sequence that is found in the GENSET polypeptide-encoding genomic sequence; (b) a
nucleotide sequence comprising a positive selection marker, such as the marker for neomycin
resistance (neo); and (c) a second nucleotide sequence that is found in the GENSET
polypeptide-encoding genomic sequence, and is located on the genome downstream of the first GENSET
polypeptide-encoding nucleotide sequence (a).

In a preferred embodiment, this DNA construct also comprises a negative selection marker
located upstream of the nucleotide sequence (a) or downstream from the nucleotide sequence (c).
Preferably, the negative selection marker comprises the thymidine kinase (tk) gene (Thomas et al.,
1986), the hygromycin beta gene (Te Riele et al., 1990), the hprt gene (Van der Lugt et al., 1991;
Reid et al., 1990) or the Diphtheria toxin A fragment (Dt-A) gene (Nada et al., 1993; Yagi et
al., 1990), which disclosures are hereby incorporated by reference in their entireties. Preferably, the
positive selection marker is located within a GENSET exon sequence so as to interrupt the sequence
encoding a GENSET polypeptide. These replacement vectors are described, for example, by
Thomas et al. (1986; 1987), Mansour et al. (1988) and Koller et al. (1992).

The first and second nucleotide sequences (a) and (c) may be indifferently located within a
GENSET polypeptide-encoding regulatory sequence, an intronic sequence, an exon sequence or a
sequence containing both regulatory and/or intronic and/or exon sequences. The size of the
nucleotide sequences (a) and (c) ranges from 1 to 50 kb, preferably from 1 to 10 kb, more
preferably from 2 to 6 kb and most preferably from 2 to 4 kb.

DNA Constructs Allowing Homologous Recombination: Cre-LoxP System. 

These new DNA constructs make use of the site specific recombination system of the P1
phage. The P1 phage possesses a recombinase called Cre which interacts specifically with a 34
base pair loxP site. The loxP site is composed of two palindromic sequences of 13 bp separated by
an 8 bp conserved sequence (Hoess et al., 1986), which disclosure is hereby incorporated by
reference in its entirety. The recombination by the Cre enzyme between two loxP sites having an
identical orientation leads to the deletion of the DNA fragment located between them.
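
The 13 bp + 8 bp + 13 bp structure of the loxP site and the deletion between two sites of
identical orientation can be illustrated as follows. The 34 bp sequence used below is the commonly
cited wild-type loxP sequence; it is not given in the present text and should be checked against
Hoess et al. (1986). The function cre_excise is a hypothetical string-level model that searches a
single strand only and ignores the enzymology.

```python
LEFT_ARM = "ATAACTTCGTATA"     # 13 bp recombinase-binding element
SPACER = "GCATACAT"            # 8 bp asymmetric core, which gives the site its orientation
RIGHT_ARM = "TATACGAAGTTAT"    # 13 bp element, reverse complement of the left arm
LOXP = LEFT_ARM + SPACER + RIGHT_ARM


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def cre_excise(dna):
    """Delete the fragment between the first two same-orientation loxP sites.

    One loxP site is left behind. The sequence is returned unchanged if fewer
    than two sites in this orientation are found.
    """
    first = dna.find(LOXP)
    if first == -1:
        return dna
    second = dna.find(LOXP, first + len(LOXP))
    if second == -1:
        return dna
    return dna[:first + len(LOXP)] + dna[second + len(LOXP):]


# Hypothetical construct: flanking DNA, loxP, fragment to excise, loxP, flanking DNA.
floxed = "GGGG" + LOXP + "AAAATTTT" + LOXP + "CCCC"
excised = cre_excise(floxed)
```

Because the 8 bp core is asymmetric, a site placed in the opposite orientation reads differently
on the searched strand and is not treated as a partner for excision in this model.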

The Cre-loxP system used in combination with a homologous recombination technique was
first described by Gu et al. (1993, 1994), which disclosures are hereby incorporated by
reference in their entireties. Briefly, a nucleotide sequence of interest to be inserted in a targeted
location of the genome harbors at least two loxP sites in the same orientation and located at the
respective ends of a nucleotide sequence to be excised from the recombinant genome. The excision
event requires the presence of the recombinase (Cre) enzyme within the nucleus of the recombinant
cell host. The recombinase enzyme may be supplied at the desired time by: (a) incubating the
recombinant cell hosts in a culture medium containing this enzyme, injecting the Cre enzyme
directly into the desired cell, such as described by Araki et al. (1995), which disclosure is hereby
incorporated by reference in its entirety, or lipofection of the enzyme into the cells, such as
described by Baubonis et al. (1993), which disclosure is hereby incorporated by reference in its
entirety; (b) transfecting the cell host with a vector comprising the Cre coding sequence operably
linked to a promoter functional in the recombinant cell host, which promoter is optionally
inducible, said vector being introduced in the recombinant cell host, such as described by Gu et al.
(1993) and Sauer et al. (1988), which disclosures are hereby incorporated by reference in their
entireties; or (c) introducing in the genome of the cell host a polynucleotide comprising the Cre
coding sequence operably linked to a promoter functional in the recombinant cell host, which
promoter is optionally inducible, said polynucleotide being inserted in the genome of the cell host
either by a random insertion event or a homologous recombination event, such as described by Gu et
al. (1994).

In a specific embodiment, when the vector containing the sequence to be inserted in the GENSET
gene by homologous recombination is constructed in such a way that selectable markers are flanked
by loxP sites of the same orientation, it is possible, by treatment with the Cre enzyme, to eliminate the
selectable markers while leaving the GENSET sequences of interest that have been inserted by a
homologous recombination event. Again, two selectable markers are needed: a positive selection
marker to select for the recombination event and a negative selection marker to select for the
homologous recombination event. Vectors and methods using the Cre-loxP system are described by
Zou et al. (1994), which disclosure is hereby incorporated by reference in its entirety.

Thus, a third preferred DNA construct of the invention comprises, from 5'-end to 3'-end:
(a) a first nucleotide sequence that is comprised in the GENSET genomic sequence; (b) a nucleotide
sequence comprising a polynucleotide encoding a positive selection marker, said nucleotide
sequence comprising additionally two sequences defining a site recognized by a recombinase, such
as a loxP site, the two sites being placed in the same orientation; and (c) a second nucleotide
sequence that is comprised in the GENSET genomic sequence, and is located on the genome
downstream of the first GENSET nucleotide sequence (a).

The sequences defining a site recognized by a recombinase, such as a loxP site, are
preferably located within the nucleotide sequence (b) at suitable locations bordering the nucleotide
sequence for which the conditional excision is sought. In one specific embodiment, two loxP sites
are located one on each side of the positive selection marker sequence, in order to allow its excision
at a desired time after the occurrence of the homologous recombination event.

In a preferred embodiment of a method using the third DNA construct described above, the
excision of the polynucleotide fragment bordered by the two sites recognized by a recombinase,
preferably two loxP sites, is performed at a desired time, due to the presence within the genome of
the recombinant host cell of a sequence encoding the Cre enzyme operably linked to a promoter
sequence, preferably an inducible promoter, more preferably a tissue-specific promoter sequence
and most preferably a promoter sequence which is both inducible and tissue-specific, such as
described by Gu et al. (1994).

The presence of the sequence encoding the Cre enzyme within the genome of the recombinant
cell host may result from the breeding of two transgenic animals, the first transgenic animal bearing
the GENSET-derived sequence of interest containing the loxP sites as described above and the second
transgenic animal bearing the Cre coding sequence operably linked to a suitable promoter sequence,
such as described by Gu et al. (1994).

Spatio-temporal control of the Cre enzyme expression may also be achieved with an
adenovirus based vector that contains the Cre gene, thus allowing infection of cells, or in vivo
infection of organs, for delivery of the Cre enzyme, such as described by Anton and Graham (1995)
and Kanegae et al. (1995), which disclosures are hereby incorporated by reference in their
entireties.

The DNA constructs described above may be used to introduce a desired nucleotide
sequence of the invention, preferably a GENSET genomic sequence or a GENSET cDNA sequence,
and most preferably an altered copy of a GENSET genomic or cDNA sequence, within a
predetermined location of the targeted genome, leading either to the generation of an altered copy of
a targeted gene (knock-out homologous recombination) or to the replacement of a copy of the
targeted gene by another copy sufficiently homologous to allow a homologous recombination
event to occur (knock-in homologous recombination).

Modifying GENSET polypeptide expression and/or biological activity

Modifying endogenous GENSET expression and/or biological activity is expressly 
contemplated by the present invention. 

Screening for compounds that modulate GENSET expression and/or biological activity 

The present invention further relates to compounds able to modulate GENSET expression
and/or biological activity and methods to use these compounds. Such compounds may interact with
the regulatory sequences of GENSET genes or they may interact with GENSET polypeptides
directly or indirectly.

Compounds Interacting With GENSET Regulatory Sequences 

The present invention also concerns a method for screening substances or molecules that 
are able to interact with the regulatory sequences of a GENSET gene, such as for example promoter
or enhancer sequences in untranscribed regions of the genomic DNA, as determined using any
techniques known to those skilled in the art including those described in the section entitled
"Identification of Promoters in Cloned Upstream Sequences", or such as regulatory sequences
located in untranslated regions of GENSET mRNA.

Sequences within untranscribed or untranslated regions of polynucleotides of the invention
may be identified by comparison to databases containing known regulatory sequences such as
transcription start sites, transcription factor binding sites, promoter sequences, enhancer sequences,
5'UTR and 3'UTR elements (Pesole et al., 2000;
http://igs-server.cnrs-mrs.fr/~gauthere/UTR/index.html). Alternatively, the regulatory sequences
of interest may be identified through conventional mutagenesis or deletion analyses of reporter
plasmids using, for instance, techniques described in the section entitled "Identification of
Promoters in Cloned Upstream Sequences".

Following the identification of potential GENSET regulatory sequences, proteins which 
interact with these regulatory sequences may be identified as described below. 

Gel retardation assays may be performed independently in order to screen candidate
molecules that are able to interact with the regulatory sequences of the GENSET gene, such as
described by Fried and Crothers (1981), Garner and Revzin (1981) and Dent and Latchman (1993),
the teachings of these publications being herein incorporated by reference. These techniques are
based on the principle according to which a DNA or mRNA fragment which is bound to a protein
migrates more slowly than the same unbound DNA or mRNA fragment. Briefly, the target nucleotide
sequence is labeled. Then the labeled target nucleotide sequence is brought into contact with either
a total nuclear extract from cells containing regulation factors, or with different candidate molecules
to be tested. The interaction between the target regulatory sequence of the GENSET gene and the
candidate molecule or the regulation factor is detected after gel or capillary electrophoresis through
a retardation in the migration.

Nucleic acids encoding proteins which are able to interact with the promoter sequence of 
the GENSET gene, more particularly a nucleotide sequence selected from the group consisting of
the polynucleotides of the 5' and 3' regulatory region or a fragment or variant thereof, may be
identified by using a one-hybrid system, such as that described in the booklet enclosed in the
Matchmaker One-Hybrid System kit from Clontech (Catalog Ref. no. K1603-1, the technical
teachings of which are herein incorporated by reference). Briefly, the target nucleotide sequence is
cloned upstream of a selectable reporter sequence and the resulting polynucleotide construct is
integrated in the yeast genome (Saccharomyces cerevisiae). Preferably, multiple copies of the
target sequences are inserted into the reporter plasmid in tandem. The yeast cells containing the
reporter sequence in their genome are then transformed with a library comprising fusion molecules
between cDNAs encoding candidate proteins for binding onto the regulatory sequences of the
GENSET gene and sequences encoding the activator domain of a yeast transcription factor such as
GAL4. The recombinant yeast cells are plated in a culture broth for selecting cells expressing the
reporter sequence. The recombinant yeast cells thus selected contain a fusion protein that is able to
bind onto the target regulatory sequence of the GENSET gene. Then, the cDNAs encoding the
fusion proteins are sequenced and may be cloned into expression or transcription vectors in vitro.
The binding of the encoded polypeptides to the target regulatory sequences of the GENSET gene
may be confirmed by techniques familiar to the one skilled in the art, such as gel retardation assays
or DNase protection assays.

Ligands interacting with GENSET polypeptides

For the purpose of the present invention, a ligand means a molecule, such as a protein, a
peptide, an antibody or any synthetic chemical compound, capable of binding to a GENSET protein
or one of its fragments or variants or of modulating the expression of the polynucleotide coding for
GENSET or a fragment or variant thereof.

In the ligand screening method according to the present invention, a biological sample or a 
defined molecule to be tested as a putative ligand of a GENSET protein is brought into contact with 
the corresponding purified GENSET protein, for example the corresponding purified recombinant
GENSET protein produced by a recombinant cell host as described herein, in order to form a 
complex between this protein and the putative ligand molecule to be tested. 

As an illustrative example, to study the interaction of a GENSET protein, or a fragment
comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more
preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a polypeptide selected from the
group consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785-918 and polypeptides
encoded by the clone inserts of the deposited clone pool, with drugs or small molecules, such as
molecules generated through combinatorial chemistry approaches, the microdialysis coupled to
HPLC method described by Wang et al. (1997) or the affinity capillary electrophoresis method
described by Bush et al. (1997), the disclosures of which are incorporated by reference, can be used.

In further methods, peptides, drugs, fatty acids, lipoproteins, or small molecules which
interact with a GENSET protein, or a fragment comprising a contiguous span of at least 6 amino
acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or
100 amino acids of a polypeptide selected from the group consisting of sequences of SEQ ID
NOs: 170-338, 456-560, 785-918 and polypeptides encoded by the clone inserts of the deposited
clone pool may be identified using assays such as the following. The molecule to be tested for
binding is labeled with a detectable label, such as a fluorescent, radioactive, or enzymatic tag and
placed in contact with immobilized GENSET protein, or a fragment thereof, under conditions which
permit specific binding to occur. After removal of non-specifically bound molecules, bound
molecules are detected using appropriate means.

Various candidate substances or molecules can be assayed for interaction with a GENSET 
polypeptide. These substances or molecules include, without being limited to, natural or synthetic 
organic compounds or molecules of biological origin such as polypeptides. When the candidate 
substance or molecule comprises a polypeptide, this polypeptide may be the resulting expression
product of a phage clone belonging to a phage-based random peptide library, or alternatively the
polypeptide may be the resulting expression product of a cDNA library cloned in a vector suitable 
for performing a two-hybrid screening assay. 

A. Candidate ligands obtained from random peptide libraries 

In a particular embodiment of the screening method, the putative ligand is the expression
product of a DNA insert contained in a phage vector (Parmley and Smith, 1988). Specifically,
random peptide phage libraries are used. The random DNA inserts encode peptides of 8 to 20
amino acids in length (Oldenburg et al., 1992; Valadon et al., 1996; Lucas, 1994; Westerink, 1995;
Felici et al., 1991), which disclosures are hereby incorporated by reference in their entireties.
According to this particular embodiment, the recombinant phages expressing a protein that binds to
an immobilized GENSET protein are retained and the complex formed between the GENSET protein
and the recombinant phage may be subsequently immunoprecipitated by a polyclonal or a
monoclonal antibody directed against the GENSET protein.

Once the ligand library in recombinant phages has been constructed, the phage population
is brought into contact with the immobilized GENSET protein. Then the preparation of complexes
is washed in order to remove the non-specifically bound recombinant phages. The phages that bind
specifically to the GENSET protein are then eluted by a buffer (acid pH) or immunoprecipitated by
the monoclonal antibody produced by the hybridoma anti-GENSET, and this phage population is
subsequently amplified by an over-infection of bacteria (for example E. coli). The selection step
may be repeated several times, preferably 2-4 times, in order to select the more specific
recombinant phage clones. The last step comprises characterizing the peptide produced by the
selected recombinant phage clones either by expression in infected bacteria and isolation, by
expressing the phage insert in another host-vector system, or by sequencing the insert contained in
the selected recombinant phages.

B. Candidate ligands obtained by competition experiments. 

Alternatively, peptides, drugs or small molecules which bind to a GENSET protein or
fragment thereof comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10
amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a polypeptide
selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785-918 and
polypeptides encoded by the clone inserts of the deposited clone pool, may be identified in
competition experiments. In such assays, the GENSET protein, or a fragment thereof, is
immobilized to a surface, such as a plastic plate. Increasing amounts of the peptides, drugs or small
molecules are placed in contact with the immobilized GENSET protein, or a fragment thereof, in
the presence of a detectably labeled known GENSET protein ligand. For example, the GENSET
ligand may be detectably labeled with a fluorescent, radioactive, or enzymatic tag. The ability of
the test molecule to bind the GENSET protein, or a fragment thereof, is determined by measuring
the amount of detectably labeled known ligand bound in the presence of the test molecule. A
decrease in the amount of known ligand bound to the GENSET protein, or a fragment thereof, when
the test molecule is present indicates that the test molecule is able to bind to the GENSET protein,
or a fragment thereof.
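
By way of illustration only, the readout of such a competition experiment can be sketched in a few lines of Python. The function names, the use of a no-competitor control well, and the 50 percent cutoff are assumptions made for this sketch and are not taken from the present specification; any suitable criterion for a "decrease" may be substituted.

```python
def percent_displacement(bound_with_test, bound_control):
    """Percent decrease in bound labeled ligand relative to a control
    measured without any test molecule (hypothetical helper)."""
    if bound_control <= 0:
        raise ValueError("control signal must be positive")
    return 100.0 * (1.0 - bound_with_test / bound_control)


def competition_result(test_amounts, bound_signals, bound_control,
                       min_displacement=50.0):
    """Evaluate a series of increasing test-molecule amounts.

    Returns (binds, displacements): `displacements` lists the percent
    displacement at each amount, ordered by increasing amount, and
    `binds` is True when the highest amount displaces at least
    `min_displacement` percent of the labeled ligand.
    The 50 percent default is an arbitrary, assumed threshold.
    """
    if len(test_amounts) != len(bound_signals) or not test_amounts:
        raise ValueError("need one bound signal per test amount")
    ordered = sorted(zip(test_amounts, bound_signals))
    displacements = [percent_displacement(signal, bound_control)
                     for _, signal in ordered]
    return displacements[-1] >= min_displacement, displacements
```

For instance, calling `competition_result([0.1, 1, 10, 100], [950, 800, 400, 100], 1000.0)` evaluates four wells against a control signal of 1000 arbitrary units and reports whether the labeled ligand was displaced at the highest amount tested.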

C. Candidate ligands obtained by affinity chromatography. 

Proteins or other molecules interacting with a GENSET protein, or a fragment thereof
comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more
preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a polypeptide selected from the
group consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785-918 and polypeptides
encoded by the clone inserts of the deposited clone pool, can also be found using affinity columns
which contain the GENSET protein, or a fragment thereof. The GENSET protein, or a fragment
thereof, may be attached to the column using conventional techniques including chemical coupling
to a suitable column matrix such as agarose, Affi-Gel®, or other matrices familiar to those of skill
in the art. In some embodiments of this method, the affinity column contains chimeric proteins in
which the GENSET protein, or a fragment thereof, is fused to glutathione S-transferase (GST). A
mixture of cellular proteins or pool of expressed proteins as described above is applied to the
affinity column. Proteins or other molecules interacting with the GENSET protein, or a fragment
thereof, attached to the column can then be isolated and analyzed on 2-D electrophoresis gel as
described in Ramunsen et al. (1997), the disclosure of which is incorporated by reference.
Alternatively, the proteins retained on the affinity column can be purified by electrophoresis based
methods and sequenced. The same method can be used to isolate antibodies, to screen phage
display products, or to screen phage display human antibodies.

D. Candidate ligands obtained by optical biosensor methods 

Proteins interacting with a GENSET protein, or a fragment comprising a contiguous span of
at least 6 amino acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 15, 20,
25, 30, 40, 50, or 100 amino acids of a polypeptide selected from the group consisting of sequences
of SEQ ID NOs: 170-338, 456-560, 785-918 and polypeptides encoded by the clone inserts of the
deposited clone pool, can also be screened by using an Optical Biosensor as described in Edwards
and Leatherbarrow (1997) and also in Szabo et al. (1995), the disclosures of which are incorporated
by reference. This technique permits the detection of interactions between molecules in real time,
without the need for labeled molecules. This technique is based on the surface plasmon resonance
(SPR) phenomenon. Briefly, the candidate ligand molecule to be tested is attached to a surface
(such as a carboxymethyl dextran matrix). A light beam is directed towards the side of the surface
that does not contain the sample to be tested and is reflected by said surface. The SPR phenomenon
causes a decrease in the intensity of the reflected light at a specific combination of angle and
wavelength. The binding of candidate ligand molecules causes a change in the refractive index on
the surface, which change is detected as a change in the SPR signal. For screening of candidate
ligand molecules or substances that are able to interact with the GENSET protein, or a fragment
thereof, the GENSET protein, or a fragment thereof, is immobilized onto a surface. This surface
comprises one side of a cell through which flows the candidate molecule to be assayed. The
binding of the candidate molecule on the GENSET protein, or a fragment thereof, is detected as a
change of the SPR signal. The candidate molecules tested may be proteins, peptides, carbohydrates,
lipids, or small molecules generated by combinatorial chemistry. This technique may also be
performed by immobilizing eukaryotic or prokaryotic cells or lipid vesicles exhibiting an
endogenous or a recombinantly expressed GENSET protein at their surface.

The main advantage of the method is that it allows the determination of the association rate 
between the GENSET protein and molecules interacting with the GENSET protein. It is thus
possible to select specifically ligand molecules interacting with the GENSET protein, or a fragment
thereof, through strong or conversely weak association constants. 
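
As a non-limiting sketch of how association rates might be extracted from such biosensor data, the following Python code assumes a simple 1:1 binding model, which is not specified in the present disclosure. Under that assumed model, the observed rate of signal build-up at ligand concentration C is k_obs = ka * C + kd, so a straight-line fit of k_obs against C yields the association rate constant ka (slope) and the dissociation rate constant kd (intercept). All function names are hypothetical.

```python
def fit_rate_constants(concentrations, k_obs_values):
    """Least-squares line through (C, k_obs) points.

    Assumes the 1:1 model k_obs = ka * C + kd.
    Returns (ka, kd): slope and intercept of the fitted line.
    """
    n = len(concentrations)
    if n < 2 or n != len(k_obs_values):
        raise ValueError("need at least two matched (C, k_obs) points")
    mean_c = sum(concentrations) / n
    mean_k = sum(k_obs_values) / n
    sxx = sum((c - mean_c) ** 2 for c in concentrations)
    if sxx == 0:
        raise ValueError("concentrations must not all be equal")
    sxy = sum((c - mean_c) * (k - mean_k)
              for c, k in zip(concentrations, k_obs_values))
    ka = sxy / sxx
    kd = mean_k - ka * mean_c
    return ka, kd


def equilibrium_dissociation_constant(ka, kd):
    """KD = kd / ka for the assumed 1:1 model (lower KD, tighter binding)."""
    if ka <= 0:
        raise ValueError("ka must be positive")
    return kd / ka
```

Candidate ligands could then be ranked by the fitted ka, or by KD, in order to select strongly or weakly associating molecules as the preceding paragraph contemplates.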

E. Candidate ligands obtained through a two-hybrid screening assay. 

The yeast two-hybrid system is designed to study protein-protein interactions in vivo (Fields 
and Song, 1989), which disclosure is hereby incorporated by reference in its entirety, and relies
upon the fusion of a bait protein to the DNA binding domain of the yeast Gal4 protein. This 
technique is also described in the US Patent No. US 5,667,973 and the US Patent No. 5,283,173, the 
technical teachings of both patents being herein incorporated by reference. 

The general procedure of library screening by the two-hybrid assay may be performed as 
described by Harper et al. (1993) or as described by Cho et al. (1998) or also Fromont-Racine et al.
(1997), which disclosures are hereby incorporated by reference in their entireties. 

The bait protein or polypeptide comprises, consists essentially of, or consists of a GENSET 
polypeptide or a fragment thereof comprising a contiguous span of at least 6 amino acids, preferably 
at least 8 to 10 amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids 
of a polypeptide selected from the group consisting of sequences of SEQ ID NOs: 170-338,
456-560, 785-918 and polypeptides encoded by the clone inserts of the deposited clone pool.

More precisely, the nucleotide sequence encoding the GENSET polypeptide or a fragment 
or variant thereof is fused to a polynucleotide encoding the DNA binding domain of the GAL4 
protein, the fused nucleotide sequence being inserted in a suitable expression vector, for example 
pAS2 or pM3.

Then, a human cDNA library is constructed in a specially designed vector, such that the 
human cDNA insert is fused to a nucleotide sequence in the vector that encodes the transcriptional
activation domain of the GAL4 protein. Preferably, the vector used is the pACT vector. The polypeptides
encoded by the nucleotide inserts of the human cDNA library are termed "prey" polypeptides. 

A third vector contains a detectable marker gene, such as the beta galactosidase gene or the CAT
gene, that is placed under the control of a regulation sequence that is responsive to the binding of a
complete Gal4 protein containing both the transcriptional activation domain and the DNA binding 
domain. For example, the vector pG5EC may be used. 

Two different yeast strains are also used. As an illustrative but non-limiting example, the
two different yeast strains may be the following:

- Y190, the phenotype of which is (MATa, Leu2-3, 112 ura3-12, trp1-901, his3-D200,
ade2-101, gal4D gal80D URA3 GAL-LacZ, LYS GAL-HIS3, cyhr);

- Y187, the phenotype of which is (MATα gal4 gal80 his3 trp1-901 ade2-101 ura3-52 leu2-3,
-112 URA3 GAL-lacZ met-), which is the opposite mating type of Y190.

Briefly, 20 µg of pAS2/GENSET and 20 µg of pACT-cDNA library are co-transformed
into yeast strain Y190. The transformants are selected for growth on minimal media lacking
histidine, leucine and tryptophan, but containing the histidine synthesis inhibitor 3-AT (50 mM).
Positive colonies are screened for beta galactosidase by filter lift assay. The double positive
colonies (His+, beta-gal+) are then grown on plates lacking histidine and leucine, but containing
tryptophan and cycloheximide (10 mg/ml) to select for loss of pAS2/GENSET plasmids but
retention of pACT-cDNA library plasmids. The resulting Y190 strains are mated with Y187 strains
expressing GENSET or non-related control proteins, such as cyclophilin B, lamin, or SNF1, as Gal4
fusions as described by Harper et al. (1993) and by Bram et al. (1993), which disclosures are hereby
incorporated by reference in their entireties, and screened for beta galactosidase by filter lift assay.
Yeast clones that are beta gal- after mating with the control Gal4 fusions are considered false
positives.

In another embodiment of the two-hybrid method according to the invention, interaction 
between the GENSET protein or a fragment or variant thereof and cellular proteins may be assessed using
the Matchmaker Two Hybrid System 2 (Catalog No. K1604-1, Clontech). As described in the
manual accompanying the kit, the disclosure of which is incorporated herein by reference, nucleic
acids encoding the GENSET protein or a portion thereof are inserted into an expression vector such
that they are in frame with DNA encoding the DNA binding domain of the yeast transcriptional
activator GAL4. A desired cDNA, preferably human cDNA, is inserted into a second expression
vector such that it is in frame with DNA encoding the activation domain of GAL4. The two
expression plasmids are transformed into yeast and the yeast are plated on selection medium which
selects for expression of selectable markers on each of the expression vectors as well as GAL4
dependent expression of the HIS3 gene. Transformants capable of growing on medium lacking
histidine are screened for GAL4 dependent lacZ expression. Those cells which are positive in both the
histidine selection and the lacZ assay contain an interaction between GENSET and the protein or peptide
encoded by the initially selected cDNA insert.

Compounds Modulating GENSET biological activity 

Another method of screening for compounds that modulate GENSET expression and/or 
biological activity is by measuring the effects of test compounds on specific biological activity, e.g.
a GENSET biological activity in a host cell. In one embodiment, the present invention relates to a
method of identifying an agent which alters GENSET biological activity, wherein a nucleic acid
construct comprising a nucleic acid which encodes a mammalian GENSET polypeptide is
introduced into a host cell. The host cells produced are maintained under conditions appropriate for
expression of the encoded mammalian GENSET polypeptides, whereby the nucleic acid is
expressed. The host cells are then contacted with a compound to be assessed (an "agent," or "test
agent"), and the properties of the cells are assessed. Detection of a change in any GENSET
polypeptide-associated property in the presence of the agent indicates that the agent alters GENSET
activity. In a particular embodiment, the invention relates to a method of identifying an agent
which is an activator of GENSET activity, wherein detection of an increase of any GENSET
polypeptide-associated property in the presence of the agent indicates that the agent activates
GENSET activity. In another particular embodiment, the invention relates to a method of
identifying an agent which is an inhibitor of GENSET activity, wherein detection of a decrease of
any GENSET polypeptide-associated property in the presence of the agent indicates that the agent
inhibits GENSET activity.

In a particular embodiment, a high throughput screen can be used to identify agents that 
activate (enhance) or inhibit GENSET activity (See e.g., PCT publication WO 98/45438, which 
disclosure is hereby incorporated by reference in its entirety). For example, the method of
identifying an agent which alters GENSET activity can be performed as follows. A nucleic acid
construct comprising a polynucleotide which encodes a mammalian GENSET polypeptide is
introduced into a host cell to produce recombinant host cells. The recombinant host cells are then
maintained under conditions appropriate for expression of the encoded mammalian GENSET
polypeptide, whereby the nucleic acid is expressed. The compound to be assessed is added to the
recombinant host cells; the resulting combination is referred to as a test sample. A detectable,
GENSET polypeptide-associated property of the cells is detected. A control can be used in the
methods of detecting agents which alter GENSET activity. For example, the control sample
includes the same reagents but lacks the compound or agent being assessed; it is treated in the same
manner as the test sample.
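
Purely as an illustrative sketch, the comparison of a test sample with its control sample can be expressed as a fold change, as in the Python code below. The function name, the returned labels, and the 1.5-fold default threshold are assumptions introduced for this example; the specification does not prescribe any particular cutoff or statistic.

```python
def classify_agent(test_signal, control_signal, fold_threshold=1.5):
    """Compare a GENSET-associated readout in the test sample with the
    same readout in the control sample lacking the agent.

    Returns "activator" if the readout rises by at least `fold_threshold`,
    "inhibitor" if it falls by at least the same factor, and "no effect"
    otherwise. The threshold is an arbitrary, assumed value.
    """
    if control_signal <= 0:
        raise ValueError("control signal must be positive")
    if fold_threshold <= 1:
        raise ValueError("fold_threshold must be greater than 1")
    ratio = test_signal / control_signal
    if ratio >= fold_threshold:
        return "activator"
    if ratio <= 1.0 / fold_threshold:
        return "inhibitor"
    return "no effect"
```

In a high throughput setting the same function would simply be applied to each well, each paired with a control treated in the same manner, and replicate wells would ordinarily be averaged before classification.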

Methods of Screening for Compounds Modulating GENSET Expression and/or Activity 

The present invention also relates to methods of screening compounds for their ability to 
modulate (e.g. increase or inhibit) the activity or expression of GENSET. More specifically, the 
present invention relates to methods of testing compounds for their ability either to increase or to 
decrease expression or activity of GENSET. The assays are performed in vitro or in vivo.

In vitro methods 

In vitro, cells expressing GENSET polypeptides are incubated in the presence and absence
of the test compound. By determining the level of GENSET expression in the presence of the test
compound or the level of GENSET activity in the presence of the test compound, compounds can
be identified that suppress or enhance GENSET expression or activity. Alternatively, constructs
comprising a GENSET regulatory sequence operably linked to a reporter gene (e.g. luciferase,
chloramphenicol acetyl transferase, LacZ, green fluorescent protein, etc.) can be introduced into
host cells and the effect of the test compounds on expression of the reporter gene detected. Cells
suitable for use in the foregoing assays include, but are not limited to, cells having the same origin
as tissues or cell lines in which the polypeptide is known to be expressed using the data from Tables
III, IV, or V.

Consequently, the present invention encompasses a method for screening molecules that 
modulate the expression of a GENSET gene, said screening method comprising the steps of: 

a) cultivating a prokaryotic or a eukaryotic cell that has been transfected with a nucleotide
sequence encoding a GENSET protein or a variant or a fragment thereof, placed under the control
of its own promoter;

b) bringing into contact said cultivated cell with a molecule to be tested;

c) quantifying the expression of said GENSET protein or a variant or a fragment thereof in
the presence of said molecule.

Using DNA recombination techniques well known to one skilled in the art, the GENSET
protein encoding DNA sequence is inserted into an expression vector, downstream from its
promoter sequence. As an illustrative example, the promoter sequence of the GENSET gene is
contained in the 5' untranscribed region of the GENSET genomic DNA.

The quantification of the expression of a GENSET protein may be realized either at the
mRNA level (using for example Northern blots, RT-PCR, preferably quantitative RT-PCR with
primers and probes specific for the GENSET mRNA of interest) or at the protein level (using
polyclonal or monoclonal antibodies in immunoassays such as ELISA or RIA assays, Western blots,
or immunochemistry).

The present invention also concerns a method for screening substances or molecules that 
are able to increase, or in contrast to decrease, the level of expression of a GENSET gene. Such a 
method may allow the one skilled in the art to select substances exerting a regulating effect on the 
expression level of a GENSET gene and which may be useful as active ingredients included in
pharmaceutical compositions for treating patients suffering from disorders associated with abnormal 
levels of GENSET products. 

Thus, another part of the present invention is a method for screening a candidate molecule
that modulates the expression of a GENSET gene, this method comprising the following steps:

a) providing a recombinant cell host containing a nucleic acid, wherein said nucleic acid
comprises a GENSET 5' regulatory region or a regulatory active fragment or variant thereof,
operably linked to a polynucleotide encoding a detectable protein;

b) obtaining a candidate molecule; and

c) determining the ability of said candidate molecule to modulate the expression levels of
said polynucleotide encoding the detectable protein.

In a further embodiment, said nucleic acid comprising a GENSET 5' regulatory region or a
regulatory active fragment or variant thereof, includes the 5'UTR region of a GENSET cDNA
selected from the group comprising the 5'UTRs of the sequences of SEQ ID NOs: 1-169,
339-455, 561-784, sequences of clone inserts of the deposited clone pool, regulatory active fragments
and variants thereof. In a more preferred embodiment of the above screening method, said nucleic
acid includes a promoter sequence which is endogenous with respect to the GENSET 5'UTR
sequence. In another more preferred embodiment of the above screening method, said nucleic acid
includes a promoter sequence which is exogenous with respect to the GENSET 5'UTR sequence
defined therein.

Preferred polynucleotides encoding a detectable protein are polynucleotides encoding beta
galactosidase, green fluorescent protein (GFP) and chloramphenicol acetyl transferase (CAT).

The invention further relates to a method for the production of a pharmaceutical 
composition comprising a method of screening a candidate molecule that modulates the expression
of a GENSET gene and furthermore mixing the identified molecule with a pharmaceutically
acceptable carrier. 

The invention also pertains to kits for the screening of a candidate substance modulating the 
expression of a GENSET gene. Preferably, such kits comprise a recombinant vector that allows the 
expression of a GENSET 5' regulatory region or a regulatory active fragment or a variant thereof,
operably linked to a polynucleotide encoding a detectable protein or a GENSET protein or a
fragment or a variant thereof. More preferably, such kits include a recombinant vector that
comprises a nucleic acid including the 5'UTR region of a GENSET cDNA selected from the group
comprising the 5'UTRs of the sequences of SEQ ID NOs: 1-169, 339-455, 561-784, sequences of
clone inserts of the deposited clone pool, regulatory active fragments and variants thereof, being
operably linked to a polynucleotide encoding a detectable protein.

For the design of suitable recombinant vectors useful for performing the screening methods
described above, reference is made to the section of the present specification wherein the preferred
recombinant vectors of the invention are detailed.

Another object of the present invention comprises methods and kits for the screening of
candidate substances that interact with a GENSET polypeptide, fragments or variants thereof. By
their capacity to bind covalently or non-covalently to a GENSET protein, fragments or variants
thereof, these substances or molecules may be advantageously used both in vitro and in vivo.

In vitro, said interacting molecules may be used as detection means in order to identify the 
presence of a GENSET protein in a sample, preferably a biological sample.

The invention also concerns a method for the screening of a candidate substance that interacts
with a GENSET polypeptide, fragments or variants thereof, said method comprising the following steps:

a) providing a polypeptide comprising, consisting essentially of, or consisting of a
GENSET protein or a fragment comprising a contiguous span of at least 6 amino acids, preferably
at least 8 to 10 amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids
of a polypeptide selected from the group consisting of sequences of SEQ ID NOs: 170-338,
456-560, 785-918 and polypeptides encoded by the clone inserts of the deposited clone pool;

b) obtaining a candidate substance;

c) bringing into contact said polypeptide with said candidate substance;

d) detecting the complexes formed between said polypeptide and said candidate substance.

The invention further relates to a method for the production of a pharmaceutical 
composition comprising a method for the screening of a candidate substance that interacts with a
GENSET polypeptide, fragments or variants thereof and furthermore mixing the identified 
substance with a pharmaceutically acceptable carrier. 

The invention further concerns a kit for the screening of a candidate substance interacting
with the GENSET polypeptide, wherein said kit comprises:

a) a polypeptide comprising, consisting essentially of, or consisting of a GENSET protein 
or a fragment comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 
amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a polypeptide
selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785-918 and
polypeptides encoded by the clone inserts of the deposited clone pool; and 

b) optionally means useful to detect the complex formed between said polypeptide or a 
variant thereof and the candidate substance. 

In a preferred embodiment of the kit described above, the detection means comprises a
monoclonal or polyclonal antibody binding to said GENSET protein or fragment or variant thereof.

In vivo methods 

Compounds that suppress or enhance GENSET expression can also be identified using in 
vivo screens. In these assays, the test compound is administered (e.g. IV, IP, IM, orally, or
otherwise) to the animal, for example, at a variety of dose levels. The effect of the compound on
GENSET expression is determined by comparing GENSET levels, for example in tissues known to
express the gene of interest using, for example, the data obtained in Tables III, IV, or V, and using
Northern blots, immunoassays, PCR, etc., as described above. Suitable test animals include, but are
not limited to, rodents (e.g., mice and rats), primates, and rabbits. Humanized mice can also be
used as test animals, that is, mice in which the endogenous mouse protein is ablated (knocked out)
and the homologous human protein added back by standard transgenic approaches. Such mice
express only the human form of a protein. Humanized mice expressing only the human GENSET
can be used to study in vivo responses to potential agents regulating GENSET protein or mRNA
levels. As an example, transgenic mice have been produced carrying the human apoE4 gene. They
are then bred with a mouse line that lacks endogenous apoE, to produce an animal model carrying
human proteins believed to be instrumental in development of Alzheimer's pathology. Such
transgenic animals are useful for dissecting the biochemical and physiological steps of disease, and
for development of therapies for disease intervention (Loring et al., 1996) (incorporated herein by
reference in its entirety).

Uses for compounds modulating GENSET expression and/or biological activity

Using in vivo (or in vitro) systems, it may be possible to identify compounds that exert a
tissue-specific effect, for example, that increase GENSET expression or activity only in tissues of
interest, such as the adrenal gland, bone marrow, brain, cerebellum, colon, fetal brain, fetal kidney,
fetal liver, heart, hypertrophic prostate, kidney, liver, lung, lymph ganglia, lymphocytes, muscle,
ovary, pancreas, pituitary gland, placenta, prostate, salivary gland, spinal cord, spleen, stomach,
intestine, substantia nigra, testis, thyroid, umbilical cord, and uterus. Screening procedures such as
those described above are also useful for identifying agents for their potential use in
pharmacological intervention strategies. Agents that enhance GENSET gene expression or
stimulate its activity may thus be used to induce any phenotype associated with a GENSET gene, or
to treat disorders resulting from a deficiency of a GENSET polypeptide activity or expression.
Compounds that suppress GENSET polypeptide expression or inhibit its activity can be used to
treat any disease or condition associated with increased or deleterious GENSET polypeptide activity
or expression.

Also encompassed by the present invention is an agent which interacts with a GENSET
gene or polypeptide directly or indirectly, and inhibits or enhances GENSET polypeptide expression
and/or function. In one embodiment, the agent is an inhibitor which interferes with a GENSET
polypeptide directly (e.g., by binding the GENSET polypeptide) or indirectly (e.g., by blocking the
ability of the GENSET polypeptide to have a GENSET biological activity). In a particular
embodiment, an inhibitor of a GENSET protein is an antibody specific for the GENSET protein or a
functional portion of the GENSET protein; that is, the antibody binds a GENSET polypeptide. For
example, the antibody can be specific for a polypeptide encoded by one of the nucleic acid
sequences of human GENSETs (SEQ ID NOs:1-169, 339-455, 561-784), a mammalian GENSET
nucleic acid, or portions thereof. Alternatively, the inhibitor can be an agent other than an antibody
(e.g., small organic molecule, protein or peptide) which binds the GENSET polypeptide and blocks
its activity. For example, the inhibitor can be an agent which mimics the GENSET polypeptide
structurally, but lacks its function. Alternatively, it can be an agent which binds to or interacts with
a molecule which the GENSET polypeptide normally binds to or interacts with, thus blocking the
GENSET polypeptide from doing so and preventing it from exerting the effects it would normally
exert.

In another embodiment, the agent is an enhancer (activator) of a GENSET polypeptide
which increases the activity of the GENSET polypeptide (increases the effect of a given amount or
level of GENSET), increases the length of time it is effective (by preventing its degradation or
otherwise prolonging the time during which it is active), or both, either directly or indirectly. For
example, GENSET polynucleotides and polypeptides can be used to identify drugs which increase
or decrease the ability of GENSET polypeptides to induce GENSET biological activity, which
drugs are useful for the treatment or prevention of any disease or condition associated with a
GENSET biological activity.

The GENSET sequences of the present invention can also be used to generate nonhuman
gene knockout animals, such as mice, which lack a GENSET gene or transgenically overexpress a
GENSET gene. For example, such GENSET gene knockout mice can be generated and used to
obtain further insight into the function of the GENSET gene as well as assess the specificity of
GENSET activators and inhibitors. Also, overexpression of the GENSET gene (e.g., a human
GENSET gene) in transgenic mice can be used as a means of creating a test system for GENSET
activators and inhibitors (e.g., against a human GENSET polypeptide). In addition, the GENSET
gene can be used to clone the GENSET promoter/enhancer in order to identify regulators of
GENSET gene transcription. GENSET gene knockout animals include animals which completely
or partially lack the GENSET gene and/or GENSET activity or function. Thus the present
invention relates to a method of inhibiting (partially or completely) a GENSET biological activity in
a mammal (e.g., a human), the method comprising administering to the mammal an effective
amount of an inhibitor of a GENSET polypeptide or polynucleotide. The invention also relates to a
method of enhancing a GENSET biological activity in a mammal, the method comprising
administering to the mammal an effective amount of an enhancer of a GENSET polypeptide or
polynucleotide.

Inhibiting GENSET gene expression 

Therapeutic compositions according to the present invention may advantageously comprise
one or several GENSET oligonucleotide fragments as an antisense tool or a triple helix tool that
inhibits the expression of the corresponding GENSET gene.

Antisense Approach 

In antisense approaches, nucleic acid sequences complementary to an mRNA are
hybridized to the mRNA intracellularly, thereby blocking the expression of the protein encoded by
the mRNA. The antisense nucleic acid molecules to be used in gene therapy may be either DNA or
RNA sequences. Preferred methods using antisense polynucleotides according to the present
invention are the procedures described by Sczakiel et al. (1995), which disclosure is hereby
incorporated by reference in its entirety.

Preferably, the antisense tools are chosen among the polynucleotides (15-200 bp long) that
are complementary to GENSET mRNA, more preferably to the 5' end of the GENSET mRNA. In
another embodiment, a combination of different antisense polynucleotides complementary to
different parts of the desired targeted gene is used.
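
As an illustration only, the complementarity requirement can be sketched in a few lines of Python. The function name, the default 20-base window, and the example sequence are hypothetical and do not come from the present disclosure; the sketch simply returns the reverse complement of a chosen window at the 5' end of an mRNA.

```python
# Hypothetical helper for illustration; not part of the disclosure.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna: str, start: int = 0, length: int = 20) -> str:
    """Return the antisense RNA (written 5'->3') for mrna[start:start+length]."""
    # Accept DNA-style input by treating T as U.
    target = mrna.upper().replace("T", "U")[start:start + length]
    if len(target) < length:
        raise ValueError("target window runs past the end of the mRNA")
    # Complement each base, then reverse so the result reads 5'->3'.
    return target.translate(COMPLEMENT)[::-1]


# Made-up 5' region, used only to exercise the function.
example_mrna = "GGCACGAUGGCUUCAGCAUUGGCC"
oligo = antisense_rna(example_mrna, start=0, length=15)
```

Several windows along the same mRNA could be generated by varying `start`, which corresponds to the combination embodiment described above.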

Other preferred antisense polynucleotides according to the present invention are sequences
complementary to either a sequence of GENSET mRNAs comprising the translation initiation
codon ATG or a sequence of GENSET genomic DNA containing a splicing donor or acceptor site.

Preferably, the antisense polynucleotides of the invention have a 3' polyadenylation signal
that has been replaced with a self-cleaving ribozyme sequence, such that RNA polymerase II
transcripts are produced without poly(A) at their 3' ends, these antisense polynucleotides being
incapable of export from the nucleus, such as described by Liu et al. (1994), which disclosure is
hereby incorporated by reference in its entirety. In a preferred embodiment, these GENSET
antisense polynucleotides also comprise, within the ribozyme cassette, a histone stem-loop structure
to stabilize cleaved transcripts against 3'-5' exonucleolytic degradation, such as the structure
described by Eckner et al. (1991), which disclosure is hereby incorporated by reference in its
entirety.

The antisense nucleic acids should have a length and melting temperature sufficient to
permit formation of an intracellular duplex having sufficient stability to inhibit the expression of the
GENSET mRNA in the duplex. Strategies for designing antisense nucleic acids suitable for use in
gene therapy are disclosed in Green et al. (1986) and Izant and Weintraub (1984), the disclosures
of which are incorporated herein by reference.
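
The text above does not say how melting temperature is to be estimated. As a rough sketch only, the widely used Wallace rule (2 °C per A/T and 4 °C per G/C) gives a first approximation for short oligonucleotides; choosing this rule is an assumption made here for illustration, and the function name is hypothetical.

```python
def wallace_tm(oligo: str) -> int:
    """Approximate melting temperature (deg C) of a short oligonucleotide.

    Uses the Wallace rule, Tm = 2*(A+T) + 4*(G+C), which is only a rough
    guide and is generally applied to oligonucleotides of about 14-20 bases.
    """
    seq = oligo.upper()
    at = sum(seq.count(base) for base in "ATU")  # U counted like T for RNA
    gc = sum(seq.count(base) for base in "GC")
    if at + gc != len(seq):
        raise ValueError("sequence contains characters other than A, C, G, T, U")
    return 2 * at + 4 * gc
```

For the longer antisense molecules contemplated above, a nearest-neighbour thermodynamic model would normally be preferred over this rule.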

In some strategies, antisense molecules are obtained by reversing the orientation of the
GENSET coding region with respect to a promoter so as to transcribe the opposite strand from that
which is normally transcribed in the cell. The antisense molecules may be transcribed using in vitro
transcription systems such as those which employ T7 or SP6 polymerase to generate the transcript.
Another approach involves transcription of GENSET antisense nucleic acids in vivo by operably
linking DNA containing the antisense sequence to a promoter in a suitable expression vector.

Alternatively, oligonucleotides which are complementary to the strand normally transcribed
in the cell may be synthesized in vitro. Thus, the antisense nucleic acids are complementary to the
corresponding mRNA and are capable of hybridizing to the mRNA to create a duplex. In some
embodiments, the antisense sequences may contain modified sugar phosphate backbones to increase
stability and make them less sensitive to RNase activity. Examples of modifications suitable for use
in antisense strategies include 2' O-methyl RNA oligonucleotides and peptide nucleic acid (PNA)
oligonucleotides. Further examples are described by Rossi et al. (1991), which disclosure is hereby
incorporated by reference in its entirety.

Various types of antisense oligonucleotides complementary to the sequence of the GENSET
cDNA or genomic DNA may be used. In one preferred embodiment, stable and semi-stable
antisense oligonucleotides described in International Application No. PCT WO94/23026, hereby
incorporated by reference, are used. In these molecules, the 3' end or both the 3' and 5' ends are
engaged in intramolecular hydrogen bonding between complementary base pairs. These molecules
are better able to withstand exonuclease attacks and exhibit increased stability compared to
conventional antisense oligonucleotides.

In another preferred embodiment, the antisense oligodeoxynucleotides against herpes
simplex virus types 1 and 2 described in International Application No. WO 95/04141, hereby
incorporated by reference, are used.

In yet another preferred embodiment, the covalently cross-linked antisense oligonucleotides
described in International Application No. WO 96/31523, hereby incorporated by reference, are
used. These double- or single-stranded oligonucleotides comprise one or more, respectively, inter-
or intra-oligonucleotide covalent cross-linkages, wherein the linkage consists of an amide bond
between a primary amine group of one strand and a carboxyl group of the other strand or of the
same strand, respectively, the primary amine group being directly substituted in the 2' position of
the strand nucleotide monosaccharide ring, and the carboxyl group being carried by an aliphatic
spacer group substituted on a nucleotide or nucleotide analog of the other strand or the same strand,
respectively.

The antisense oligodeoxynucleotides and oligonucleotides disclosed in International 
Application No. WO 92/18522, incorporated by reference, may also be used. These molecules are 
stable to degradation and contain at least one transcription control recognition sequence which binds 
to control proteins and are effective as decoys therefor. These molecules may contain "hairpin"
structures, "dumbbell" structures, "modified dumbbell" structures, "cross-linked" decoy structures 
and "loop" structures. 

In another preferred embodiment, the cyclic double-stranded oligonucleotides described in
European Patent Application No. 0 572 287 A2, hereby incorporated by reference, are used. These
ligated oligonucleotide "dumbbells" contain the binding site for a transcription factor and inhibit
expression of the gene under control of the transcription factor by sequestering the factor.

Use of the closed antisense oligonucleotides disclosed in International Application No. WO
92/19732, hereby incorporated by reference, is also contemplated. Because these molecules have
no free ends, they are more resistant to degradation by exonucleases than are conventional
oligonucleotides. These oligonucleotides may be multifunctional, interacting with several regions
which are not adjacent to the target mRNA.

The appropriate level of antisense nucleic acids required to inhibit gene expression may be
determined using in vitro expression analysis. The antisense molecule may be introduced into the
cells by diffusion, injection, infection or transfection using procedures known in the art. For
example, the antisense nucleic acids can be introduced into the body as a bare or naked
oligonucleotide, oligonucleotide encapsulated in lipid, oligonucleotide sequence encapsidated by
viral protein, or as an oligonucleotide operably linked to a promoter contained in an expression
vector. The expression vector may be any of a variety of expression vectors known in the art,
including retroviral or viral vectors, vectors capable of extrachromosomal replication, or integrating
vectors. The vectors may be DNA or RNA.

The antisense molecules are introduced onto cell samples at a number of different
concentrations, preferably between 1x10^-10 M and 1x10^-4 M. Once the minimum concentration that can
adequately control gene expression is identified, the optimized dose is translated into a dosage
suitable for use in vivo. For example, an inhibiting concentration in culture of 1x10^-7 M translates into
a dose of approximately 0.6 mg/kg bodyweight. Levels of oligonucleotide approaching 100 mg/kg
bodyweight or higher may be possible after testing the toxicity of the oligonucleotide in laboratory
animals. It is additionally contemplated that cells from the vertebrate are removed, treated with the
antisense oligonucleotide, and reintroduced into the vertebrate.
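
The conversion from a culture concentration to a bodyweight dose is not spelled out in the text. One plausible reading, offered only as a sketch, multiplies the molar concentration by the oligonucleotide's molecular weight and by an assumed distribution volume. The 1x10^-7 M concentration, the molecular weight of about 6,000 g/mol (roughly an 18- to 20-mer) and the distribution volume of about 1 L per kg bodyweight used below are all assumptions chosen for illustration, as is the function name.

```python
def dose_mg_per_kg(conc_molar: float,
                   mol_weight_g_per_mol: float = 6000.0,
                   dist_volume_l_per_kg: float = 1.0) -> float:
    """Convert an in vitro molar concentration into an approximate dose.

    Both default parameters are illustrative assumptions, not values
    taken from the disclosure.
    """
    grams_per_litre = conc_molar * mol_weight_g_per_mol
    milligrams_per_litre = grams_per_litre * 1000.0
    return milligrams_per_litre * dist_volume_l_per_kg


# Under the assumptions above, 1e-7 M corresponds to about 0.6 mg/kg.
approx_dose = dose_mg_per_kg(1e-7)
```

Any dose obtained this way is only a starting point for the toxicity testing in laboratory animals mentioned above.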

In a preferred application of this invention, the polypeptide encoded by the gene is first
identified, so that the effectiveness of antisense inhibition on translation can be monitored using
techniques that include but are not limited to antibody-mediated tests such as RIAs and ELISA,
functional assays, or radiolabeling.

An alternative to the antisense technology that is used according to the present invention
comprises using ribozymes that will bind to a target sequence via their complementary
polynucleotide tail and that will cleave the corresponding RNA by hydrolyzing its target site
(namely "hammerhead ribozymes"). Briefly, the simplified cycle of a hammerhead ribozyme
comprises (1) sequence specific binding to the target RNA via complementary antisense sequences;
(2) site-specific hydrolysis of the cleavable motif of the target strand; and (3) release of cleavage
products, which gives rise to another catalytic cycle. Indeed, the use of long-chain antisense
polynucleotides (at least 30 bases long) or ribozymes with long antisense arms is advantageous. A
preferred delivery system for antisense ribozymes is achieved by covalently linking these antisense
ribozymes to lipophilic groups or by using liposomes as a convenient vector. Preferred antisense
ribozymes according to the present invention are prepared as described by Rossi et al. (1991) and
Sczakiel et al. (1995), the specific preparation procedures being referred to in said articles being
herein incorporated by reference.

Triple Helix Approach 

The GENSET genomic DNA may also be used to inhibit the expression of the GENSET
gene based on intracellular triple helix formation.

Triple helix oligonucleotides are used to inhibit transcription from a genome. They are
particularly useful for studying alterations in cell activity when it is associated with a particular
gene. The GENSET cDNAs or genomic DNAs of the present invention or, more preferably, a
fragment of those sequences, can be used to inhibit gene expression in individuals having diseases
associated with expression of a particular gene. Similarly, a portion of the GENSET genomic DNA
can be used to study the effect of inhibiting GENSET gene transcription within a cell.
Traditionally, homopurine sequences were considered the most useful for triple helix strategies.
However, homopyrimidine sequences can also inhibit gene expression. Such homopyrimidine
oligonucleotides bind to the major groove at homopurine:homopyrimidine sequences. Thus, both
types of sequences from the GENSET genomic DNA are contemplated within the scope of this
invention.

To carry out gene therapy strategies using the triple helix approach, the sequences of the
GENSET genomic DNA are first scanned to identify 10-mer to 20-mer homopyrimidine or
homopurine stretches which could be used in triple-helix based strategies for inhibiting GENSET
expression. Following identification of candidate homopyrimidine or homopurine stretches, their
efficiency in inhibiting GENSET expression is assessed by introducing varying amounts of
oligonucleotides containing the candidate sequences into tissue culture cells which express the
GENSET gene.
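
The scanning step lends itself to a short sketch. The Python below is one possible way to locate candidate stretches, not a method prescribed by the text; the function name, the tuple layout of the results, and the choice to truncate runs longer than 20 bases to their first 20 bases are assumptions made for illustration.

```python
import re


def find_homo_stretches(dna: str, min_len: int = 10, max_len: int = 20):
    """List homopurine and homopyrimidine runs of at least min_len bases.

    Returns (start_index, kind, sequence) tuples sorted by position.
    Runs longer than max_len are truncated to their first max_len bases.
    """
    seq = dna.upper()
    hits = []
    for kind, bases in (("homopurine", "AG"), ("homopyrimidine", "CT")):
        # e.g. "[AG]{10,}" matches maximal runs of 10 or more purines
        pattern = re.compile(f"[{bases}]{{{min_len},}}")
        for match in pattern.finditer(seq):
            hits.append((match.start(), kind, match.group()[:max_len]))
    return sorted(hits)


# Made-up sequence used only to exercise the function.
demo = "T" + "AGGAGAAGGAGA" + "CTTCCTCTTC" + "A"
candidates = find_homo_stretches(demo)
```

Each candidate would then be tested in tissue culture as described above; the scan only narrows down which oligonucleotides to synthesize.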

The oligonucleotides can be introduced into the cells using a variety of methods known to
those skilled in the art, including but not limited to calcium phosphate precipitation,
DEAE-Dextran, electroporation, liposome-mediated transfection or native uptake.

Treated cells are monitored for altered cell function or reduced GENSET expression using
techniques such as Northern blotting, RNase protection assays, or PCR-based strategies to monitor
the transcription levels of the GENSET gene in cells which have been treated with the
oligonucleotide. The cell functions to be monitored are predicted based upon the homologies of the
target gene corresponding to the cDNA from which the oligonucleotide was derived with known
gene sequences that have been associated with a particular function. The cell functions can also be
predicted based on the presence of abnormal physiology within cells derived from individuals with
a particular inherited disease, particularly when the cDNA is associated with the disease using
techniques described in the section entitled "Identification of genes associated with hereditary
diseases or drug response".

The oligonucleotides which are effective in inhibiting gene expression in tissue culture cells
may then be introduced in vivo using the techniques and at a dosage calculated based on the in vitro
results, as described in the section entitled "Antisense Approach".

In some embodiments, the natural (beta) anomers of the oligonucleotide units can be
replaced with alpha anomers to render the oligonucleotide more resistant to nucleases. Further, an
intercalating agent such as ethidium bromide, or the like, can be attached to the 3' end of the alpha
oligonucleotide to stabilize the triple helix. For information on the generation of oligonucleotides
suitable for triple helix formation see Griffin et al. (1989), which is hereby incorporated by this
reference.

Treating GENSET gene-related disorders

The present invention further relates to methods, uses of GENSET polypeptides and
polynucleotides, and uses of modulators of GENSET polypeptides and polynucleotides, for treating
diseases/disorders associated with GENSET genes by increasing or decreasing GENSET gene
activity and/or expression. These methodologies can be effected using compounds selected using
screening protocols such as those described herein and/or by using the gene therapy and antisense
approaches described in the art and herein. Gene therapy can be used to effect targeted expression
of GENSET genes in any tissue, e.g., a tissue associated with the disease or condition to be treated.
The GENSET coding sequence can be cloned into an appropriate expression vector and targeted to
a particular cell type(s) to achieve efficient, high level expression. Introduction of the GENSET
coding sequence into target cells can be achieved, for example, using particle mediated DNA
delivery (Haynes, 1996 and Maurer, 1999), direct injection of naked DNA (Levy et al., 1996; and
Felgner, 1996), or viral vector mediated transport (Smith et al., 1996; Stone et al., 2000; Wu and
Atai, 2000), each of which disclosures is hereby incorporated by reference in its entirety.
Tissue specific effects can be achieved, for example, in the case of virus mediated transport by
using viral vectors that are tissue specific, or by the use of promoters that are tissue specific. For
instance, any tissue-specific promoter may be used to achieve specific expression, for example
albumin promoters (liver specific; Pinkert et al., 1987 Genes Dev. 1:268-277), lymphoid specific
promoters (Calame et al., 1988 Adv. Immunol. 43:235-275), promoters of T-cell receptors (Winoto
et al., 1989 EMBO J. 8:729-733) and immunoglobulins (Banerji et al., 1983 Cell 33:729-740;
Queen and Baltimore 1983 Cell 33:741-748), neuron-specific promoters (e.g., the neurofilament
promoter; Byrne et al., 1989 Proc. Natl. Acad. Sci. USA 86:5473-5477), pancreas-specific
promoters (Edlund et al., 1985 Science 230:912-916) or mammary gland-specific promoters (milk
whey promoter, U.S. Pat. No. 4,873,316 and European Application Publication No. 264,166).
Developmentally-regulated promoters can also be used, such as the murine homeobox promoters
(Kessel et al., 1990 Science 249:374-379) or the alpha-fetoprotein promoter (Campes et al., 1989
Genes Dev. 3:537-546).

Combinatorial approaches can also be used to ensure that the GENSET coding sequence is
activated in the target tissue (Butt and Karathanasis, 1995; Miller and Whelan, 1997), which
disclosures are hereby incorporated by reference in their entireties. Antisense oligonucleotides
complementary to GENSET mRNA can be used to selectively diminish or ablate the expression of
the protein, for example, at sites of inflammation. More specifically, antisense constructs or
antisense oligonucleotides can be used to inhibit the production of GENSET in high expressing
cells such as those cited in the third column of Table V. Antisense mRNA can be produced by
transfecting into target cells an expression vector with the GENSET gene sequence, or a portion
thereof, oriented in an antisense direction relative to the direction of transcription. Appropriate
vectors include viral vectors, including retroviral, adenoviral, and adeno-associated viral vectors, as
well as nonviral vectors. Tissue specific promoters can be used, as described supra. Alternatively,
antisense oligonucleotides can be introduced directly into target cells to achieve the same goal.
(See also other delivery methodologies described herein in connection with gene therapy.)
Oligonucleotides can be selected/designed to achieve a high level of specificity (Wagner et al.,
1996), which disclosure is hereby incorporated by reference in its entirety. The therapeutic
methodologies described herein are applicable to both human and non-human mammals (including
cats and dogs).

Pharmaceutical and physiologically acceptable compositions 

The present invention also relates to pharmaceutical or physiologically acceptable
compositions comprising, as active agent, the polypeptides, nucleic acids or antibodies of the
invention. The invention also relates to compositions comprising, as active agent, compounds
selected using the above-described screening protocols. Such compositions include the active agent
in combination with a pharmaceutically or physiologically acceptable carrier. In the case of naked
DNA, the "carrier" may be gold particles. The amount of active agent in the composition can vary
with the agent, the patient and the effect sought. Likewise, the dosing regimen can vary depending
on the composition and the disease/disorder to be treated.

Therefore, the invention relates to methods for the production of a pharmaceutical
composition comprising a method for selecting an active agent, compound, substance or molecule
using any of the screening methods described herein and furthermore mixing the identified active
agent, compound, substance or molecule with a pharmaceutically acceptable carrier.

The pharmaceutical compositions utilized in this invention may be administered by any
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial,
intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal,
enteral, topical, sublingual, or rectal means. In addition to the active ingredients, these
pharmaceutical compositions may contain suitable pharmaceutically acceptable carriers comprising
excipients and auxiliaries which facilitate processing of the active compounds into preparations
which can be used pharmaceutically. Further details on techniques for formulation and
administration may be found in the latest edition of Remington's Pharmaceutical Sciences (Mack
Publishing Co., Easton, Pa.).

Pharmaceutical compositions for oral administration can be formulated using
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral
administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets,
pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the
patient.

Pharmaceutical preparations for oral use can be obtained through a combination of active
compounds with solid excipient, optionally grinding the resulting mixture, and processing the mixture
of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable
excipients are carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, or
sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose,
hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums including arabic and
tragacanth; and proteins such as gelatin and collagen. If desired, disintegrating or solubilizing
agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, alginic acid, or a salt
thereof, such as sodium alginate.

Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar
solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel,
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or
solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for product
identification or to characterize the quantity of active compound, i.e., dosage.

Pharmaceutical preparations which can be used orally include push-fit capsules made of
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol.
Push-fit capsules can contain active ingredients mixed with a filler or binders, such as lactose or
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft
capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils,
liquid, or liquid polyethylene glycol with or without stabilizers.

Pharmaceutical formulations suitable for parenteral administration may be formulated in
aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution,
Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain
substances which increase the viscosity of the suspension, such as sodium carboxymethylcellulose,
sorbitol, or dextran. Additionally, suspensions of the active compounds may be prepared as
appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils
such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes.
Optionally, the suspension may also contain suitable stabilizers or agents which increase the
solubility of the compounds to allow for the preparation of highly concentrated solutions.

For topical or nasal administration, penetrants appropriate to the particular barrier to be
permeated are used in the formulation. Such penetrants are generally known in the art.

The pharmaceutical compositions of the present invention may be manufactured in a
manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating,
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes.

The pharmaceutical composition may be provided as a salt and can be formed with many
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, succinic, etc.
Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding free
base forms. In other cases, the preferred preparation may be a lyophilized powder which may
contain any or all of the following: 1-50 mM histidine, 0.1%-2% sucrose, and 2-7% mannitol, at a
pH range of 4.5 to 5.5, that is combined with buffer prior to use.

After pharmaceutical compositions have been prepared, they can be placed in an
appropriate container and labeled for treatment of an indicated condition. For administration of a
GENSET polypeptide, such labeling would include amount, frequency, and method of
administration.

Pharmaceutical compositions suitable for use in the invention include compositions wherein 
the active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially either in cell
culture assays, e.g., of neoplastic cells, or in animal models, usually mice, rabbits, dogs, or pigs.
The animal model may also be used to determine the appropriate concentration range and route of
administration. Such information can then be used to determine useful doses and routes for
administration in humans.

A therapeutically effective dose refers to that amount of active ingredient, for example a
GENSET polypeptide or fragments thereof, antibodies specific to GENSET polypeptides, agonists,
antagonists or inhibitors of GENSET polypeptides, which ameliorates the symptoms or condition.
Therapeutic efficacy and toxicity may be determined by standard pharmaceutical procedures in cell
cultures or experimental animals, e.g., ED50 (the dose therapeutically effective in 50% of the
population) and LD50 (the dose lethal to 50% of the population). The dose ratio between
toxic and therapeutic effects is the therapeutic index, and it can be expressed as the ratio
LD50/ED50. Pharmaceutical compositions which exhibit large therapeutic indices are preferred.
The data obtained from cell culture assays and animal studies are used in formulating a range of
dosage for human use. The dosage contained in such compositions is preferably within a range of
circulating concentrations that include the ED50 with little or no toxicity. The dosage varies within
this range depending upon the dosage form employed, sensitivity of the patient, and the route of
administration.
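
The therapeutic index described above is a simple ratio, and a minimal sketch of the arithmetic may help. The function name and the example dose values below are hypothetical and are not taken from this specification; they only illustrate LD50/ED50.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return the therapeutic index LD50/ED50.

    Both doses must be positive and expressed in the same units.
    The name and signature are illustrative assumptions.
    """
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical values: a composition lethal to 50% of the population at
# 500 units and effective in 50% at 5 units has a therapeutic index of 100.
example_index = therapeutic_index(500.0, 5.0)
```

A larger value indicates a wider margin between the effective and the lethal dose, which is why compositions with large therapeutic indices are preferred.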

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject that requires treatment. Dosage and administration are adjusted to provide sufficient levels 
of the active moiety or to maintain the desired effect. Factors which may be taken into account 
include the severity of the disease state, general health of the subject, age, weight, and gender of the
subject, diet, time and frequency of administration, drug combination(s), reaction sensitivities, and
tolerance/response to therapy. Long-acting pharmaceutical compositions may be administered every
3 to 4 days, every week, or once every two weeks depending on half-life and clearance rate of the
particular formulation.

Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to a total dose of
about 1 g, depending upon the route of administration. Guidance as to particular dosages and
methods of delivery is provided in the literature and generally available to practitioners in the art.
Those skilled in the art will employ different formulations for nucleotides than for proteins or their
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells,
conditions, locations, etc.

Uses of Genset Sequences: Computer-Related Embodiments 

As used herein the term "cDNA codes of SEQ ID NOs:1-169, 339-455, 561-784"
encompasses the nucleotide sequences of SEQ ID NOs:1-169, 339-455, 561-784 and of clone
inserts of the deposited clone pool, fragments thereof, nucleotide sequences homologous thereto,
and sequences complementary to all of the preceding sequences. The fragments include fragments
of SEQ ID NOs:1-169, 339-455, 561-784 comprising at least 8, 10, 12, 15, 18, 20, 25, 28, 30, 35,
40, 50, 75, 100, 150, 200, 300, 400, 500, 1000 or 2000 consecutive nucleotides of SEQ ID
NOs:1-169, 339-455, 561-784. Preferably the fragments include polynucleotides described herein as
encoding polypeptides having a biological activity. Homologous sequences and fragments of SEQ
ID NOs:1-169, 339-455, 561-784 refer to a sequence having at least 99%, 98%, 97%, 96%, 95%,
90%, 85%, 80%, or 75% identity to these sequences. Identity may be determined using any of the
computer programs and parameters described herein, including BLAST2N with the default
parameters or with any modified parameters. Homologous sequences also include RNA sequences
in which uridines replace the thymines in the cDNA codes of SEQ ID NOs:1-169, 339-455,
561-784. The homologous sequences may be obtained using any of the procedures described herein or
may result from the correction of a sequencing error as described above. It will be appreciated that
the cDNA codes of SEQ ID NOs:1-169, 339-455, 561-784 can be represented in the traditional
single character format (see, e.g., the inside back cover of Stryer, 1995) or in any other format which
records the identity of the nucleotides in a sequence. 
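
The three derived forms named in this definition (complementary sequences, RNA sequences in which uridines replace thymines, and fragments of a given number of consecutive nucleotides) can be sketched for a sequence held in the single character format. The function names and the short example sequence are hypothetical and do not correspond to any SEQ ID NO; the sketch also assumes the complementary sequence is written as the reverse complement.

```python
# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(cdna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return cdna.upper().translate(_COMPLEMENT)[::-1]


def to_rna(cdna: str) -> str:
    """Return the RNA form, with uridines replacing thymines."""
    return cdna.upper().replace("T", "U")


def fragments(sequence: str, length: int) -> list:
    """Return every run of `length` consecutive characters."""
    return [sequence[i:i + length]
            for i in range(len(sequence) - length + 1)]


# Hypothetical 8-nucleotide example, not a sequence of the invention.
example = "ATGCCGTA"
example_rna = to_rna(example)
example_fragments = fragments(example, 6)
```

The same `fragments` helper applies unchanged to a polypeptide sequence written in the single letter amino acid code.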

As used herein the term "polypeptide codes of SEQ ID NOs:170-338, 456-560, 785-918"
encompasses the polypeptide sequences of SEQ ID NOs:170-338, 456-560, 785-918 which are
encoded by the cDNAs of SEQ ID NOs:1-169, 339-455, 561-784, the polypeptide sequences
encoded by the clone inserts of the deposited clone pool, polypeptide sequences homologous
thereto, or fragments of any of the preceding sequences. Homologous polypeptide sequences refer
to a polypeptide sequence having at least 99%, 98%, 97%, 96%, 95%, 90%, 85%, 80%, or 75%
identity to one of the polypeptide sequences of SEQ ID NOs:170-338, 456-560, 785-918. Identity
may be determined using any of the computer programs and parameters described herein, including
FASTA with the default parameters or with any modified parameters. The homologous sequences
may be obtained using any of the procedures described herein or may result from the correction of a
sequencing error as described above. The polypeptide fragments comprise at least 5, 6, 8, 10, 12,
15, 20, 25, 30, 35, 40, 50, 60, 75, 100, 150 or 200 consecutive amino acids of the polypeptides of
SEQ ID NOs:170-338, 456-560, 785-918. Preferably, the fragments include polypeptides described
herein as having a biological activity, or fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40,
50, 75, 100, or 150 consecutive amino acids of polypeptides described herein as having a biological
activity. It will be appreciated that the polypeptide codes of SEQ ID NOs:170-338, 456-560,
785-918 can be represented in the traditional single character format or three letter format (see, the
inside back cover of Stryer, 1995) or in any other format which relates the identity of the
polypeptides in a sequence.

It will be appreciated by those skilled in the art that the nucleic acid codes of the invention
and polypeptide codes of the invention can be stored, recorded, and manipulated on any medium
which can be read and accessed by a computer. As used herein, the words "recorded" and "stored"
refer to a process for storing information on a computer medium. A skilled artisan can readily
adopt any of the presently known methods for recording information on a computer readable
medium to generate manufactures comprising one or more of the nucleic acid codes of the
invention, or one or more of the polypeptide codes of the invention. Another aspect of the present
invention is a computer readable medium having recorded thereon at least 2, 5, 10, 15, 20, 25, 30, or
50 nucleic acid codes of the invention. Another aspect of the present invention is a computer
readable medium having recorded thereon at least 2, 5, 10, 15, 20, 25, 30, or 50 polypeptide codes
of the invention.

Computer readable media include magnetically readable media, optically readable media,
electronically readable media and magnetic/optical media. For example, the computer readable
media may be a hard disk, a floppy disk, a magnetic tape, CD-ROM, Digital Versatile Disk (DVD),
Random Access Memory (RAM), or Read Only Memory (ROM) as well as other types of
media known to those skilled in the art.

Embodiments of the present invention include systems, particularly computer systems
which store and manipulate the sequence information described herein. One example of a computer
system 100 is illustrated in block diagram form in Figure 1. As used herein, "a computer system"
refers to the hardware components, software components, and data storage components used to
analyze the nucleotide sequences of the nucleic acid codes of the invention or the amino acid
sequences of the polypeptide codes of the invention. In one embodiment, the computer system 100
is a Sun Enterprise 1000 server (Sun Microsystems, Palo Alto, CA). The computer system 100
preferably includes a processor for processing, accessing and manipulating the sequence data. The
processor 105 can be any well-known type of central processing unit, such as the Pentium III from
Intel Corporation, or similar processor from Sun, Motorola, Compaq or International Business
Machines.

Preferably, the computer system 100 is a general purpose system that comprises the
processor 105 and one or more internal data storage components 110 for storing data, and one or
more data retrieving devices for retrieving the data stored on the data storage components. A
skilled artisan can readily appreciate that any one of the currently available computer systems is
suitable.

In one particular embodiment, the computer system 100 includes a processor 105 connected
to a bus which is connected to a main memory 115 (preferably implemented as RAM) and one or
more internal data storage devices 110, such as a hard drive and/or other computer readable media
having data recorded thereon. In some embodiments, the computer system 100 further includes one
or more data retrieving devices 118 for reading the data stored on the internal data storage devices
110.

The data retrieving device 118 may represent, for example, a floppy disk drive, a compact
disk drive, a magnetic tape drive, etc. In some embodiments, the internal data storage device 110 is
a removable computer readable medium such as a floppy disk, a compact disk, a magnetic tape, etc.
containing control logic and/or data recorded thereon. The computer system 100 may
advantageously include or be programmed by appropriate software for reading the control logic
and/or the data from the data storage component once inserted in the data retrieving device.

The computer system 100 includes a display 120 which is used to display output to a
computer user. It should also be noted that the computer system 100 can be linked to other
computer systems 125a-c in a network or wide area network to provide centralized access to the
computer system 100.

Software for accessing and processing the nucleotide sequences of the nucleic acid codes of
the invention or the amino acid sequences of the polypeptide codes of the invention (such as search
tools, compare tools, and modeling tools, etc.) may reside in main memory 115 during execution.

In some embodiments, the computer system 100 may further comprise a sequence comparer
for comparing the above-described nucleic acid codes of the invention or the polypeptide codes of
the invention stored on a computer readable medium to reference nucleotide or polypeptide
sequences stored on a computer readable medium. A "sequence comparer" refers to one or more
programs which are implemented on the computer system 100 to compare a nucleotide or
polypeptide sequence with other nucleotide or polypeptide sequences and/or compounds including
but not limited to peptides, peptidomimetics, and chemicals stored within the data storage means.
For example, the sequence comparer may compare the nucleotide sequences of nucleic acid codes
of the invention or the amino acid sequences of the polypeptide codes of the invention stored on a
computer readable medium to reference sequences stored on a computer readable medium to
identify homologies, motifs implicated in biological function, or structural motifs. The various
sequence comparer programs identified elsewhere in this patent specification are particularly
contemplated for use in this aspect of the invention.

Figure 2 is a flow diagram illustrating one embodiment of a process 200 for comparing a
new nucleotide or protein sequence with a database of sequences in order to determine the
homology levels between the new sequence and the sequences in the database. The database of
sequences can be a private database stored within the computer system 100, or a public database
such as GENBANK, PIR or SWISSPROT that is available through the Internet.

The process 200 begins at a start state 201 and then moves to a state 202 wherein the new
sequence to be compared is stored to a memory in a computer system 100. As discussed above, the
memory could be any type of memory, including RAM or an internal storage device.

The process 200 then moves to a state 204 wherein a database of sequences is opened for
analysis and comparison. The process 200 then moves to a state 206 wherein the first sequence
stored in the database is read into a memory on the computer. A comparison is then performed at a
state 210 to determine if the first sequence is the same as the new sequence. It is important to
note that this step is not limited to performing an exact comparison between the new sequence and
the first sequence in the database. Methods are well known to those of skill in the art for
comparing two nucleotide or protein sequences, even if they are not identical. For example, gaps
can be introduced into one sequence in order to raise the homology level between the two tested
sequences. The parameters that control whether gaps or other features are introduced into a
sequence during comparison are normally entered by the user of the computer system.

Once a comparison of the two sequences has been performed at the state 210, a
determination is made at a decision state 212 whether the two sequences are the same. Of course,
the term "same" is not limited to sequences that are absolutely identical. Sequences that are within
the homology parameters entered by the user will be marked as "same" in the process 200.

If a determination is made that the two sequences are the same, the process 200 moves to a
state 214 wherein the name of the sequence from the database is displayed to the user. This state
notifies the user that the sequence with the displayed name fulfills the homology constraints that
were entered. Once the name of the stored sequence is displayed to the user, the process 200 moves
to a decision state 218 wherein a determination is made whether more sequences exist in the
database. If no more sequences exist in the database, then the process 200 terminates at an end state
220. However, if more sequences do exist in the database, then the process 200 moves to a state
224 wherein a pointer is moved to the next sequence in the database so that it can be compared to
the new sequence. In this manner, the new sequence is aligned and compared with every sequence
in the database.

It should be noted that if a determination had been made at the decision state 212 that the
sequences were not homologous, then the process 200 would move immediately to the decision
state 218 in order to determine if any other sequences were available in the database for
comparison.
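
The loop of process 200 can be sketched as follows. This is only an illustration under stated assumptions: the database is modeled as an in-memory mapping of names to sequences, the function names are hypothetical, and the similarity score from Python's standard `difflib.SequenceMatcher` stands in for a real homology program such as those named elsewhere in this specification. It is not BLAST or FASTA and does not reproduce their scoring.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, List


def similarity(a: str, b: str) -> float:
    """Stand-in comparer: a score between 0.0 and 1.0 (assumption)."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def search_database(new_sequence: str,
                    database: Dict[str, str],
                    threshold: float,
                    compare: Callable[[str, str], float] = similarity
                    ) -> List[str]:
    """Return the names of stored sequences judged the "same"."""
    hits = []
    # States 206 and 224: read the first, then each next, stored sequence.
    for name, stored in database.items():
        # States 210 and 212: compare, then decide against the
        # homology parameter entered by the user.
        if compare(new_sequence, stored) >= threshold:
            # State 214: report the name of the matching sequence.
            hits.append(name)
    # States 218 and 220: no more sequences, so the process ends.
    return hits


# Hypothetical database entries, not sequences of the invention.
demo_db = {"seqA": "ATGCATGC", "seqB": "ATGCATGG", "seqC": "TTTTTTTT"}
demo_hits = search_database("ATGCATGC", demo_db, threshold=0.8)
```

Passing the comparer as a parameter reflects the statement above that the comparison is not limited to an exact match; any function returning a score can be substituted.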

Accordingly, one aspect of the present invention is a computer system comprising a
processor, a data storage device having stored thereon a nucleic acid code of the invention or a
polypeptide code of the invention, a data storage device having retrievably stored thereon reference
nucleotide sequences or polypeptide sequences to be compared to the nucleic acid code of the
invention or polypeptide code of the invention and a sequence comparer for conducting the
comparison. The sequence comparer may indicate a homology level between the sequences
compared or identify motifs implicated in biological function and structural motifs in the nucleic
acid code of the invention and polypeptide codes of the invention or it may identify structural
motifs in sequences which are compared to these nucleic acid codes and polypeptide codes. In
some embodiments, the data storage device may have stored thereon the sequences of at least 2, 5,
10, 15, 20, 25, 30, or 50 of the nucleic acid codes of the invention or polypeptide codes of the
invention.

Another aspect of the present invention is a method for determining the level of homology
between a nucleic acid code of the invention and a reference nucleotide sequence, comprising the
steps of reading the nucleic acid code and the reference nucleotide sequence through the use of a
computer program which determines homology levels and determining homology between the
nucleic acid code and the reference nucleotide sequence with the computer program. The computer
program may be any of a number of computer programs for determining homology levels, including
those specifically enumerated herein, including BLAST2N with the default parameters or with any
modified parameters. The method may be implemented using the computer systems described
above. The method may also be performed by reading 2, 5, 10, 15, 20, 25, 30, or 50 of the above
described nucleic acid codes of the invention through the use of the computer program and
determining homology between the nucleic acid codes and reference nucleotide sequences.

Figure 3 is a flow diagram illustrating one embodiment of a process 250 in a computer for
determining whether two sequences are homologous. The process 250 begins at a start state 252 and
then moves to a state 254 wherein a first sequence to be compared is stored to a memory. The
second sequence to be compared is then stored to a memory at a state 256. The process 250 then
moves to a state 260 wherein the first character in the first sequence is read and then to a state 262
wherein the first character of the second sequence is read. It should be understood that if the
sequence is a nucleotide sequence, then the character would normally be either A, T, C, G or U. If
the sequence is a protein sequence, then it should be in the single letter amino acid code so that the
first and second sequences can be easily compared.

A determination is then made at a decision state 264 whether the two characters are the
same. If they are the same, then the process 250 moves to a state 268 wherein the next characters in
the first and second sequences are read. A determination is then made whether the next characters
are the same. If they are, then the process 250 continues this loop until two characters are not the
same. If a determination is made that the next two characters are not the same, the process 250
moves to a decision state 274 to determine whether there are any more characters in either sequence to
read.

If there are no more characters to read, then the process 250 moves to a state 276 wherein
the level of homology between the first and second sequences is displayed to the user. The level of
homology is determined by calculating the proportion of characters between the sequences that
were the same out of the total number of characters in the first sequence. Thus, if every character in
a first 100 nucleotide sequence aligned with every character in a second sequence, the homology
level would be 100%. 
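
A minimal sketch of the character-by-character comparison of process 250 follows. It assumes the two sequences are compared position by position without gaps and that matches are counted over all positions read, with the result expressed as a percentage of the length of the first sequence, as stated above. The function name is hypothetical.

```python
def homology_level(first: str, second: str) -> float:
    """Percentage of characters in `first` matched by `second`.

    Ungapped, position-by-position comparison (an assumption of
    this sketch); the denominator is the length of `first`.
    """
    if not first:
        raise ValueError("the first sequence must not be empty")
    matches = 0
    position = 0
    # States 260 to 274: read one character from each sequence while
    # both sequences still have characters left to read.
    while position < len(first) and position < len(second):
        # Decision state 264: are the two characters the same?
        if first[position] == second[position]:
            matches += 1
        position += 1
    # State 276: report the homology level.
    return 100.0 * matches / len(first)
```

For example, two identical sequences give 100.0, while `homology_level("ATGC", "ATGG")` gives 75.0 because three of the four positions agree.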

Alternatively, the computer program may be a computer program which compares the
nucleotide sequences of the nucleic acid codes of the present invention to reference nucleotide
sequences in order to determine whether the nucleic acid code of the invention differs from a
reference nucleic acid sequence at one or more positions. Optionally such a program records the
length and identity of inserted, deleted or substituted nucleotides with respect to the sequence of
either the reference polynucleotide or the nucleic acid code of the invention. In one embodiment,
the computer program may be a program which determines whether the nucleotide sequences of the
nucleic acid codes of the invention contain one or more single nucleotide polymorphisms (SNP)
with respect to a reference nucleotide sequence. These single nucleotide polymorphisms may each
comprise a single base substitution, insertion, or deletion. 
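
A program of this kind can be sketched with Python's standard `difflib.SequenceMatcher`, whose `get_opcodes()` method reports replaced, deleted and inserted runs between two strings. This is an assumption-laden illustration: `difflib` is a general text-matching heuristic rather than a biological alignment tool, it can misplace differences in repetitive sequences, and the function name and example sequences are hypothetical.

```python
from difflib import SequenceMatcher
from typing import List, Tuple

# Map difflib operation tags to the terms used in the text.
_KINDS = {"replace": "substitution",
          "delete": "deletion",
          "insert": "insertion"}


def find_differences(reference: str,
                     sample: str) -> List[Tuple[str, int, str, str]]:
    """List (kind, reference position, reference bases, sample bases).

    Positions are 0-based offsets into `reference`.
    """
    differences = []
    matcher = SequenceMatcher(None, reference, sample, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            continue  # identical run, nothing to record
        differences.append(
            (_KINDS[tag], i1, reference[i1:i2], sample[j1:j2]))
    return differences


# Hypothetical sequences: the sample carries A in place of T at offset 3.
demo_diffs = find_differences("ACGTTGCA", "ACGATGCA")
```

The length of each recorded run is available as the length of the returned base strings, which corresponds to the optional recording of length and identity described above.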

Another aspect of the present invention is a method for determining the level of homology
between a polypeptide code of the invention and a reference polypeptide sequence, comprising the
steps of reading the polypeptide code of the invention and the reference polypeptide sequence
through use of a computer program which determines homology levels and determining homology
between the polypeptide code and the reference polypeptide sequence using the computer program.

Accordingly, another aspect of the present invention is a method for determining whether a
nucleic acid code of the invention differs at one or more nucleotides from a reference nucleotide
sequence comprising the steps of reading the nucleic acid code and the reference nucleotide
sequence through use of a computer program which identifies differences between nucleic acid
sequences and identifying differences between the nucleic acid code and the reference nucleotide
sequence with the computer program. In some embodiments, the computer program is a program
which identifies single nucleotide polymorphisms. The method may be implemented by the
computer systems described above and the method illustrated in Figure 3. The method may also be
performed by reading at least 2, 5, 10, 15, 20, 25, 30, or 50 of the nucleic acid codes of the
invention and the reference nucleotide sequences through the use of the computer program and
identifying differences between the nucleic acid codes and the reference nucleotide sequences with
the computer program.

In other embodiments the computer based system may further comprise an identifier for
identifying features within the nucleotide sequences of the nucleic acid codes of the invention or the
amino acid sequences of the polypeptide codes of the invention. An "identifier" refers to one or
more programs which identify certain features within the above-described nucleotide sequences of
the nucleic acid codes of the invention or the amino acid sequences of the polypeptide codes of the
invention. In one embodiment, the identifier may comprise a program which identifies an open
reading frame in the cDNA codes of the invention.
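
As a hedged illustration of such an identifier, the sketch below scans the three forward reading frames of a sequence for an ATG initiation codon followed in frame by a stop codon. It assumes the standard stop codons TAA, TAG and TGA, ignores the complementary strand, and uses a hypothetical function name, minimum length parameter and example sequence; it is not the program of any embodiment.

```python
from typing import List, Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def find_orfs(sequence: str, min_codons: int = 2) -> List[Tuple[int, int]]:
    """Return (start, end) offsets of forward-strand open reading frames.

    `start` is the offset of the ATG and `end` is one past the stop
    codon. `min_codons` counts codons before the stop, ATG included.
    """
    sequence = sequence.upper()
    orfs = []
    for frame in range(3):
        i = frame
        while i + 3 <= len(sequence):
            if sequence[i:i + 3] == "ATG":
                # Walk codon by codon looking for an in-frame stop.
                for j in range(i, len(sequence) - 2, 3):
                    if sequence[j:j + 3] in STOP_CODONS:
                        if (j - i) // 3 >= min_codons:
                            orfs.append((i, j + 3))
                        i = j  # resume scanning after this stop codon
                        break
            i += 3
    return orfs


# Hypothetical sequence containing ATG AAA TTT TAG starting at offset 2.
demo_orfs = find_orfs("CCATGAAATTTTAGCC")
```

An ATG with no in-frame stop codon before the end of the sequence is not reported by this sketch.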

Figure 4 is a flow diagram illustrating one embodiment of an identifier process 300 for
detecting the presence of a feature in a sequence. The process 300 begins at a start state 302 and
then moves to a state 304 wherein a first sequence that is to be checked for features is stored to a
memory 115 in the computer system 100. The process 300 then moves to a state 306 wherein a
database of sequence features is opened. Such a database would include a list of each feature's
attributes along with the name of the feature. For example, a feature name could be "Initiation
Codon" and the attribute would be "ATG". Another example would be the feature name
"TAATAA Box" and the feature attribute would be "TAATAA". An example of such a database is
produced by the University of Wisconsin Genetics Computer Group (www.gcg.com).

Once the database of features is opened at the state 306, the process 300 moves to a state
308 wherein the first feature is read from the database. A comparison of the attribute of the first
feature with the first sequence is then made at a state 310. A determination is then made at a
decision state 316 whether the attribute of the feature was found in the first sequence. If the
attribute was found, then the process 300 moves to a state 318 wherein the name of the found
feature is displayed to the user.

The process 300 then moves to a decision state 320 wherein a determination is made
whether more features exist in the database. If no more features exist, then the process 300
terminates at an end state 324. However, if more features do exist in the database, then the process
300 reads the next sequence feature at a state 326 and loops back to the state 310 wherein the
attribute of the next feature is compared against the first sequence.

It should be noted that if the feature attribute is not found in the first sequence at the
decision state 316, the process 300 moves directly to the decision state 320 in order to determine if
any more features exist in the database.
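
The identifier loop of process 300 can be sketched with the two example features given above. The feature database is modeled here as a plain mapping from feature name to attribute string, which is an assumption of this sketch, as are the function name and the test sequences.

```python
from typing import Dict, List


def identify_features(sequence: str,
                      feature_db: Dict[str, str]) -> List[str]:
    """Return the names of features whose attribute occurs in `sequence`."""
    found = []
    # States 308 and 326: read the first, then each next, feature.
    for name, attribute in feature_db.items():
        # States 310 and 316: compare the attribute with the sequence.
        if attribute in sequence:
            # State 318: report the name of the found feature.
            found.append(name)
    # States 320 and 324: no more features, so the process ends.
    return found


# The two example features named in the text.
features = {"Initiation Codon": "ATG", "TAATAA Box": "TAATAA"}
demo_found = identify_features("TTTAATAATTATGC", features)
```

Attributes are matched here as exact substrings; a database whose attributes are patterns rather than literal strings would need a pattern matcher in place of the `in` test.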

In another embodiment, the identifier may comprise a molecular modeling program which
determines the 3-dimensional structure of the polypeptide codes of the invention. Such programs
may use any methods known to those skilled in the art including methods based on
homology-modeling, fold recognition and ab initio methods as described in Sternberg et al., 1999, which
disclosure is hereby incorporated by reference in its entirety. In some embodiments, the molecular
modeling program identifies target sequences that are most compatible with profiles representing
the structural environments of the residues in known three-dimensional protein structures. (See,
e.g., Eisenberg et al., U.S. Patent No. 5,436,850 issued July 25, 1995, which disclosure is hereby
incorporated by reference in its entirety). In another technique, the known three-dimensional
structures of proteins in a given family are superimposed to define the structurally conserved
regions in that family. This protein modeling technique also uses the known three-dimensional
structure of a homologous protein to approximate the structure of the polypeptide codes of the
invention. (See e.g., Srinivasan, et al., U.S. Patent No. 5,557,535 issued September 17, 1996, which
disclosure is hereby incorporated by reference in its entirety). Conventional homology modeling
techniques have been used routinely to build models of proteases and antibodies. (Sowdhamini et
al., (1997)). Comparative approaches can also be used to develop three-dimensional protein models
when the protein of interest has poor sequence identity to template proteins. In some cases, proteins
fold into similar three-dimensional structures despite having very weak sequence identities. For
example, a number of helical cytokines fold into similar three-dimensional
topologies in spite of weak sequence homology.

The recent development of threading methods now enables the identification of likely
folding patterns in a number of situations where the structural relatedness between target and
template(s) is not detectable at the sequence level. In hybrid methods, fold recognition is
performed using Multiple Sequence Threading (MST), structural equivalencies are deduced from
the threading output using a distance geometry program DRAGON to construct a low resolution
model, and a full-atom representation is constructed using a molecular modeling package such as
QUANTA.

According to this 3-step approach, candidate templates are first identified by using the
novel fold recognition algorithm MST, which is capable of performing simultaneous threading of
multiple aligned sequences onto one or more 3-D structures. In a second step, the structural
equivalencies obtained from the MST output are converted into interresidue distance restraints and
fed into the distance geometry program DRAGON, together with auxiliary information obtained
from secondary structure predictions. The program combines the restraints in an unbiased manner
and rapidly generates a large number of low resolution model conformations. In a third step, these
low resolution model conformations are converted into full-atom models and subjected to energy
minimization using the molecular modeling package QUANTA. (See e.g., Aszodi et al., (1997)).

The results of the molecular modeling analysis may then be used in rational drug design
techniques to identify agents which modulate the activity of the polypeptide codes of the invention.

Accordingly, another aspect of the present invention is a method of identifying a feature
within the nucleic acid codes of the invention or the polypeptide codes of the invention comprising
reading the nucleic acid code(s) or the polypeptide code(s) through the use of a computer program
which identifies features therein and identifying features within the nucleic acid code(s) or
polypeptide code(s) with the computer program. In one embodiment, the computer program comprises
a computer program which identifies open reading frames. In a further embodiment, the computer
program identifies linear or structural motifs in a polypeptide sequence. In another embodiment,
the computer program comprises a molecular modeling program. The method may be performed by
reading a single sequence or at least 2, 5, 10, 15, 20, 25, 30, or 50 of the nucleic acid codes of the
invention or the polypeptide codes of the invention through the use of the computer program and
identifying features within the nucleic acid codes or polypeptide codes with the computer program.

The nucleic acid codes of the invention or the polypeptide codes of the invention may be
stored and manipulated in a variety of data processor programs in a variety of formats. For
example, they may be stored as text in a word processing file, such as Microsoft WORD or
WORDPERFECT or as an ASCII file in a variety of database programs familiar to those of skill in
the art, such as DB2, SYBASE, or ORACLE. In addition, many computer programs and databases
may be used as sequence comparers, identifiers, or sources of reference nucleotide or polypeptide
sequences to be compared to the nucleic acid codes of the invention or the polypeptide codes of the
invention. The following list is intended not to limit the invention but to provide guidance to
programs and databases which are useful with the nucleic acid codes of the invention or the
polypeptide codes of the invention. The programs and databases which may be used include, but
are not limited to: MacPattern (EMBL), DiscoveryBase (Molecular Applications Group), GeneMine
(Molecular Applications Group), Look (Molecular Applications Group), MacLook (Molecular
Applications Group), BLAST and BLAST2 (NCBI), BLASTN and BLASTX (Altschul et al., 1990),
FASTA (Pearson and Lipman, 1988), FASTDB (Brutlag et al., 1990), Catalyst (Molecular
Simulations Inc.), Catalyst/SHAPE (Molecular Simulations Inc.), Cerius2.DBAccess (Molecular
Simulations Inc.), HypoGen (Molecular Simulations Inc.), Insight II (Molecular Simulations Inc.),
Discover (Molecular Simulations Inc.), CHARMm (Molecular Simulations Inc.), Felix (Molecular
Simulations Inc.), DelPhi (Molecular Simulations Inc.), QuanteMM (Molecular Simulations Inc.),
Homology (Molecular Simulations Inc.), Modeler (Molecular Simulations Inc.), ISIS (Molecular
Simulations Inc.), Quanta/Protein Design (Molecular Simulations Inc.), WebLab (Molecular
Simulations Inc.), WebLab Diversity Explorer (Molecular Simulations Inc.), Gene Explorer
(Molecular Simulations Inc.), SeqFold (Molecular Simulations Inc.), the EMBL/Swissprotein
database, the MDL Available Chemicals Directory database, the MDL Drug Data Report database,
the Comprehensive Medicinal Chemistry database, Derwent's World Drug Index database, the
BioByteMasterFile database, the Genbank database, and the Genseqn database. Many other
programs and databases would be apparent to one of skill in the art given the present disclosure.

Motifs which may be detected using the above programs include sequences encoding
leucine zippers, helix-turn-helix motifs, glycosylation sites, ubiquitination sites, alpha helices, and
beta sheets, signal sequences encoding signal peptides which direct the secretion of the encoded
proteins, sequences implicated in transcription regulation such as homeoboxes, acidic stretches,
enzymatic active sites, substrate binding sites, and enzymatic cleavage sites.
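
As one hedged example of motif detection, the sketch below searches a polypeptide sequence in the single letter code for candidate N-glycosylation sites. It assumes the widely used consensus pattern N-{P}-[ST]-{P} (asparagine, any residue except proline, serine or threonine, any residue except proline). That pattern, the function name and the example peptide are not taken from this specification, and a match indicates only a candidate site.

```python
import re
from typing import List

# N, then not P, then S or T, then not P. The lookahead lets
# overlapping candidate sites be reported individually.
_N_GLYC = re.compile(r"N(?=[^P][ST][^P])")


def n_glycosylation_sites(polypeptide: str) -> List[int]:
    """Return 0-based offsets of asparagines in candidate sites."""
    return [m.start() for m in _N_GLYC.finditer(polypeptide.upper())]


# Hypothetical peptide: the N at offset 2 is followed by G, S, A and
# matches; the N at offset 7 is followed by P and does not.
demo_sites = n_glycosylation_sites("MKNGSAANPSL")
```

Other linear motifs in the list above could be expressed the same way, each with its own pattern.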

Conclusion 

As discussed above, the GENSET polynucleotides and polypeptides of the present
invention or fragments thereof can be used for various purposes. The polynucleotides can be used
to express recombinant protein for analysis, characterization or therapeutic use; as markers for
tissues in which the corresponding protein is preferentially expressed (either constitutively or at a
particular stage of tissue differentiation or development or in disease states); as molecular weight
markers on Southern gels; as chromosome markers or tags (when labeled) to identify chromosomes
or to map related gene positions; as a reagent (including a labeled reagent) in assays designed to
quantitatively determine levels of GENSET expression in biological samples; to compare with
endogenous DNA sequences in patients to identify potential genetic disorders; as probes to
hybridize and thus discover novel, related DNA sequences; as a source of information to derive
PCR primers for genetic fingerprinting; for selecting and making oligomers for attachment to a
"gene chip" or other support, including for examination for expression patterns; to raise anti-protein
antibodies using DNA immunization techniques; and as an antigen to raise anti-DNA antibodies or
elicit another immune response. Where the polynucleotide encodes a protein which binds or
potentially binds to another protein (such as, for example, in a receptor-ligand interaction), the
polynucleotide can also be used in interaction trap assays (such as, for example, that described in
Gyuris et al., (1993)) to identify polynucleotides encoding the other protein with which binding
occurs or to identify inhibitors of the binding interaction.






WO 02/083898 PCT/IB01/00914

The proteins or polypeptides provided by the present invention can similarly be used in
assays to determine biological activity, including in a panel of multiple proteins for high-throughput
screening; to raise antibodies or to elicit another immune response; as a reagent (including the
labeled reagent) in assays designed to quantitatively determine levels of the protein (or its receptor)
in biological fluids; as markers for tissues in which the corresponding protein is preferentially
expressed (either constitutively or at a particular stage of tissue differentiation or development or in
a disease state); and, of course, to isolate correlative receptors or ligands. Where the protein binds
or potentially binds to another protein (such as, for example, in a receptor-ligand interaction), the
protein can be used to identify the other protein with which binding occurs or to identify inhibitors
of the binding interaction. Proteins involved in these binding interactions can also be used to screen
for peptide or small molecule inhibitors or agonists of the binding interaction.

Any or all of these research utilities are capable of being developed into reagent grade or kit 
format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in the art.
References disclosing such methods include without limitation "Molecular Cloning: A Laboratory
Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E.F. Fritsch and T. Maniatis
eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning Techniques", Academic
Press, Berger and Kimmel eds., 1987, which disclosures are hereby incorporated by reference in
their entireties.

Polynucleotides and proteins of the present invention can also be used as nutritional sources
or supplements. Such uses include without limitation use as a protein or amino acid supplement,
use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In such cases
the protein or polynucleotide of the invention can be added to the feed of a particular organism or
can be administered as a separate solid or liquid preparation, such as in the form of powder, pills,
solutions, suspensions or capsules. In the case of microorganisms, the protein or polynucleotide of
the invention can be added to the medium in or on which the microorganism is cultured.

Although this invention has been described in terms of certain preferred embodiments,
other embodiments which will be apparent to those of ordinary skill in the art in view of the
disclosure herein are also within the scope of this invention. Accordingly, the scope of the
invention is intended to be defined only by reference to the appended claims.

Examples 

Preparation of Antibody Compositions to the GENSET protein 

Substantially pure protein or polypeptide is isolated from transfected or transformed cells
containing an expression vector encoding the GENSET protein or a portion thereof. The
concentration of protein in the final preparation is adjusted, for example, by concentration on an
Amicon filter device, to the level of a few micrograms/ml. Monoclonal or polyclonal antibody to the
protein can then be prepared as follows:

A. Monoclonal Antibody Production by Hybridoma Fusion

Monoclonal antibody to epitopes in the GENSET protein or a portion thereof can be
prepared from murine hybridomas according to the classical method of Kohler and Milstein, (1975)
or derivative methods thereof. Also see Harlow and Lane (1988).

Briefly, a mouse is repetitively inoculated with a few micrograms of the GENSET protein
or a portion thereof over a period of a few weeks. The mouse is then sacrificed, and the
antibody-producing cells of the spleen isolated. The spleen cells are fused by means of polyethylene glycol
with mouse myeloma cells, and the excess unfused cells destroyed by growth of the system on
selective media comprising aminopterin (HAT media). The successfully fused cells are diluted and
aliquots of the dilution placed in wells of a microtiter plate where growth of the culture is
continued. Antibody-producing clones are identified by detection of antibody in the supernatant
fluid of the wells by immunoassay procedures, such as ELISA, as originally described by Engvall,
(1980), which disclosure is hereby incorporated by reference in its entirety, and derivative methods
thereof. Selected positive clones can be expanded and their monoclonal antibody product harvested
for use. Detailed procedures for monoclonal antibody production are described in Davis, et al.
(1986) Section 21-2.

B. Polyclonal Antibody Production by Immunization

Polyclonal antiserum containing antibodies to heterogeneous epitopes in the GENSET
protein or a portion thereof can be prepared by immunizing a suitable non-human animal with the
GENSET protein or a portion thereof, which can be unmodified or modified to enhance
immunogenicity. A suitable non-human animal, preferably a non-human mammal, is selected,
usually a mouse, rat, rabbit, goat, or horse. Alternatively, a crude preparation which has been
enriched for GENSET concentration can be used to generate antibodies. Such proteins, fragments
or preparations are introduced into the non-human mammal in the presence of an appropriate
adjuvant (e.g. aluminum hydroxide, RIBI, etc.) which is known in the art. In addition the protein,
fragment or preparation can be pretreated with an agent which will increase antigenicity; such
agents are known in the art and include, for example, methylated bovine serum albumin (mBSA),
bovine serum albumin (BSA), Hepatitis B surface antigen, and keyhole limpet hemocyanin (KLH).
Serum from the immunized animal is collected, treated and tested according to known procedures.
If the serum contains polyclonal antibodies to undesired epitopes, the polyclonal antibodies can be
purified by immunoaffinity chromatography.

Effective polyclonal antibody production is affected by many factors related both to the
antigen and the host species. Also, host animals vary in response to site of inoculations and dose,
with both inadequate and excessive doses of antigen resulting in low titer antisera. Small doses (ng
level) of antigen administered at multiple intradermal sites appear to be most reliable. Techniques
for producing and processing polyclonal antisera are known in the art. An effective immunization
protocol for rabbits can be found in Vaitukaitis et al. (1971), which disclosure is hereby
incorporated by reference in its entirety.

Booster injections can be given at regular intervals, and antiserum harvested when antibody
titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion in agar
against known concentrations of the antigen, begins to fall. See, for example, Ouchterlony et al.,
(1973), which disclosure is hereby incorporated by reference in its entirety. Plateau concentration
of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum (about 12 µM). Affinity of the
antisera for the antigen is determined by preparing competitive binding curves, as described, for
example, by Fisher (1980), which disclosure is hereby incorporated by reference in its entirety.

Antibody preparations prepared according to either the monoclonal or the polyclonal
protocol are useful in quantitative immunoassays which determine concentrations of
antigen-bearing substances in biological samples; they are also used semi-quantitatively or qualitatively to
identify the presence of antigen in a biological sample. The antibodies may also be used in
therapeutic compositions for killing cells expressing the protein or reducing the levels of the protein
in the body.

TABLE I

SEQ ID NO   SEQ ID NO   SEQ ID NO   SEQ ID NO   Clone ID
from        in present  in present  in present
priority    application application application
application (nucl.)     (prt)       (nucl.)
(nucl.)

37                                  NUC561      486589;523280
51                                  NUC562      500707076;609080;642931
179                                 NUC563      147941;153834;193100
180                                 NUC564      147941
183                                 NUC565      100038;100419;100523;100546
201                                 NUC566      528046
326                                 NUC567      484303
362                                 NUC568      211034;211122
440                                 NUC569      627628
452                                 NUC570      500713596;500733538;500741977
483                                 NUC571      500730326
500                                 NUC572      482482
505                                 NUC573      611492
528                                 NUC574      221311
573                                 NUC575      144150
574                                 NUC576      626500
587                                 NUC577      490129
588                                 NUC578      129471
593         NUC86       PRT255      NUC406      500762786
599                                 NUC579      148991;206343;211039
603                                 NUC580      212418
621                                 NUC581      395370
628                                 NUC582      116680
653                                 NUC583      224425
670                                 NUC584      225626
678         NUC87       PRT256      NUC407      822794
678         NUC88       PRT257      NUC407      337572
                                                500760143
[illegible]
                                                131662;131668;177901;200591
                                    NUC590
812                                 NUC591      500694179
940                                 NUC592      158339;213121;220652;236981;
                                                239275;239598;244360;582565
988                                 NUC593      238123;239495;334569;582920
                                    NUC594      334818
1036                                NUC595      483794;519407;633595;633902
                                    NUC596
1151
                                                500743552;500744660
                                                101006;509431
                                                106061
                                    NUC607      654607
                                                172048;172057
                                                101090
                                    NUC610      119222;119491;151036
2843
                                                619446;619452;633712
                                                145573
2932
                                                650981
                                                130068
                                    NUC615      100966;99482
                                                147041
                                                132294;132317
                                    NUC618      126303;134231;135124
                                                625238;632902;635258
3633                                NUC621
                                                131690;588080
[illegible]
                                                538182;620818;625154;628241
                                                1000943975
                                                1000771934
                        PRT184      NUC352      500742698
4908                                NUC636      [illegible]
4943
                                                241215
4947        NUC16       PRT185      NUC353      201980
4947        NUC17       PRT186      NUC353      198002
4976                                NUC638      248323;248866
5000        NUC18       PRT187      NUC354      500739047
5002        NUC19       PRT188      NUC355      1000904024
5005                                NUC639      155986;222313;237229
5011        NUC20       PRT189      NUC356      125817
5040                                NUC640      646668
5058                                NUC641      224715
5071        NUC21       PRT190      NUC357      147648
5089        NUC22       PRT191      NUC358      1000839315
5117        NUC23       PRT192      NUC359      122473
5141        NUC24       PRT193      NUC360      585770
5141        NUC25       PRT194      NUC360      123996
5162        NUC26       PRT195      NUC361      1000904064
5167        NUC27       PRT196      NUC362      482181
5178        NUC28       PRT197      NUC363      500731597
5192        NUC29       PRT198      NUC364      581232
5214                                NUC642      394359
5230        NUC30       PRT199      NUC365      613647
5240        NUC31       PRT200      NUC366      715437
5250        NUC32       PRT201      NUC367      1000878517
5262        NUC33       PRT202      NUC368      544474
5270        NUC34       PRT203      NUC369      143880
5278        NUC35       PRT204      NUC370      1000853793
5358        NUC36       PRT205      NUC371      500732568
5453        NUC37       PRT206      NUC372      427150
5453        NUC38       PRT207      NUC372      593306
5453        NUC39       PRT208      NUC372      593993
5453        NUC40       PRT209      NUC372      590939
5453                                NUC372      432874;435627
5494        NUC41       PRT210      NUC373      155600
5494        NUC42       PRT211      NUC373      641537
5499                                NUC643      500702809
5533        NUC43       PRT212      NUC374      1000872335
5563        NUC44       PRT213      NUC375      1000852500
5609        NUC45       PRT214      NUC376      500720555
5657        NUC46       PRT215      NUC377      500715373
5691        NUC47       PRT216      NUC378      167435
5748        NUC48       PRT217      NUC379      520429
5748        NUC49       PRT218      NUC379      613335
5806        NUC50       PRT219      NUC380      589848
5806        NUC51       PRT220      NUC380      211883
5806        NUC52       PRT221      NUC380      642603
5806        NUC53       PRT222      NUC380      193316
5816        NUC54       PRT223      NUC381      495917
5824        NUC55       PRT224      NUC382      160935
5861        NUC56       PRT225      NUC383      593736
5885        NUC57       PRT226      NUC384      613887
5913        NUC58       PRT227      NUC385      166601
5947        NUC96       PRT265      NUC413      654627
5966        NUC59       PRT228      NUC386      500762665
5966        NUC60       PRT229      NUC386      500742089
5966        NUC61       PRT230      NUC386      500759088
5970                                NUC644      193675;423656
5974        NUC62       PRT231      NUC387      650666
5983                                NUC645      626803
5985        NUC63       PRT232      NUC388      594066
6011        NUC97       PRT266      NUC414      1000886279
6080        NUC64       PRT233      NUC389      642569
            NUC65       PRT234      NUC390      519656
6108                                NUC646      145580
6159        NUC66       PRT235      NUC391      1000903258
6231        NUC67       PRT236      NUC392      715579
6238                                NUC647      500694849;500699591;500706028;500710562;
                                                500724984;625642;628058;633030
6252        NUC98       PRT267      NUC415      1000855876
6283        NUC68       PRT237      NUC393      820495
6290        NUC69       PRT238      NUC394      500709853
6290        NUC70       PRT239      NUC394      500757399
6290        NUC71       PRT240      NUC394      592868
6322        NUC72       PRT241      NUC395      500739746
6329                                NUC648      615173
6334                                NUC649      237324
6345        NUC73       PRT242      NUC396      500714172
6345        NUC74       PRT243      NUC396      500716683
6350        NUC75       PRT244      NUC397      1000869553
6358        NUC76       PRT245      NUC398      608537
6384        NUC77       PRT246      NUC399      1000906334
6400                                NUC650      149691
6418                                NUC651      237026
6431        NUC78       PRT247      NUC400      614334
6453        NUC99       PRT268      NUC416      211056
6636                                NUC652      608607
6660                                NUC653      129407
6688        NUC100      PRT269      NUC417      646099
6727        NUC79       PRT248      NUC401      199782
6727        NUC80       PRT249      NUC401      821212
6727        NUC81       PRT250      NUC401      202863
6835        NUC101      PRT270      NUC418      158243
6865                                NUC654      612052
6892        NUC102      PRT271      NUC419      153261
6892        NUC103      PRT272      NUC419      650872
6892        NUC104      PRT273      NUC419      599054
6892        NUC105      PRT274      NUC419      152042
6892        NUC106      PRT275      NUC419      493328
7000        NUC107      PRT276      NUC420      538694
7041                                NUC655      142587;145561;146609;149065;153394;
                                                153773;205319;206906;215376;227424;
                                                228016;240538;242510;530873;588304
7533                                NUC656      500758154
7535                                NUC657      632835
7577                                NUC658      205411
7697        NUC108      PRT277      NUC421      653966
7712        NUC109      PRT278      NUC422      237552
7712                                NUC422      202997;206456
8009        NUC110      PRT279      NUC423      645452
8078        NUC111      PRT280      NUC424      335367
8078        NUC112      PRT281      NUC424      334488
8078        NUC113      PRT282      NUC424      329736
8078        NUC114      PRT283      NUC424      244355
8078        NUC115      PRT284      NUC424      150197
8078        NUC116      PRT285      NUC424      244242
8078        NUC117      PRT286      NUC424      223147
8078        NUC118      PRT287      NUC424      221735
8078        NUC119      PRT288      NUC424      215414
8078        NUC120      PRT289      NUC424      149875
8078        NUC121      PRT290      NUC424      167198
8078        NUC122      PRT291      NUC424      193511
8078        NUC123      PRT292      NUC424      226917
8078        NUC124      PRT293      NUC424      225461
8078        NUC125      PRT294      NUC424      193742
8078                                NUC424      165071;165245;200864;221825;
                                                243230;581542
8079                                NUC659      628867
8097                                NUC660      486772;511180
8166        NUC126      PRT295      NUC425      642948
8166        NUC127      PRT296      NUC425      638743
8262                                NUC661      151662
8341                                NUC662      101420
8534                                NUC663      131658;196152;243686
8666        NUC128      PRT297      NUC426      763024
8666        NUC129      PRT298      NUC426      500720430
8671                                NUC664      193411
8744                                NUC665      162906
8968        NUC82       PRT251      NUC402      771827
8968        NUC130      PRT299      NUC402      500695719
8968                                NUC402      620376;635045
8994                                NUC666      199362;227277;242546
9297                                NUC667      651871
9327                                NUC668      247810
9332                                NUC669      199155;200810;336623
                                    NUC670
9407                                NUC671      106061
9668                                NUC672      168218;197771;205623;228775;
                                                238794
9679        NUC131      PRT300      NUC427      206381
9755                                NUC673      197091
9868                                NUC674      144783;206407;215714;234057;
                                                336758;582582
10044                               NUC675      107768;111854;500721812;500723626;
                                                500723636;500724389;500725580;500729834;
                                                500735442;500735787;500758255;500762395;
                                                586703;589397;612312;635730;642849;
                                                645812;762987;767609
10322       NUC132      PRT301      NUC428      200895
10526       NUC133      PRT302      NUC429      1000891255
10584                               NUC676      500745219
10650                               NUC677      187889;242499
10739       NUC134      PRT303      NUC430      637548
10743       NUC135      PRT304      NUC431      767426
10744       NUC136      PRT305      NUC432      500691428
10761                               NUC678      131060
10880       NUC137      PRT306      NUC433      116153
10942       NUC138      PRT307      NUC434      500699885
10942       NUC139      PRT308      NUC434      746303
10942       NUC140      PRT309      NUC434      500705937
10942                               NUC434      500705002;500712632;633931;
                                                634489;813634;816859
11019       NUC141      PRT310      NUC435      150568
11278       NUC142      PRT311      NUC436      495638
11342       NUC143      PRT312      NUC437      143196
11562                               NUC679      187543
11688                               NUC680      165419;165544;166387;181924;
                                                181930;196904;199001;199269;
                                                224447;238731;243770
11735       NUC144      PRT313      NUC438      633418
11735       NUC145      PRT314      NUC438      422878
11735                               NUC438      500706283;500711792;500712711;
                                                500725618;500738973;6513
11813                               NUC681      633791
            NUC146      PRT315      NUC439      546312
12043                               NUC682      632330
12048                               NUC683      101164
12098                               NUC684      135037
12202                               NUC685      168232;243338
12220                               NUC686      433866
12243                               NUC687      659527
12263                               NUC688      139596
12276                               NUC689      624892;628879;631590;633480
12490                               NUC690      500704734;500744586;500744972;
                                                611533;634337
12604       NUC147      PRT316      NUC440      614106
12604                               NUC440      178304
12657                               NUC691      238579;248948;397864;521873;
                                                526295;589951
12788       NUC148      PRT317      NUC441
12901       NUC149      PRT318      NUC442      124608
12907       NUC150      PRT319      NUC443      478617
12907                               NUC443      481184
13013                               NUC692      583731;650307;650848
13202                               NUC693      [illegible]
13229       [illegible] [illegible] NUC444
13256       NUC153      PRT322      NUC445      165123
13256       NUC154      PRT323      NUC445      165643
13256                               NUC445      142964;150214;223536;245008
13267       NUC155      PRT324      NUC446      488818
                                    NUC694      193487;238191;248387
26638                               NUC695
26710                               NUC696      600909;608784;611758;614721;
                                                619435;620041;620372;625815;
                                                625933;625983;626308;627299;
                                                627481;628147;628753;631655;
                                                633039;633371;633760;634553;
                                                642966
26726                               NUC697      421115
26786                               NUC698      500702480
26982                               NUC699      638872
27084       NUC156      PRT325      NUC447      242080
27084                               NUC447      128161;186671;210505;211578;
                                                214909;221663;222101;223000;
                                                224361;226849;242326;242424;
                                                243662;244913;247912
27273                               NUC700      525674;601556
27301                               NUC701      135037
27336                               NUC702      500742735
27361                               NUC703      205346;530902
27374                               NUC704      643006
27627                               NUC705      129706;223196
27697                               NUC706      500701900
27877                               NUC707      99497
28413                               NUC708      135042
28517                               NUC709      150011;201848
28518                               NUC710      500721700;500729093;500730152
29120       NUC157      PRT326      NUC448      488444
29469                               NUC711      638852
29472                               NUC712      637812
29557                               NUC713      188208
29673                               NUC714      650606
29814                               NUC715      813496
30218                               NUC716      241681;242553;589203
30446                               NUC717      105288
30477                               NUC718      500724995;500758517
30583                               NUC719      117932;194613;225013;331614
30719                               NUC720      637363
31356                               NUC721      106998
31422       NUC158      PRT327      NUC449      500732587
31554                               NUC722      222161
31627                               NUC723      173050
31726                               NUC724      393750
31744                               NUC725      176380
31790                               NUC726      625728
32102                               NUC727      500762549
32473       NUC159      PRT328      NUC450      183902
32475       NUC160      PRT329      NUC451      635993
32962                               NUC728      500740719
33130                               NUC729      165852;165888
33712       NUC161      PRT330      NUC452      398703
35005                               NUC730      124493
35185       NUC83       PRT252      NUC403      589785
35258                               NUC731      224898
35326                               NUC732      145027
35597                               NUC733      237630
35912                               NUC734      637431
35984                               NUC735      117238
36122                               NUC736      226039
37337                               NUC737      143508;196052;221995
38112                               NUC738      129444
38220                               NUC739      608709
38311                               NUC740      600921
38631                               NUC741      194909
38749                               NUC742      163588
38890       NUC162      PRT331      NUC453      500742815
38890       NUC163      PRT332      NUC453      500735594
38890       NUC164      PRT333      NUC453      500737569
38890       NUC165      PRT334      NUC453      500730242
38890       NUC166      PRT335      NUC453      500766374
38890       NUC167      PRT336      NUC453      500711885
40163                               NUC743      244540
40975                               NUC744      106061
40991                               NUC745      106061
42896                               NUC746      137110
43190                               NUC747      244266
44053                               NUC748      236316;249443;392450;449591;
                                                486460;509697;519210;528872;
                                                585226;589902;601550
45091                               NUC749      238559
45179                               NUC750      132269
45274       NUC84       PRT253      NUC404      1000867870
46679       NUC85       PRT254      NUC405      140265
47171                               NUC751      420959
48024       NUC168      PRT337      NUC454      113448
48548                               NUC752      227400
48603                               NUC753      200687;244886
48670                               NUC754      164887
48671                               NUC755      221136
48823                               NUC756      525888
48901                               NUC757      525775
49018                               NUC758      229481
49034                               NUC759      237358
49133                               NUC760      224706;582974
49140                               NUC761      636146
49261                               NUC762      186091
49387                               NUC763      212526
49416                               NUC764      530211
49426                               NUC765      313150;313151
49493                               NUC766      213393
49640                               NUC767      626919
49863                               NUC768      181361;382057;631692
49871                               NUC769      181361;382057;631692
50015                               NUC770      203070;331013
50049                               NUC771      196104
50112                               NUC772      118123;335719
50185                               NUC773      489405;496185
50241                               NUC774      145339;156186;244350
50353                               NUC775      119282
50763                               NUC776      500701668
50982                               NUC777      637372
51130                               NUC778      180402
51212                               NUC779      238923
51346                               NUC780      646118
51380       NUC169      PRT338      NUC455      523002
51400                               NUC781      231102
51796                               NUC782      210281
51954                               NUC783      141971;183117;205855;502241
52076                               NUC784      625028
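
For readers who wish to handle Table I programmatically, one row can be represented as a small record. The following is a minimal sketch under stated assumptions: the class name, field names, and parsing function are hypothetical and are not part of the disclosure, and the five cells are assumed to be supplied in the column order of Table I, with an empty string for an empty cell and semicolons separating clone IDs.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TableOneRow:
    """One row of Table I; field names are illustrative assumptions."""
    priority_seq_id: int             # SEQ ID NO from the priority application
    nucl_seq_id: Optional[str]       # second column, e.g. "NUC86"; often empty
    prt_seq_id: Optional[str]        # third column, e.g. "PRT255"; often empty
    nucl_seq_id_2: str               # fourth column, e.g. "NUC406"
    clone_ids: List[int]             # fifth column, split on ';'


def parse_row(cells):
    """Build a TableOneRow from five cell strings (empty string = empty cell)."""
    priority, nucl, prt, nucl2, clones = (cell.strip() for cell in cells)
    return TableOneRow(
        priority_seq_id=int(priority),
        nucl_seq_id=nucl or None,
        prt_seq_id=prt or None,
        nucl_seq_id_2=nucl2,
        clone_ids=[int(c) for c in clones.split(";") if c.strip()],
    )


# Two rows taken from Table I: one with all five cells, one with two empty cells.
full_row = parse_row(["593", "NUC86", "PRT255", "NUC406", "500762786"])
short_row = parse_row(["599", "", "", "NUC579", "148991;206343;211039"])
print(full_row)
print(short_row)
```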



TABLE II

SEQ ID NO. in           Chromosomal location
priority application

51                      3q26.2
452                     2q31
483                     11p11.2
505                     20p12
573                     5q21
796                     15
1151                    22q11.2
1590                    X
2028                    X
2932                    8p23
3280                    21
3326                    15
3804                    16p13.3
4172                    1, 15q13-q14
4340                    1
4609                    4p13
4647                    11q23, 11q23.3
4694                    12p13.2
4733                    14q32.1
4855                    1q23-1q24
4908                    12p13
5011                    7q32, 7q32-36
5040                    8p23
5089                    4q11
5167                    12
5278                    19q13.2
5563                    1, 2
5947                    11q23
5974                    4q28, 4q28-q31
6011                    4
6290                    2p13
6322                    2p13
6345                    16q24.3
6400                    10q25-26
6892                    4
7697                    2p11, 4q28
8166                    14q32
8666                    16
8671                    16
9755                    3
10044                   12q, 16
10322                   1q11.1, 1q21
10584                   3p21.3
10744                   6
11019                   11p11.2, 22q13
11342                   7q21
12907                   7q36.1
13202                   15
13285                   2q23-24, 11q23.2-24.2
27697                   12p11.2
28517                   2q31
28518                   2q31
29673                   16p13.3
29814                   11
31422                   11q13
31627                   11
32962                   9q32-33
33712                   3p21.3, 3p21.31
35185                   2
38631                   15
48823                   13q34-qter, 17
49018                   7q22
49133                   1q32
49387                   17q21
50015                   5q31
51380                   Xq28



SEQ ID NO. in   Tissue Distribution

37              I:11
51              A:55 B:4 C:1 E:1 F:30 G:19 H:9
179             F:16 K:4
180             F:5 K:1
183             H:6
201             C:1
326             I:3
362             K:2
440             A:7
452             F:1 G:5
483             G:4
500             I:1
505             A:2
528             K:2
573             F:3 K:1
574             A:1
587             H:1 I:3
588             F:6 K:2
593             A:2 C:15 F:2 G:14
599             A:8 F:7 K:3
603             K:1
621             C:1
628             F:1 K:3
653             B:6 F:3 G:1 I:1 K:7
670             K:11
678             F:1 K:2
693             B:7 G:7
703             F:9 K:4
746             A:9 H:1 K:2
770             G:4
775             F:1 K:13
796             G:1
812             A:13 F:1 H:1
940             F:8
988             F:8 K:1
996             K:4


^^ii 






A:12 B:130 C:16 D:7 F:l G:16 


TiTi 




TlQO 


B'4 C'l H'7 r6 


"1458 




"1590 





"1853 





1904 


li^i — ^ — ' 


"tTt^ 


~A^8 




B:l C:3 D:l G:l 


~2368 




~2553 


B:7 D:5 H:39 


~2556 


B:7 D:5 H:39 


~2658 


-^^ 


"5Mn 




~r7S5 


"hm 


~2800 


P ' „ , 


"5841 


ASr-l 


^5«?7 


-=q — = = 


-r-rr 


~fvi 


~9Q?5 


ICM 


^078 


"h^ 


1280 


' „ 


"rm 


„_ ' 


"tTrt 


IPt 


U^Q 




• 


l^m 




~3633 


• 


1fi78 


ito — ' 

' 


1714 




1706 


plj 


~3801 


K:2 


1804 


8:1 C:10 D:l G:2 1:11 


lag? 




~3985 


B:l C:5 D:l H:l J:6 


~4005 




"4063 


A:2 B:10 C:4 


~4088 


"X^fi 


"4111 




~4126 




"4172 





4261            H:2
4340            A:3 G:3 I:1
4609            D:10
4647
4660            D:1
4664            B:48 C:2 H:2 I:3
4678            C:5 D:17 G:4
[illegible]     D:5 F:2 H:1
4690            D:3 F:1 I:1 K:1
4694            I:3
4696            I:1
4733            D:1
4807            B:11 H:1
4809            I:1
4830            K:1
4855            H:1
4900            A:1
4908            A:16
4943            F:6 K:2
4947            K:3
4976            J:2
5000            A:2 B:14 C:17 D:5 F:1 G:9 H:3 I:3
5002            B:2
5005            C:31 F:1 J:2 K:2
5011            H:5 I:2
5040            B:1 C:2 D:11 G:1 J:1
5058            K:1
5071            F:1 K:1
5089            I:1
5117            H:7
5141            F:2 K:1
5162            A:1 B:8
5167            I:1
5178            G:1
5192            F:14 K:3
5214            A:10 B:19 C:5
5230            A:3 B:6 C:9 D:1 F:1 G:4 H:3 I:2
5240            A:1
5250            A:3 B:39 C:29 D:3 H:3 I:1 K:1
5262            A:3 B:39 C:34 D:3 H:3 I:1 K:1
5270            B:8
5278            D:1
5358            G:1
5453            A:2 C:6
5494            A:1 D:1
5499            A:3
5533            A:2 B:7 C:1
5563            D:1 F:1
5609            G:3
5657            G:1
5691            K:1
5748            A:3 B:1 C:4 D:1 G:30 H:1
5806            A:2 B:7 C:1 F:1 I:3 K:3
5816            B:5
5824            H:3
5861            B:16 C:27 D:13 F:7 H:21 I:12 K:4
5885            A:1
5913            K:1
5947            D:2
5966            G:11
5970            C:4
5974            D:12
5983            A:5
5985            B:2 C:5 H:1 I:1 K:2
6011            B:1
6080            B:1 I:1
6081            I:9
6108            F:5
6159            B:1
6231            A:1
6238            A:15 B:5 G:14 H:3 I:2
6252            G:7
6283            F:1 K:1
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SEO ID NO. in 


Tissue Distribution 


6290


B:3


6322


C:1


6329


A:1


6334


F:51 K:13


6345


A:9 G:1


6350


A:1 B:2


6358


B:9 H:1


6384


B:1 C:1


6400


K:1


6418


K:1


6431


A:1 B:1 K:1


6453


F:1


6636


A:6 B:37 C:11 D:1 F:1 G:4 H:1 J:1


6660


F:1 K:2


6688


A:1 B:42 D:1 G:3 I:3 K:3


6727


F:16 G:5 K:103


6835


K:1


6865


All I:1


6892


B:23 C:1 H:2 I:1 J:12


7000


B:1


7041


C:1 F:17 H:1


7533


G:2


7535


A:5


7577


F:1


7697


D:10 F:3


7712


F:1 K:3


8009


B:2 C:2 H:1


8078


F:5 K:20


8097


I:3


8166


A:7 B:6 D:8 G:52


8262


H:1


8341


A:1 F:2 G:3 H:2 I:1


8534


F:1 K:6


8666


B:4 D:1 G:16 K:1


8671


G:1 K:2


8744


H:1


8968


A:12 G:3 H:1


8994


F:1 K:4


9297


D:1


9327


K:1


9332


K:4


9406


B:10 D:3 H:39


9407


B:10 D:3 H:39


9668


F:9 K:41


9679


B:1 F:1 K:1


9755


K:5


9868


F:9 K:1


10044


A:10 B:10 D:14 F:2 G:24 H:5 K:4


10322


B:1 F:2 K:3


10526


B:3 H:3


10584


A:2


10650


K:3


10739


A:1 B:1 G:3 J:1


10743


C:1 J:1


10744


A:6 B:3 G:2 I:3


10761


H:1


10880


D:1 G:1


10942


A:30


11019


F:4


11278


I:1


11342


F:1 K:2


11562


F:1 K:4


11688


F:2 K:12


11735


A:25 B:57 C:7 D:1 F:1 G:1 K:1


11813


A:8


12039


B:3


12043


A:6


12048


F:1 H:1


12098


A:4 B:8 H:2 I:1


12202


K:2


12220


C:2


12243


A:1 B:132 C:38 D:31 F:6 G:11


12263


H:4


12276


A:18


12490


A:9 B:11 C:8 G:4


12604


A:6 B:14 C:22 D:1 F:1 G:4 H:4 I:1


12657


B:1 C:6 J:2


12788


K:2


12901


G:2


12907


I:6


13013


D:5


13202


A:2 B:14 C:9 D:1 G:3 H:3 J:3 K:1


13229


A:3


13256


F:2 K:5


13267


I:8


13285


B:1 C:42 D:5 I:1 J:10


26638


D:2


26710


A:90 B:48 G:27


26726


C:4




A:24


26982


A:1 G:1


27084


F:16 K:83


27273


C:3


27301


A:3 B:3 H:2


27336


B:1 G:1


27361


F:2


27374


B:1 D:1


27627


F:1 K:1


27697


A:1 C:1


27877


H:1 I:19


28413


H:1


28517


K:2


28518


G:3


29120


I:1


29469


A:2 G:2


29472


A:1


29557


C:1


29673


D:1


29814


A:1


30218


F:7 K:1


30446


B:1 C:1 H:10


30477


G:7


30583


K:5


30719


A:1 B:9 I:3


31356


H:2


31422


G:1


31554


K:1


31627


C:1


31726


C:1


31744


F:1


31790


A:25


32102


G:1


32473


B:1


32475


B:1


32962


G:1


33130


K:3


33712


B:1 C:1


35005


A:1


35185


C:1


35258


K:1


35326


B:1 F:3 G:1 H:1 I:1 K:3


35597


F:1 K:1


35912


A:1


35984


F:1


36122


K:1


37337


F:1 K:2


38112


K:1


38220


A:1


38311


A:1


38631


K:1


38749


H:1


38890


G:21


40163


K:2


40975


B:8 D:1 H:39


40991


B:8 D:1 H:39


42896


H:1


43190


K:1


44053


C:11 D:1 H:1 I:2 J:5


45091


J:1


45179


H:1


45274


C:4 H:2


46679


D:3 F:1 H:1


47171


C:1


48024


B:4 C:5


48548


F:1


48603


F:41 K:19


48670


K:1


48671


K:2


48823


C:65


48901


B:1 G:2 I:2


49018


B:2 C:2 F:2 J:2


49034


D:1 F:2 H:1 I:3 K:2


49133


F:4 K:2


49140


A:1 C:1


49261


F:3 K:23


49387


F:2 K:1


49416


B:18 C:1


49426


J:51


49493


C:1 F:1 I:6


49640


A:5 F:1


49863


A:6 C:2 I:1


49871


A:6 C:2 I:1


50015


K:2


50049


K:2


50112


K:3


50185


I:6


50241


F:3 H:1


50353


F:1 K:1


50763


A:14


50982


A:2






51212


J:2


51346


A:2 B:1 D:1


51380


B:1 F:1


51400


F:3


51796


F:1 K:2


51954


B:13 C:1 D:1 F:2 K:1


52076


A:2



TABLE IV 



SEQ ID NO. in priority application


Tissue source


37


adenocarcinoma(2), carcinoid(2), testis(1), tonsil(1)


51


adipose tissue, white(4), cerebellum(3), cochlea(1), colon tumor rer+(2), dorsal root
ganglion(1), hippocampus(1), kidney(1), liver(1), malignant melanoma, metastatic to lymph
node(1), muscle(2), normal leg muscle(1), parathyroid tumor(1), pectoral muscle (after


201


pancreas(1)


326


ovarian tumor(1), uterus(1)


440


brain cortex(1), carcinoid tumor(1)


483


pbl(1), adenocarcinoma(1), astrocytoma(1), ovarian tumor(1), schizophrenic brain s-11
frontal lobe(1)


500


colon(4), colon tumor rer+(1), pooled germ cell tumors(1)


505


total brain(1)


528


brain(2), cerebellum(1), colon(1), ovarian tumor(6)


573


melanoma (mewo cell line)(1)


587


germinal center b cell(1), lymphoma(1), parathyroid tumor(1)


593


ovarian tumor(1), placenta(1)


621


placenta(2)


653


2 pooled tumors (clear cell type)(2), anaplastic oligodendroglioma(2), glioblastoma
(pooled)(2)


678


ovarian tumor(1), prostate(1)


693


brain(1), placenta(1)


703


small cell carcinoma(2)


746


small cell carcinoma(1)


770


colon(1), frontal lobe(1), human pancreatic islets(1), normal leg muscle(1), ovarian tumor(1),
pancreatic islet(2), senescent fibroblast(1), total brain(1)


775


anaplastic oligodendroglioma(1), frontal lobe(1)


796


adrenal adenoma(1), adrenal gland(1), breast tumor(3), placenta(2)


812


heart(1), brain(1), frontal lobe(3), neuroepithelial cells(1), retina(1), small cell carcinoma(1),
total brain(1)


988


germinal center b cell(1)


1064


2 pooled tumors (clear cell type)(1), ewing's sarcoma(2), adenocarcinoma(5), anaplastic
oligodendroglioma(5), breast tumor(1), carcinoid(4), colon(3), colon tumor(1), colon tumor
rer+(2), frontal lobe(4), germinal center b cell(7), kidney tumor(1), lung tumor(1), metastatic

mastectomy)(12), placenta(1), pooled germ cell tumors(4), senescent fibroblast(2), squamous
cell carcinoma from base of tongue(1), tumor, 5 pooled (see description)(1)


1151


anaplastic oligodendroglioma(4), carcinoid(1), colon tumor rer+(1), medulloblastoma(1),
normal prostate(1), parathyroid tumor(1), tumor(1)


1190


colon(1)


1853


2 pooled high-grade transitional cell tumors(1), 2 pooled tumors (clear cell type)(1)


2173


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(2), cd34+, cd38-
from normal bone marrow donor(6), adenocarcinoma(1), alveolar rhabdomyosarcoma(1),
anaplastic oligodendroglioma(3), breast(1), carcinoid(1), cerebellum(2), cochlea(3),
colon(11), colon tumor rer+(2), dorsal root ganglion(1), early stage papillary serous
carcinoma(1), epithelium (cell line)(1), follicular lymphoma(1), frontal lobe(15), germinal
center b-cells(3), invasive adenocarcinoma(5), kidney(1), kidney tumor(2), larynx(1),
liver(1), lung carcinoma(1), lymphoma(1), meningioma(1), moderately differentiated
adenocarcinoma(2), moderately-differentiated adenocarcinoma(1), muscle(1), normal

papillary serous carcinoma(1), pectoral muscle (after mastectomy)(23), pooled germ cell
tumors(2), prostate(1), stem cell 34+/38+(2), thyroid(1), tumor, 5 pooled (see description)(4),

two pooled squamous cell carcinomas(2)


2553


brain(2), tumor(1)






-pll


frontal lobe(1)


2843


b-cell, chronic lymphotic leukemia(2), anaplastic oligodendroglioma(5), carcinoid(1),


2852


frontal lobe(3)


2932


stroma(1)


3078


frontal lobe(2)




b-cell, chronic lymphotic leukemia(1), germinal center b cell(1), leiomyosarcoma(1), pooled
germ cell tumors(1)


3326


muscle(3), normal leg muscle(1), parathyroid tumor(1)


3387


carcinoid(1), pooled germ cell tumors(4)


3439


2 pooled tumors (clear cell type)(1), b-cell, chronic lymphotic leukemia(3), ewing's
lymphoma(1), parathyroid tumor(1), senescent fibroblast(1), squamous cell carcinoma(1)


3501


anaplastic oligodendroglioma(1), breast(1), carcinoid(2), colon(1), colon tumor rer+(1),
epithelium (cell line)(1), kidney tumor(1), ovarian tumor(1), parathyroid tumor(2), pooled


"3633 


cerebeUuiTiif n°mild^ ceH Tardnoma( n^tota"b^^ ^ ^' 


"3714 


^'^l>o\ed7umoTs(deaTce^^^ colonO) ^^^^^'^^ ° ™ 


3804


heart(1), anaplastic oligodendroglioma(1), carcinoid(1), colon tumor(2), germinal center b
cell(2), kidney tumor(1), lung tumor(3), lymphoid(2), moderately differentiated
adenocarcinoma(1), muscle(1), ovarian tumor(3), pectoral muscle (after mastectomy)(1),

squamous cell carcinoma(3)


3892


blood(1), total brain(2)


3985


anaplastic oligodendroglioma(1), breast(1)


4005


2 pooled tumors (clear cell typeXO, brain(2), germinal center b cell(1), muscle(1)




lobe(1), senescent fibroblast(1), total brain(2)


"ZT^^ 


anaplastic oligodendroglioma(1), cerebellum(1), colon(1)




retma(j) 


"4340 


heart(1), blood(1), brain(2), colon(6), frontal lobe(9), invasive tumor (cell line)(1),
melanocyte(1), neuroepithelial cells(1), senescent fibroblast(1), small cell carcinoma(2), total
brain(3)


4436


b-cell, chronic lymphotic leukemia(1), bulk tumor(1), colon(24), early stage papillary serous
carcinoma(3), lung carcinoma(1), pancreatic cancer(1), pooled germ cell tumors(1)


4609


blood(1)


4647


colon(1), liver(1)
adenocarcinoma(1)


4664


2 pooled tumors (clear cell type)(5), adenocarcinoma(3), brain(1), breast(1), colon(4), colon
tumor rer+(1), frontal lobe(5), liver(1), neuroepithelial cells(1), normal prostate(1), ovarian


"4678 


cXn^tumw^rTr+?^ brain(l), tumor(l) 


~46^ 


2 pooled tumors (clear cell type)(4), anaplastic oligodendroglioma(2), breast(3), carcinoid(1),

glioblastoma (pooled)(1), pooled germ cell tumors(1)


4687


2 pooled tumors (clear cell typeXO, carcinoid(3), colon(3), normal prostate(1), pooled germ


4690


carcinoid(1), colon(3), normal prostate(1), pooled germ cell tumors(1)


4694


colon tumor, rer+(1)


4733


colon(3), liver(14), pancreatic islet(1)


4807


2 pooled tumors (clear cell type)(1), alveolar rhabdomyosarcoma(1), carcinoid(3), colon(2),
normal prostate(2), normal prostatic epithelial cells(1), ovarian tumor(23), prostate(2), serous
adenocarcinoma(1), total brain(2), tumor(1), tumor, 5 pooled (see description)(4)


4809


2 pooled tumors (clear cell type)(1), alveolar rhabdomyosarcoma(1), carcinoid(3), colon(2),
normal prostate(2), normal prostatic epithelial cells(1), ovarian tumor(28), prostate(2), serous
adenocarcinoma(1), total brain(2), tumor(1), tumor, 5 pooled (see description)(5)


4855


b-cell, chronic lymphotic leukemia(1), carcinoid(1)


4900


2 pooled tumors (clear cell type)(1), b-cell, chronic lymphotic leukemia(1), colon(1), liver(1),
low-grade prostatic neoplasia(1), normal prostate(1), pooled germ cell tumors(1)


4908


alveolar rhabdomyosarcoma(1), colon(1), hemopoietic system(1), liver(1), malignant ascitic
effusion(1), moderately-differentiated adenocarcinoma(1), neuroepithelial cells(1),
parathyroid tumor(1), synovial membrane(1), uterus(1)


4947


parathyroid tumor(1), testis(2)


5000


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(2), anaplastic
oligodendroglioma(5), breast(4), carcinoid(2), colon tumor, rer+(1), germ cell tumor(1),
germinal center b cell(6), invasive prostate tumor(1), lung tumor(1), metastatic prostate bone
lesion(2), normal prostate(1), parathyroid tumor(2), pectoral muscle (after mastectomy)(3),
pooled germ cell tumors(1), senescent fibroblast(4), stroma(1), thyroid(1), tumor, 5 pooled


5002


pectoral muscle (after mastectomy)(1), senescent fibroblast(1)


5005


anaplastic oligodendroglioma(1), germinal center b cell(1), lobular carcinoma in situ(1),
lymphoma(1), oligodendroglioma(1), pooled germ cell tumors(8)


5011


breast cancer(1), normal prostate(5), seminal vesicles(1)


5040


bone marrow(18), brain(1), hematopoietic from ami patient(1), liver(2), normal cortical
stroma(1)




5089


parotid gland(1)


5117


adenocarcinoma(2), breast(2), carcinoid(1), colon(3), colon tumor rer+(1), epithelium (cell

adenocarcinoma(2), normal prostate(3), normal prostatic epithelial cells(2), placenta(1),
prostate(3), small cell carcinoma(1), squamous cell carcinoma(2), squamous cell carcinoma

from base of tongue(1)


5162


hippocampus(1), schizophrenic brain s-11 frontal lobe(1)




colon(3), colon tumor rer+(2), pooled germ cell tumors(1)


5214


kidney(1), ovarian tumor(1), pituitary gland(1), total brain(2)


5230


2 pooled tumors (clear cell type)(5), placenta(1), breast(1), breast tumor(1), carcinoid(2),
colon(1), colon carcinoma(1), colon mucosa(1), colon tumor rer+(4), germinal center b
cell(1), kidney tumor(1), normal prostate(5), papillary serous carcinoma(1), parathyroid
tumor(3), pectoral muscle (after mastectomy)(3), pooled germ cell tumors(2), senescent
fibroblast(1), squamous cell carcinoma from base of tongue(1)


5240 


bone(1), colon(2), frontal lobe(2), glioblastoma (pooled)(1), liver(1), ovarian tumor(1),


5250


2 pooled tumors (clear cell type)(1), anaplastic oligodendroglioma(2), brain(5), carcinoid(1),
colon(2), colon tumor rer+(1), pooled germ cell tumors(1)


5262


2 pooled tumors (clear cell type)(1), anaplastic oligodendroglioma(2), brain(5), carcinoid(1),
colon(2), colon tumor rer+(1), pooled germ cell tumors(1)


5270


ovarian tumor(1)


5278


2 pooled tumors (clear cell type)(7), anaplastic oligodendroglioma(7), breast(1), breast
tumor(4), carcinoid(3), colon(1), colon tumor rer+(2), glioblastoma (pooled)(1), liver(3),
pooled germ cell tumors(6), thyroid(2)


5358


2 pooled tumors (clear cell type)(1), anaplastic oligodendroglioma(2), brain(2), colon(2),
germinal center b cell(4), melanocyte(1), normal prostate(3), parathyroid tumor(2), pectoral
muscle (after mastectomy)(3), pooled germ cell tumors(4), senescent fibroblast(2)


5494


2 pooled tumors (clear cell type)(2), cd34+, cd38- from normal bone marrow donor(2),
anaplastic oligodendroglioma(2), brain(2), colon(2), colon tumor rer+(1), early stage
papillary serous carcinoma(1), germinal center b cell(3), glioblastoma (pooled)(3),
moderately-differentiated adenocarcinoma(1), normal prostate(1), normal prostatic epithelial
cells(1), omentum(1), ovarian tumor(4), ovary(1), parathyroid tumor(3), pectoral muscle
(after mastectomy)(1), pooled germ cell tumors(1), senescent fibroblast(2), stem cell
34+/38+(1), synovial membrane(1), synovial sarcoma(1), tumor(1)


5499


2 pooled tumors (clear cell type)(1), brain(2), pancreatic islet(2)


5533


2 pooled tumors (clear cell type)(2), anaplastic oligodendroglioma(3), breast(1), carcinoid(1),
germinal center b cell(2), glioblastoma (pooled)(2), pooled germ cell tumors(2), senescent
fibroblast(1), small cell carcinoma(2)


5563


colon(3), kidney(1), liver(1), neuroepithelial cells(1), normal prostatic epithelial cells(1),
ovarian tumor(1), senescent fibroblast(1), total brain(1)


5691


2 pooled high-grade transitional cell tumors(1), 2 pooled tumors (clear cell type)(3),
adenocarcinoma(3), anaplastic oligodendroglioma(2), carcinoid(3), colon(2), germinal center
b cell(1), glioblastoma (pooled)(3), medulloblastoma(1), ovarian tumor(1), parathyroid
tumor(2), pectoral muscle (after mastectomy)(2), prostate(1), three pooled meningiomas(1),
total brain(1)


5748


bone(2), anaplastic oligodendroglioma(5), breast(1), carcinoid(1), colon tumor rer+(1),
frontal lobe(2), germinal center b cell(1), glioblastoma (pooled)(1), ovarian tumor(4),
parathyroid tumor(2), pooled germ cell tumors(1), senescent fibroblast(1), tumor, 5 pooled

(see description)(1)


5806


2 pooled tumors (clear cell type)(4), female, 19 years old, normal leg muscle(2), alveolar
rhabdomyosarcoma(1), anaplastic oligodendroglioma(4), breast(1), carcinoid(3), germinal
center b cell(1), glioblastoma (pooled)(2), normal prostate(1)


5816


anaplastic oligodendroglioma(1), frontal lobe(4)


5824


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(2),
adenocarcinoma(1), anaplastic oligodendroglioma(3), blood(1), breast(1), carcinoid(9),
cerebellum(1), colon(3), fibrotheoma(1), follicular lymphoma(1), germinal center b cell(2),
glioblastoma (pooled)(2), kidney tumor(1), low-grade prostatic neoplasia(2), normal prostatic
epithelial cells(1), ovarian tumor(2), parathyroid tumor(7), pooled germ cell tumors(1),
senescent fibroblast(6), thyroid(1)


5861


adipose tissue, white(2), bone marrow from femur(1), carcinoid(1), epithelium(1), normal
prostatic epithelial cells(1), pectoral muscle (after mastectomy)(1)


5885


2 pooled tumors (clear cell type)(1), female, 19 years old, normal leg muscle(1), anaplastic
oligodendroglioma(5), bone marrow stroma(1), colon tumor rer+(1), germinal center b
cell(5), glioblastoma (pooled)(1), kidney tumor(1), melanocyte(2), moderately differentiated

normal prostatic epithelial cells(2), ovarian tumor(2), parathyroid tumor(1), senescent


5947


2 pooled tumors (clear cell type)(4), adenocarcinoma(1), colon tumor(3), intestine(1), liver(1)


5970


b-cell, chronic lymphotic leukemia(1), carcinoid(1), germinal center b cell(2), pooled germ
cell tumors(3)


5974


blood(5), bone marrow(1), liver(4), reticulocyte(1)


5983


2 pooled tumors (clear cell type)(1), adenocarcinoma(1), carcinoid(1), germinal center b
cell(4), medulloblastoma(1), pooled germ cell tumors(1), tumor, 5 pooled (see description)(1)




2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(2), anaplastic
oligodendroglioma(5), breast(4), carcinoid(2), colon tumor, rer+(1), germ cell tumor(1),

lesion(2), normal prostate(1), parathyroid tumor(2), pectoral muscle (after mastectomy)(3),
pooled germ cell tumors(1), senescent fibroblast(4), stroma(1), thyroid(1), tumor, 5 pooled
(see description)(2)


6011


brain(5), carcinoid(1), frontal lobe(1), lung carcinoma(1), retina(1)


6080


2 pooled tumors (clear cell type)(8), ewing's sarcoma(2), heart(1), adenocarcinoma(1),
alveolar rhabdomyosarcoma(6), anaplastic oligodendroglioma(5), aorta(1), breast(4), bulk
germ cell seminoma(2), colon(3), germinal center b cell(2), glioblastoma (pooled)(1),
kidney(2), lung carcinoma(1), metastatic prostate bone lesion(4), normal prostate(1), normal
prostatic epithelial cells(1), oligodendroglioma(1), ovary(2), parathyroid tumor(5), pectoral
muscle (after mastectomy)(19), pooled germ cell tumors(1), prostate(1), senescent
fibroblast(4), tumor, 5 pooled (see description)(1)


6081


brain(1), germinal center b cell(1), testis(1)


6108


2 pooled tumors (clear cell type)(1), carcinoid(1), germinal center b cell(1), parathyroid
tumor(1)


6159


2 pooled tumors (clear cell type)(2), ewing's sarcoma(1), schwannoma tumor(1), adipose
tissue, white(1), adrenal adenoma(3), amygdala(1), anaplastic oligodendroglioma(1),
astrocytoma(2), bone marrow stroma(3), borderline ovarian carcinoma(1), cochlea(3), colon
tumor(1), epithelium (cell line)(4), frontal lobe(3), germinal center b cell(1), human
pancreatic islets(1), kidney tumor(3), larynx(1), liver(1), lung carcinoma(1), lung tumor(1),
normal leg muscle(1), oligodendroglioma(1), ovarian tumor(8), parathyroid tumor(1),
pectoral muscle (after mastectomy)(8), prostate tumor(1), senescent fibroblast(3), small cell
carcinoma(4)


6231


adenocarcinoma(2), breast(3), colon(4), colon tumor(2), endometrioid ovarian metastasis(1),
epithelium (cell line)(22), frontal lobe(1), germ cell tumor(3), invasive tumor (cell line)(13),
moderately differentiated adenocarcinoma(1), normal prostatic epithelial cells(1), ovarian
tumor(4), ovary(1), pancreatic islet(1), placenta(trophoblast)(1), pooled germ cell tumors(2),
squamous cell carcinoma(1), synovial sarcoma(1), tumor(3), tumor, 5 pooled (see
description)(2), two pooled squamous cell carcinomas(2)


6238


bone(1), colon(2), ovarian tumor(2), small cell carcinoma(2), total brain(1)


6252


2 pooled tumors (clear cell type)(1), ewing's sarcoma(6), adenocarcinoma(1), alveolar
rhabdomyosarcoma(3), anaplastic oligodendroglioma(2), brain(1), breast(2), breast tumor(1),
carcinoid(2), cochlea(2), germ cell tumor(1), glioblastoma (pooled)(1), kidney(1), kidney
tumor(6), liposarcoma(1), lung carcinoma(1), lymphoma(1), metastatic prostate bone
lesion(6), muscle(1), normal prostate(1), normal prostatic epithelial cells(1), ovarian
tumor(2), ovary(4), parathyroid tumor(1), pectoral muscle (after mastectomy)(5), pooled
germ cell tumors(4), prostate(1), renal cell tumor(1), senescent fibroblast(3), stem cells(1),
tumor, 5 pooled (see description)(4)


6290


2 pooled tumors (clear cell typeXO, b-cell, chronic lymphotic leukemia(1), heart(3),
lymphoma(1), adenocarcinoma(1), adipose tissue, white(1), brain(3), carcinoid(5),
cerebellum(5), colon(8), epithelium (cell line)(1), frontal lobe(1), germinal center b-cells(1),
kidney tumor(1), medulloblastoma(1), melanoma (mewo cell line)(1), normal prostate(1),
omentum(1), ovarian tumor(1), pancreatic islet(1), placenta(3), pooled frontal lobe(1),
retina(1), retinal fovaea(1), schizophrenic brain s-11 frontal lobe(1), senescent fibroblast(1),
small cell carcinoma(3), synovial membrane(1), total brain(2)


6322


b-cell, chronic lymphotic leukemia(1), heart(4), lymphoma(1), adipose tissue, white(1),
brain(3), carcinoid(4), cerebellum(5), colon(8), epithelium (cell line)(1), frontal lobe(1),
germinal center b-cells(1), kidney tumor(1), melanoma (mewo cell line)(1), omentum(1),
ovarian tumor(1), pancreatic islet(1), placenta(3), pooled frontal lobe(1), retinal fovaea(1),
schizophrenic brain s-11 frontal lobe(1), senescent fibroblast(1), small cell carcinoma(3),
synovial membrane(1), total brain(2)


6329


b-cell, chronic lymphotic leukemia(3), heart(3), lymphoma(2), adenocarcinoma(6), adipose
tissue, white(1), adrenal adenoma(2), anaplastic oligodendroglioma(2), bone marrow
stroma(1), brain(2), breast(1), carcinoid(7), cerebellum(5), colon(8), epithelium (cell line)(1),
frontal lobe(1), germ cell tumor(2), germinal center b-cells(1), kidney tumor(1), larynx(1),
lung tumor(1), medulloblastoma(1), melanoma (mewo cell line)(1), metastatic melanoma to
bowel(1), moderately differentiated adenocarcinoma(1), normal prostate(2), omentum(1),
ovarian tumor(1), pancreatic islet(2), papillary serous ovarian metastasis(1), parathyroid
tumor(1), pectoral muscle (after mastectomy)(1), placenta(2), pooled frontal lobe(1), pooled
germ cell tumors(3), prostate(1), retinal fovaea(1), schizophrenic brain s-11 frontal lobe(1),

membrane(1), total brain(2), tumor(1), tumor, 5 pooled (see description)(1), two pooled
squamous cell carcinomas(1)


6334


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(2), anaplastic
oligodendroglioma(9), glioblastoma (pooled)(3), kidney(1), normal prostate(3), ovarian
tumor(1), senescent fibroblast(2)


6345


testis(3)
6350


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(1), bone(1),
adenocarcinoma(1), anaplastic oligodendroglioma(10), breast(1), breast tumor(1), colon(2),
glioblastoma (pooled)(6), kidney(1), lung carcinoma(1), medulloblastoma(1), metastatic
prostate bone lesion(1), normal prostate(1), pectoral muscle (after mastectomy)(4), pooled
germ cell tumors(2), senescent fibroblast(2), squamous cell carcinoma(1), testis(1), tumor(1),


6358


2'pooied^umore^(clear^c^^^^^^ anaplastic oligodendroglioma(4) brain(4) carcinoid(2) 
colon(3), colon tumor rer+(1), germinal center b cell(4), glioblastoma (pooled)(2), normal
prostate(1), normal prostatic epithelial cells(1), parathyroid tumor(2), pectoral muscle (after
mastectomy)(1), senescent fibroblast(3)


6384


epithelium(1), meningioma(1), parathyroid tumor(2), senescent fibroblast(1)


6400


blood(2), brain(2), colon(6), parathyroid tumor(1), tumor(1)


6431


adenocarcinoma(1)


6453


pooled germ cell tumors(3)


6636


2 pooled tumors (clear cell type)(1), ewing's sarcoma(2), adenocarcinoma(3), anaplastic
oligodendroglioma(3), breast tumor(1), carcinoid(4), colon(2), colon tumor(1), colon tumor
rer+(2), frontal lobe(3), germinal center b cell(6), kidney tumor(1), ovarian tumor(4),
parathyroid tumor(4), pectoral muscle (after mastectomy)(10), placenta(1), pooled germ cell
tumors(2), senescent fibroblast(1), squamous cell carcinoma from base of tongue(1), tumor, 5
pooled (see description)(1)


6688


b-cell, chronic lymphotic leukemia(3), alveolar rhabdomyosarcoma(1), anaplastic
oligodendroglioma(1), breast(2), carcinoid(1), colon(1), colon tumor(1), four pooled pituitary
adenomas(1), germinal center b cell(2), glioblastoma (pooled)(1), kidney tumor(2),
moderately-differentiated adenocarcinoma(1), muscle(2), pectoral muscle (after
mastectomy)(1), pooled germ cell tumors(3), senescent fibroblast(3), synovial sarcoma(1),
testis(1), tumor, 5 pooled (see description)(3)


6727


frontal lobe(1), schizophrenic brain s-11 frontal lobe(1)


6865


frontal lobe(4), germinal center b cell(1), muscle(1), ovarian tumor(2), pectoral muscle (after
mastectomy)(3), senescent fibroblast(1), small cell carcinoma(1), thyroid(1)


6892


2 pooled tumors (clear cell type)(5), adenocarcinoma(1), anaplastic oligodendroglioma(5),
brain(5), breast(3), breast tumor(1), carcinoid(5), cerebellum(1), colon(3), colon tumor
rer+(2), frontal lobe(5), germinal center b cell(3), glioblastoma (pooled)(2),
moderately-differentiated adenocarcinoma(1), normal prostate(3), ovary(2), parathyroid tumor(3),
pectoral muscle (after mastectomy)(1), placenta(1), pooled germ cell tumors(5), senescent
fibroblast(3), tumor(1), tumor, 5 pooled (see description)(1)


7000


anaplastic oligodendroglioma(2), frontal lobe(2)


7041


germinal center b cell(1), total brain(2)


7533


2 pooled tumors (clear cell type)(1), b-cell, chronic lymphotic leukemia(1), schwannoma
tumor(1), anaplastic oligodendroglioma(1), astrocytoma(1), breast(1), cochlea(1), colon
tumor rer+(2), germ cell tumor(1), germinal center b cell(2), glioblastoma (pooled)(3),
hepatoma(4), medulloblastoma(1), metastatic prostate bone lesion(1),
moderately-differentiated adenocarcinoma(1), ovarian tumor(1), prostate tumor(1), senescent
fibroblast(1), squamous cell carcinoma(1), three pooled meningiomas(1)


7535


b-cell, chronic lymphotic leukemia(1), schwannoma tumor(1), anaplastic
oligodendroglioma(1), astrocytoma(1), breast(1), cochlea(1), colon tumor rer+(2), germ cell
tumor(1), germinal center b cell(2), glioblastoma (pooled)(3), hepatoma(4),

adenocarcinoma(1), ovarian tumor(1), senescent fibroblast(1), squamous cell carcinoma(1)






-^1^1 


alveolar rhabdomyosareTm^^ ^ oligodeindrogliom^^^^^^^ 1 ) 
colon tumor rer+(1), germinal center b cell(2), glioblastoma (pooled)(1), normal prostate(1),
ovary(2), pooled germ cell tumors(2), thyroid(2), tumor(1)


8078


frontal lobe(2)




pooled germ cell tumors(3)


8097


colon(1)


8166


2 pooled tumors (clear cell type)(5), adenocarcinoma(6), anaplastic oligodendroglioma(3),
breast tumor(1), carcinoid(1), colon(1), epithelium (cell line)(1), glioblastoma (pooled)(2),
lung tumor(2), metastatic melanoma to bowel(1), moderately-differentiated
adenocarcinoma(1), normal prostate(1), ovary(1), papillary serous ovarian metastasis(1),
parathyroid tumor(1), pooled germ cell tumors(4), renal cell tumor(1), squamous cell
carcinoma(5), synovial sarcoma(1), tumor(3), tumor, 5 pooled (see description)(3)


8341


2 pooled tumors (clear cell type)(1), anaplastic oligodendroglioma(2), germinal center b
cell(3), glioblastoma (pooled)(1), normal prostate(1), oligodendroglioma(1), pectoral muscle
(after mastectomy)(2), tumor(1)


8534


germinal center b cell(4)


8666


2 pooled high-grade transitional cell tumors(1), 2 pooled tumors (clear cell type)(5), b-cell,
chronic lymphotic leukemia(2), adenocarcinoma(3), alveolar rhabdomyosarcoma(3),
amygdala(1), anaplastic oligodendroglioma(3), blood(2), breast(1), carcinoid(3), colon(3),
germinal center b cell(4), muscle(1), normal prostate(1), ovarian tumor(6), parathyroid
tumor(2), pectoral muscle (after mastectomy)(4), pheochromocytoma(1), placenta(1),
senescent fibroblast(11), two pooled squamous cell carcinomas(1)


8671


2 pooled high-grade transitional cell tumors(1), 2 pooled tumors (clear cell type)(3), b-cell,
chronic lymphotic leukemia(1), adenocarcinoma(6), alveolar rhabdomyosarcoma(3),
amygdala(2), anaplastic oligodendroglioma(2), blood(2), breast(2), carcinoid(1), colon(2),

normal prostate(1), ovarian tumor(5), parathyroid tumor(2), pectoral muscle (after
mastectomy)(5), placenta(2), pooled germ cell tumors(1), renal cell tumor(1), senescent
fibroblast(8)


8968




9406


brain(2), colon(1), tumor(1)




brain(2), colon(1), tumor(1)





2 pooled tumors (clear cell type)(8), adenocarcinoma(1), anaplastic oligodendroglioma(1),
breast carcinoma in situ(3), cerebellum(1), oligodendroglioma(1), papillary serous
carcinoma(2), parathyroid tumor(26), pooled germ cell tumors(1)


9679


ovarian tumor(3), parathyroid tumor(1), uterus(1)


9755


2 pooled tumors (clear cell type)(1), adenocarcinoma(1), carcinoid(1), ovarian tumor(1),
pooled germ cell tumors(3), senescent fibroblast(2), total brain(2)


9868


brain(1), senescent fibroblast(1)


10044


2 pooled high-grade transitional cell tumors(1), 2 pooled tumors (clear cell type)(3), b-cell,
chronic lymphotic leukemia(1), adenocarcinoma(6), alveolar rhabdomyosarcoma(3),
amygdala(2), anaplastic oligodendroglioma(2), blood(1), breast(2), carcinoid(1), colon(2),
germinal center b cell(5), liver cancer(1), metastatic prostate bone lesion(1), normal
prostate(1), osteosarcoma(1), ovarian tumor(5), parathyroid tumor(2), pectoral muscle (after
mastectomy)(5), placenta(2), pooled germ cell tumors(1), renal cell tumor(1), senescent
fibroblast(8)


10322


2 pooled tumors (clear cell type)(9), b-cell, chronic lymphotic leukemia(2), anaplastic
oligodendroglioma(4), breast(1), carcinoid(3), colon(3), colon tumor rer+(1), germinal center
b cell(1), glioblastoma (pooled)(4), kidney tumor(1), low-grade prostatic neoplasia(1),
metastatic melanoma to bowel(1), normal prostate(2), ovary bulk tumor(1), parathyroid
tumor(3), pectoral muscle (after mastectomy)(3), pooled germ cell tumors(2), senescent
fibroblast(2), small cell carcinoma(2), synovial sarcoma(2), two pooled squamous cell
carcinomas(3)


10526


2 pooled tumors (clear cell type)(2), ewing's sarcoma(4), adenocarcinoma(1), adipose tissue,
white(1), alveolar rhabdomyosarcoma(3), anaplastic oligodendroglioma(5), brain(1),
breast(4), breast tumor(1), carcinoid(2), colon(10), epithelium (cell line)(1), frontal lobe(7),
germ cell tumor(1), glioblastoma (pooled)(1), invasive prostate tumor(1), kidney tumor(8),
lymphoma(4), metastatic prostate bone lesion(2), muscle(3), normal prostate(2), normal
prostatic epithelial cells(3), ovarian tumor(1), ovary(2), parathyroid tumor(4), pectoral
muscle (after mastectomy)(26), placenta(1), pooled germ cell tumors(6), prostate(9),
senescent fibroblast(6), small cell carcinoma(1), tumor, 5 pooled (see description)(2)


10584


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(4), anaplastic
oligodendroglioma(6), carcinoid(2), colon(3), colon tumor rer+(4), colon tumor, rer+(2),
ovarian tumor(4)


10650


pooled germ cell tumors(2)


10739


2 pooled tumors (clear cell type)(5), ewing's sarcoma(1), liver(1), alveolar
rhabdomyosarcoma(1), anaplastic oligodendroglioma(7), breast(3), breast tumor(1),
carcinoid(9), colon(5), colon tumor rer+(3), frontal lobe(8), germinal center b cell(3),
glioblastoma (pooled)(11), invasive prostate tumor(1), kidney(1), metastatic prostate bone
lesion(1), moderately-differentiated adenocarcinoma(1), normal prostate(2), normal prostatic
epithelial cells(1), parathyroid tumor(4), pectoral muscle (after mastectomy)(6), placenta(1),
pooled germ cell tumors(1), tumor(1), tumor, 5 pooled (see description)(1)


10743


2 pooled tumors (clear cell type)(5), ewing's sarcoma(1), liver(1), schwannoma tumor(1),
alveolar rhabdomyosarcoma(1), anaplastic oligodendroglioma(7), breast(3), breast tumor(1),
carcinoid(10), colon(5), colon tumor rer+(3), germinal center b cell(3), glioblastoma
(pooled)(11), invasive prostate tumor(1), kidney(1), liver(1), metastatic prostate bone
lesion(1), moderately-differentiated adenocarcinoma(1), muscle(1), normal prostate(2),
normal prostatic epithelial cells(1), ovarian tumor(4), parathyroid tumor(4), pectoral muscle
(after mastectomy)(6), placenta(1), pooled germ cell tumors(2), thyroid(1), tumor(1), tumor,
5 pooled (see description)(1)


10744 


heart(2), four pooled pituitary adenomas(l), frontal lobe(3), glioblastoma (pooled)(l), 
liver(l), moderately-differentiated adenocarcinoma(l), peripheral blood(l), retina(l) 


10880 


2 pooled tumors (clear cell type)(3), ewing's sarcoma(l), adenocarcinoma(3), brain(l), 
carcinoid(8), germinal center b cell(l), glioblastoma (pooled)(5), kidney tumor(l), normal 
prostate(3), oiigodendrogiioma(l), parathyroid tumor(ll), senescent fibroblast(4), three 
pooled meningiomas(l), total brain(3) 


10942 


2 pooled tumors (clear cell type)(8), adenocarcinoma(l), anaplastic oligodendroglioma(l), 
breast carcinoma in situ(3), cerebellum(l), oligodendroglioma(l), papillary serous 
carcinoma(2), parathyroid tumor(26), pooled germ cell tumors(l) 


11019 


epidermis(l), ewing's sarcoma(2), heart(l), schwannoma tumor(l), adenocarcinoma(5), 
alveolar rhabdomyosarcoma(3), amygdala(l), brain(4), carcinoid(l), colon mucosa(l), 
endometrioid ovarian metastasis(l), epithelium (cell line)(2), frontal lobe(l), germ cell 
tumor(3), heart(l), invasive prostate tumor(2), kidney(2), kidney tumor(2), liposarcoma(l), 
liver(24), mantle cell lymphoma(3), medulloblastoma(l), metastatic prostate bone lesion(6), 
moderately-differentiated adenocarcinoma(l), muscle(l), normal prostatic epithelial cells(5), 
ovary(l), papillary serous ovarian metastasis(l), parathyroid tumor(l), placenta(2), pooled 
germ cell tumors(2), prostate(l), thyroid(l), tumor(l), tumor, 5 pooled (see description)(l), 
uterus(4) 


11278 


2 pooled tumors (clear cell type)(4), bone(4), heart(2), anaplastic oligodendroglioma(4), 
carcinoid(3), colon tumor rer+(l), epithelium (cell line)(l), frontal lobe(12), kidney(2), 
liposarcoma(l), liver(6), lung carcinoma(l), muscle(6), normal prostate(2), ovary(2), 
parathyroid tumor(6), pectoral muscle (after mastectomy )(3), schizophrenic brain s- 1 1 frontal 
lobe(l), senescent fibroblast(6), small cell carcinoma(4), synovial membrane(l) 


11342 


2 pooled tumors (clear cell type)(3), blood(l), germinal center b cell(4), normal 
epithellum(l}, pooled germ cell tumors(l), tumor, 5 pooled (sec dcscription)(l) 


Ti735 


tumors(8) 






12043 


spleen(1)


12048


2 pooled tumors (clear cell type)(1), anaplastic oligodendroglioma(1), brain(2), colon tumor
rer+(2), spleen(2)


12098


astrocytoma(1), ovarian tumor(1)


12202


2 pooled tumors (clear cell type)(2), adenocarcinoma(1), cochlea(1), germinal center b
cell(2), pituitary(1)


12243


alveolar rhabdomyosarcoma(2), anaplastic oligodendroglioma(1), carcinoid(9), colon(1),
colon tumor rer+(1), germinal center b cell(2), glioblastoma (pooled)(2), metastatic prostate
bone lesion(1), normal prostate(2), ovary(2), pooled germ cell tumors(2), thyroid(2),
tumor(1)


12263


anaplastic oligodendroglioma(1), carcinoid(1), germinal center b cell(2), invasive
adenocarcinoma(1), liver and spleen(2), papillary serous carcinoma(1)


12490


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(1), ewing's
sarcoma(1), adenocarcinoma(2), anaplastic oligodendroglioma(1), germinal center b cell(2),
kidney tumor(1), larynx(1), mantle cell lymphoma(1), medulloblastoma(1), melanocyte(2),
moderately-differentiated adenocarcinoma(1), ovarian tumor(12), parathyroid tumor(2),
pooled germ cell tumors(4), prostate(2), tumor, 5 pooled (see description)(2)


12604 


2 pooled tumors (clear cell type)(12), b-cell, chronic lymphotic leukemia(4), bone(1), ewing's
sarcoma(1), adenocarcinoma(1), anaplastic oligodendroglioma(10), carcinoid(3), colon(3),
colon tumor rer+(1), germinal center b cell(1), glioblastoma (pooled)(3), invasive prostate
tumor(1), kidney(1), larynx(1), muscle(1), normal leg muscle(1), normal prostate(3), ovarian
tumor(1), parathyroid tumor(12), pectoral muscle (after mastectomy)(28), pooled germ cell
tumors(3), prostate(3), renal cell tumor(1), senescent fibroblast(12), skeletal muscle(1)


12657


adenocarcinoma(1), carcinoid(2), colon tumor, rer+(2)


12788


2 pooled tumors (clear cell type)(3), adenocarcinoma(1), adrenal adenoma(1), alveolar
rhabdomyosarcoma(1), carcinoid(1), colon(1), colon tumor rer+(1), germinal center b cell(1),
glioblastoma (pooled)(2), ovary(1), parathyroid tumor(1), senescent fibroblast(2), tumor, 5
pooled (see description)(1)


12901


anaplastic oligodendroglioma(2), carcinoid(1), germinal center b cell(6), invasive tumor (cell
line)(1), parathyroid tumor(1)


12907


adipose tissue, white(1), carcinoid(1), parathyroid tumor(2), pooled frontal lobe(1), tumor, 5
pooled (see description)(1)




invasive tumor (cell line)(1), parathyroid tumor(3)


13202


2 pooled tumors (clear cell type)(1), b-cell, chronic lymphotic leukemia(3), breast(1), breast
tumor(1), carcinoid(2), colon tumor rer+(1), colon tumor, rer+(1), germinal center b cell(1),
kidney tumor(1), lung carcinoma(1), lung tumor(1), meningioma(2), moderately

carcinoma(1), tumor, 5 pooled (see description)(1)






^mi — 


2°pooled tomTrs^'(cl^^^^ r^e)(2)^adenocardn^ anaplastic oligodendroglioma(l) 
parathyroid tumor(1), pooled germ cell tumors(3)


13285


2 pooled tumors (clear cell type)(6), carcinoid(1), colon tumor rer+(1), kidney(1), liver(1)


26710 


2 pooled tumors (clear cell typeXO, pooled germ cell tumors(l ) 


27273 


anaplastic oligodendroglioma(1), cerebellum(1), germinal center b cell(2), moderately-
differentiated adenocarcinoma(1), pectoral muscle (after mastectomy)(2)


27301


astrocytoma(2), ovarian tumor(1)


27336


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(6),
adenocarcinoma(1), anaplastic oligodendroglioma(9), carcinoid(6), colon(1), colon tumor

(pooled)(1), pooled germ cell tumors(4), senescent fibroblast(2)


27361


colon(1)






27627


2 pooled tumors (clear cell type)(1), anaplastic oligodendroglioma(1), frontal lobe(1),
germinal center b cell(1), glioblastoma (pooled)(2), ovarian tumor(1)




skeletal muscle(2), thyroid(1)


27877


carcinoid(1), colon(1), germinal center b cell(1)


29469


frontal lobe(4), germinal center b cell(1), ovarian tumor(2), pectoral muscle (after
mastectomy)(3), senescent fibroblast(1), small cell carcinoma(1), thyroid(1)


"29557 


ewing's sarcoma(1), larynx(1), medulloblastoma(1), moderately-differentiated
adenocarcinoma(1), ovarian tumor(7), parathyroid tumor(2), pooled germ cell tumors(1),


29673


heart(1), colon tumor(1), kidney tumor(1), lung tumor(3), lymphoid(2), moderately
differentiated adenocarcinoma(1), muscle(1), ovarian tumor(3), squamous cell carcinoma(1)


29814


brain(2), colon(1), pancreatic islet(1)




small cell carcinoma(2) 


-^^^ 


tumor(1), ovarian tumor(1), parathyroid tumor(1), prostate(1), small cell carcinoma(1)


30583


carcinoid(1), ovarian tumor(1), pooled germ cell tumors(2)


30719


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(1), breast(2),
colon(1), colon tumor, rer+(1), germinal center b cell(1), glioblastoma (pooled)(3), liver(2),
normal prostate(2), pooled germ cell tumors(10)


31356


pooled germ cell tumors(1)


31422


2 pooled tumors (clear cell type)(1), b-cell, chronic lymphotic leukemia(2), bone(1),
anaplastic oligodendroglioma(11), bone marrow stroma(1), brain(2), breast(1), carcinoid(20),
colon(6), colon tumor, rer+(1), germinal center b cell(2), glioblastoma (pooled)(3), kidney
tumor(1), larynx(1), medulloblastoma(1), normal prostate(1), ovarian tumor(1), parathyroid
tumor(1), pectoral muscle (after mastectomy)(2), pooled germ cell tumors(2), senescent
fibroblast(5), tumor(1), tumor, 5 pooled (see description)(3)


31554 


2 pooled tumors (clear cell type)(1), carcinoid(1), germinal center b cell(4)


31627


2 pooled tumors (clear cell type)(1), alveolar rhabdomyosarcoma(1), anaplastic
oligodendroglioma(2), frontal lobe(7), normal prostate(1), oligodendroglioma(1), parathyroid
tumor(1), pectoral muscle (after mastectomy)(3), pooled germ cell tumors(1)


31744


adenocarcinoma(1)


31790


ovarian tumor(1), synovial membrane(1)


32102


2 pooled tumors (clear cell type)(1), b-cell, chronic lymphotic leukemia(2), heart(3),
lymphoma(2), adenocarcinoma(6), adipose tissue, white(1), anaplastic oligodendroglioma(5),
brain(2), carcinoid(9), cerebellum(5), colon(7), epithelium (cell line)(1), frontal lobe(4), germ
cell tumor(1), glioblastoma (pooled)(3), juvenile granulosa tumor(1), kidney tumor(2),
medulloblastoma(1), melanoma (mewo cell line)(1), metastatic melanoma to bowel(1),
moderately-differentiated adenocarcinoma(1), normal epithelium(1), normal prostate(3),
omentum(1), ovarian tumor(2), pancreatic islet(1), parathyroid tumor(4), pectoral muscle
(after mastectomy)(2), placenta(1), pooled frontal lobe(1), pooled germ cell tumors(1), retinal
fovaea(1), schizophrenic brain s-11 frontal lobe(1), senescent fibroblast(5), small cell
carcinoma(4), synovial membrane(1), total brain(2), tumor, 5 pooled (see description)(1), two
pooled squamous cell carcinomas(2)


32473


2 pooled tumors (clear cell type)(3), heart(1), adipose tissue, white(1), alveolar
rhabdomyosarcoma(3), anaplastic oligodendroglioma(3), breast(2), colon(2), colon tumor
rer+(1), invasive prostate tumor(1), kidney(1), metastatic prostate bone lesion(1), normal
prostate(1), pectoral muscle (after mastectomy)(22), pooled germ cell tumors(2), senescent
fibroblast(1)


32475 


2 pooled tumors (clear cell type)(3), heart(1), adipose tissue, white(1), alveolar
rhabdomyosarcoma(3), anaplastic oligodendroglioma(3), breast(2), colon(2), colon tumor

(after mastectomy)(22), pooled germ cell tumors(2), senescent fibroblast(1)




brain frontal cortex(2), breast(1), germinal center b cell(4), juvenile granulosa tumor(1),


35005


bone(1), pooled germ cell tumors(1)


35185


spinal cord(1)




brain(1), breast(2), carcinoid(1), colon(1), germinal center b cell(2), human pancreatic
islets(1), liver(1), melanocyte(1), ovarian tumor(1), placenta(1), total brain(1)


37337


anaplastic oligodendroglioma(2), colon(2), glioblastoma (pooled)(3), juvenile granulosa
tumor(1), pooled germ cell tumors(3), tumor, 5 pooled (see description)(1), uterus(2)


38220 


2 pooled tumors (clear cell typeXO, b-cell, chronic lymphotic leukemia(l), 
adenocarcinoma(2), breast(1), carcinoid(1), frontal lobe(1), germinal center b cell(2), normal
prostate(1), normal prostatic epithelial cells(1), pooled germ cell tumors(3), total brain(2)


~383Ti 


adenocarcinoma(1), placenta(1)


38631


brain(1)


38749


2 pooled tumors (clear cell type)(4), breast(1), cochlea(1), colon tumor rer+(1), liver(1),
moderately differentiated adenocarcinoma(1), normal prostate(1), pancreas (with no medical
abnormalities)(1), parathyroid tumor(1), prostate(1)


40975


brain(2), tumor(1)


40991


brain(2), tumor(1)


44053


2 pooled tumors (clear cell type)(1), bone(1), total brain(1)


45179


colon(1), metastatic prostate bone lesion(1)


45274


2 pooled tumors (clear cell type)(6), cd34+, cd38- from normal bone marrow donor(1),
ewing's sarcoma(1), heart(1), lung(1), adenocarcinoma(8), adrenal adenoma(1), alveolar
rhabdomyosarcoma(3), anaplastic oligodendroglioma(5), borderline ovarian carcinoma(17),
brain(3), breast(4), breast carcinoma in situ(20), bronchioalveolar carcinoma(4), carcinoid(4),
cerebellum(1), colon(4), colon tumor rer+(3), early stage papillary serous carcinoma(3),
glioblastoma (pooled)(2), invasive adenocarcinoma(8), invasive carcinoma(2), lobullar
carcinoma in situ(2), low-grade prostatic neoplasia(1), lung carcinoma(1), lung tumor(1),
normal prostate(2), normal prostatic epithelial cells(1), oligodendroglioma(3), ovarian

tumor(5), pectoral muscle (after mastectomy)(7), pooled germ cell tumors(2), senescent
fibroblast(3), stem cell 34+/38+(2), thymus(1)


46679 


2 pooled tumors (clear cell type)(4), anaplastic oligodendroglioma(1), pectoral muscle (after
mastectomy)(1), pooled germ cell tumors(1)


48024


2 pooled tumors (clear cell type)(3), heart(1), adipose tissue, white(1), alveolar
rhabdomyosarcoma(3), anaplastic oligodendroglioma(3), breast(2), colon(2), colon tumor

prostate(1), pectoral muscle (after mastectomy)(22), pooled germ cell tumors(2), senescent
fibroblast(1)


48548


liver(4)


48823


2 pooled tumors (clear cell type)(1), bone(1), alveolar rhabdomyosarcoma(2), breast(1),
epithelium (cell line)(1), glioblastoma (pooled)(1), lung(1), lung tumor(1), parathyroid
tumor(1), pectoral muscle (after mastectomy)(1), renal cell tumor(1)


48901


germinal center b cell(1), pooled germ cell tumors(1)


49018


anaplastic oligodendroglioma(1), brain(1), breast(1), frontal lobe (see description)(1),
testis(5)


49034


adipose tissue, white(1), pancreas (with no medical abnormalities)(1)


49133


retina(1)


49140


2 pooled tumors (clear cell type)(1), adenocarcinoma(1), carcinoid(6), colon tumor rer+(1),
glioblastoma (pooled)(2), moderately differentiated adenocarcinoma(1), normal prostate(1),
pancreatic islet(2), parathyroid tumor(3)


49387 


colon(1), pooled germ cell tumors(1)


49416


anaplastic oligodendroglioma(1), carcinoid(3), cerebellum(2), frontal lobe(2), germinal center


49493


neuroepithelial cells(1), ovarian tumor(4), thyroid(1)


49640


2 pooled tumors (clear cell type)(1), anaplastic oligodendroglioma(1), brain(2), colon tumor
rer+(1), spleen(2)


49863


b-cell, chronic lymphotic leukemia(1), carcinoid(1), germinal center b cell(2), pooled germ


49871


b-cell, chronic lymphotic leukemia(1), carcinoid(1), germinal center b cell(2), pooled germ
cell tumors(3)


50185


total brain(1)


50763


cerebral cortex(3), brain(2), colon(1), colon tumor rer+(1), ovarian tumor(1), schizophrenic
brain s-11 frontal lobe(1)


50982


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(3), carcinoid(1),
cochlea(1), germinal center b cell(2)


51130 


adenocarcinoma(1)


51212


placenta(2)


51346


bone(1), small cell carcinoma(2)


51380


placenta(4)


51400


liver(1)


51954


germinal center b cell(1), total brain(1)


52076


normal prostatic epithelial cells(1)



TABLE V 



SEQ ID NO. in priority application


Low frequency expression


High frequency expression


37 


salivary gland 




51 


fetal brain, fetal kidney 


brain, salivary gland, liver 


179 


liver 




180 






183 


prostate 




326 


salivary gland 




362 


testis 




440 


brain 




452 


placenta 




483 


placenta 




500 


salivary gland 




528 


testis




573


liver 




587 


salivary gland 




588 


liver 




593 


fetal kidney, placenta 




599 


liver 




628 


testis 




653 


testis 




670 


testis 




693 


placenta 




703 


liver 




746 


brain 




770 


placenta 




775 


testis 




812 


brain 




940 


liver 




988 


liver 




996 


testis 




1036 


salivary gland, brain 




1064 


brain, liver, prostate 


fetal brain 


1151 


brain 




1190 


salivary gland, prostate 




1458 


stomach/intestine 




1904 


prostate 




2028 


brain 




2368 


testis 




2553 


prostate 




2556 


prostate 




2690 


testis 




2755 


prostate 




2800 


liver, testis 




2843 


brain 




2932 


fetal liver 




3078 


prostate 




3280 


liver 




3326 


prostate 




3387 


prostate 




3439 


salivary gland 




3501 


brain 




3633 


brain 




3678 


testis 




3714 


liver 




3801 


testis 




3804 


salivary gland, fetal kidney 




3892 


brain 




3985 


stomach/intestine 




4005 


liver 




4063 


fetal brain 




4088 


testis 




4111 


brain 




4126 


liver 




4172 


liver 




4261 


prostate 




4436 


liver 




4609 


fetal liver 




4647 


fetal liver 




4660 


fetal liver 








PCT/IB01/00914









—— 


/^^l liv^'r 




— — 


eta iver 




— — 

4^51 


fetal liver




-"l^- 


salivary gland 




-^l^l 


salivary gland 






f^!^i u^^*^ 




— — 

—5^ 


fetal brain






salivary gland 




— — 


prostate 




— — 






— — 


'T^ 

__iver 




-— — 

—^^^ 






— 


ur '■ 

stomach/intestine 




5000


fetal kidney 




"I^TT 


fetal kidney 




-55— 


P'^^f f ^ 




-5-^5 


fetal liver




-|— 


salivary gland 










— — 

^—^ 


Uver^^^^ 






fetal brain 




— — 

—— ^ 


salivary gland 






|!^^[ — ■ 




— — 




fetal brain







fetal kidney, fetal brain 




5262




fetal kidney, fetal brain


-^P? 


fetal brain 




"Lri 


fetal liver 






fetal kidney 




— — 


fetal liver 




"I4I9 






5533 


fetal brain 




5563 


fetal liver 




5609 


placenta 







placenta 




5816


fetal brain 




5824 


prostate 




5861 


fetal liver, prostate, fetal kidney, 
salivary gland 




5947


fetal liver 




"sgfifi 


placenta 




-— — 


f ta! liveT^ 




"5974 






5983


brain




^985 


fetal kidney




"6080 


salivary gland




-TT— 






"6108 


\\veT^'^ ^ ^" 




6238 


placenta, brain 




6252 


placenta 




6334 


liver 




6345 


brain 




6358 


fetal brain 




6636 


fetal brain 




6688 


fetal brain

testis










-— r 


T '^^"^^ f — fetal brain 




— — 


Hvct"^ ^ ~ 




-=— 


— i 

placenta 




— — 

~7^P 








feitanver 




"ttTt 


!^ — _ 




-— — 






— — 

— — ^ 


""T^^ — r~H 

— ; ^-^j — — 







placenta, fetal liver 




-^l^ 


prostate 











— — 

—^^2 


placenta 






prostate 




— — 




brain




"5^5^ 


-T^^. 




-|=^^ 


fetal liver 






testis




— 


prostate 




— — — 


prostate 




— — 

— 


testis







testis 




_2222_ 








fetal liver, placenta




— — — 

— ^^-^ 


prostate 




^ 


testis







stomach/intestine




"T^^^T 


salivary gland 




^ii^ 


prostate 




^^1^ 


fetal liver 











— — — 






— — — 

TiT^f 


— V i — J 

salivary gland 






je^s 














■ 




— — — 


brain







"braiii ~~~~~ — 







-j^. 







^ ■ ■ 




^2243 


brain, testis, liver


fetal brain, fetal liver









^2276 


^^^^^ ^ 




^2604 


fetal kidney







fetal kidney, stomach/intestine







_esis 




— — — 


placenta







salivary gland 




^3013 


fetal liver 




13229 


brain 




13256 


testis 




13267 


salivary gland 




13285 


fetal brain 


fetal kidney, stomach/intestine 


26638 


fetal liver 




26710 


brain




26726


fetal kidney 




-i|P^ 


brain




27084







27273 


fetal kidney 




J.\ . 


jj^^y 




zL]: 


fetal liver 




-zi^. 


salivary gland 






prostate 







testis 







placenta 






^ 


salivary gland 




29673


fetal liver 







jver 




-|5zz2 


prostate 






placenta 






30583


testis




30719


fetal brain, salivary gland 




"T^S 


prostate 




"IttIs 


brain






J£!H! 







prostate 






38890


placenta 




' 


testis




40975


prostate 




— -^I?! 


prostate 




-i=^?2 


prostate 




-p5££ 


stomach/intestine, fetal kidney 






stomach/intestine







prostate 







fetal kidney 






46679


fetal liver 




-^|^±£ 


fetal kidney 







liver, testis 




^^^^ 


^^^'^ 




48823


fetal kidney 






salivary gland 




— 


stomach/intestine 






salivary gland 




"dSTTs 







— — 








— |— 


l^^T 






— — — — : 

fetal brain 




- 


stomach/intestine 







salivary gland 




— — — 


brain 




— — — 


_J2!D 




— — 







— — — 

-55— 


testis










13Ti2 


^eiis 




50185 


salivary gland 




50241 


liver 




50763 


brain 




51130 


testis 




51212 


stomach/intestine 




51400 


liver

WHAT IS CLAIMED IS:

1. An isolated polynucleotide, said polynucleotide comprising a nucleic acid sequence encoding:

i) a polypeptide comprising the amino acid sequence shown as SEQ ID NO:305;

ii) a polypeptide comprising any one of the amino acid sequences shown as SEQ ID NOs:170-304, 306-338, 456-560, 785-918; or

iii) a biologically active fragment of any of said polypeptides.

2. The polynucleotide of claim 1, wherein said polypeptide comprises a signal peptide.

3. The polynucleotide of claim 1, wherein said polypeptide is a mature protein.

4. The polynucleotide of claim 1, wherein said polynucleotide comprises any one of the nucleic
acid sequences shown as SEQ ID NOs:1-169, 339-455, or 561-784.

5. The polynucleotide of claim 1, wherein said polynucleotide is operably linked to a promoter.

6. An expression vector comprising the polynucleotide of claim 5.

7. A host cell recombinant for the polynucleotide of claim 1.

8. A non-human transgenic animal comprising the host cell of claim 7.

9. A pharmaceutical composition comprising the polynucleotide of claim 1, and a
pharmaceutically acceptable carrier.

10. A method of making a GENSET polypeptide, said method comprising

a) providing a population of host cells comprising the polynucleotide of claim 5; and

b) culturing said population of host cells under conditions conducive to the
production of said polypeptide within said host cells.

11. The method of claim 10, further comprising purifying said polypeptide from said population of
host cells.

12. An isolated polynucleotide, said polynucleotide comprising any one of the nucleic acid
sequences shown as SEQ ID NOs:1-169, 339-455, or 561-784.

13. A biologically active polypeptide encoded by the polynucleotide of claim 12.

14. An isolated polypeptide or biologically active fragment thereof, said polypeptide comprising
any one of the amino acid sequences shown as SEQ ID NOs:170-338, 456-560, or 785-918.

15. The polypeptide of claim 14, wherein said polypeptide comprises a signal peptide.

16. The polypeptide of claim 14, wherein said polypeptide is a mature protein.

17. An antibody that specifically binds to the polypeptide of claim 14.

18. A pharmaceutical composition comprising the polypeptide of claim 14, and a pharmaceutically
acceptable carrier.

19. A method of making a GENSET polypeptide, said method comprising

a) providing a population of cells comprising a polynucleotide encoding
the polypeptide of claim 14, operably linked to a promoter;

b) culturing said population of cells under conditions conducive to the
production of said polypeptide within said cells; and

c) purifying said polypeptide from said population of cells.



20. A method of determining whether a GENSET gene is expressed within a mammal, said method
comprising the steps of:

a) providing a biological sample from said mammal;

b) contacting said biological sample with either of:

i) a polynucleotide that hybridizes under stringent conditions to the
polynucleotide of claim 1; or

ii) a polypeptide that specifically binds to the polypeptide of claim 14; and

c) detecting the presence or absence of hybridization between said polynucleotide
and an RNA species within said sample, or the presence or absence of binding
of said polypeptide to a protein within said sample;

wherein a detection of said hybridization or of said binding indicates that said GENSET gene is
expressed within said mammal.

21. The method of claim 20, wherein said polynucleotide is a primer, and wherein said
hybridization is detected by detecting the presence of an amplification product comprising the
sequence of said primer.

22. The method of claim 21, wherein said polypeptide is an antibody.

23. A method of determining whether a mammal has an elevated or reduced level of GENSET gene
expression, said method comprising the steps of:

a) providing a biological sample from said mammal; and

b) comparing the amount of the polypeptide of claim 14, or of an RNA species
encoding said polypeptide, within said biological sample with a level
detected in or expected from a control sample;

wherein an increased amount of said polypeptide or said RNA species within said biological
sample compared to said level detected in or expected from said control sample indicates that said
mammal has an elevated level of said GENSET gene expression, and wherein a decreased amount
of said polypeptide or said RNA species within said biological sample compared to said level
detected in or expected from said control sample indicates that said mammal has a reduced level of
said GENSET gene expression.

24. A method of identifying a candidate modulator of a GENSET polypeptide, said method
comprising:

a) contacting the polypeptide of claim 14 with a test compound; and

b) determining whether said compound specifically binds to said
polypeptide;

wherein a detection that said compound specifically binds to said polypeptide indicates that
said compound is a candidate modulator of said GENSET polypeptide.

25. The method of claim 24, further comprising testing the biological activity of said GENSET
polypeptide in the presence of said candidate modulator, wherein an alteration in the biological
activity of said GENSET polypeptide in the presence of said compound in comparison to the
activity in the absence of said compound indicates that the compound is a modulator of said
GENSET polypeptide.

26. A method for the production of a pharmaceutical composition comprising

a) identifying a modulator of a GENSET polypeptide using the method of
claim 24; and

b) combining said modulator with a pharmaceutically acceptable carrier.
[Flowchart figure: START; STORE A FIRST SEQUENCE TO A MEMORY; STORE A SECOND SEQUENCE TO A MEMORY]

SEQUENCE LISTING
<110> GENSET SA 

<120> Full-length human cDNAs encoding potentially secreted proteins 

<130> 81. WO 

<150> US 60/197,873 

<151> 2000-04-18 

<150> US 60/224,009 

<151> 2000-08-07 

<150> US 60/260,328 

<151> 2001-01-08 

<150> US 60/224,006 

<151> 2000-08-04 

<160> 918 

<170> Patent.pm



<210> 1 

<211> 615 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 88..471

<220>
<221> sig_peptide
<222> 88..189
<223> Von Heijne matrix
      score 13.1278588169685
      seq VLLLLSLLLTAXA/SS

<220>
<221> polyA_site
<222> 600..615

<220>
<221> polyA_signal
<222> 577..582

<400> 1 

acttatctgc agacttgtag gcagcaactc accctcactc agaggtcttc tggttctgga      60
aacaactcta gctcagcctt ctccacc atg agc ctc aga ctt gat acc acc cct     114
                              Met Ser Leu Arg Leu Asp Thr Thr Pro
                                              -30

tcc tgt aac agt gcg aga cca ctt cat gcc ttg cag gtg ctg ctg ctt     162
Ser Cys Asn Ser Ala Arg Pro Leu His Ala Leu Gln Val Leu Leu Leu
-25                 -20                 -15                 -10

ctg tca ttg ctg ctg act gct ytg gct tcc tcc acc aaa gga caa ayt     210
Leu Ser Leu Leu Leu Thr Ala Leu Ala Ser Ser Thr Lys Gly Gln Xaa
                -5                  1               5

aag aga aac ttg gcg ... ggc aaa gac gaa agt cta gac agt gac ttg     258
Lys Arg Asn Leu Ala Lys Gly Lys Asp Glu Ser Leu Asp Ser Asp Leu
        10                  15                  20

tat gct gaa ctc cgc tgc atg tgt ata aag aca acc tct gga att cat     306
Tyr Ala Glu Leu Arg Cys Met Cys Ile Lys Thr Thr Ser Gly Ile His
    25                  30                  35

ccc aaa aac atc caa agt ttg gaa gtg atc ggg aaa gga acc cat tgc     354
Pro Lys Asn Ile Gln Ser Leu Glu Val Ile Gly Lys Gly Thr His Cys
40                  45                  50                  55

... ... gtc gaa gtg ata gcc ... ctg aag ... ggg agg aaa atc tgc     402
Asn Gln Val Glu Val Ile Ala Thr Leu Lys Asp Gly Arg Lys Ile Cys
                60                  65                  70

ctg gac cca gat gct ccc aga atc aag aaa att gta cag aaa aaa ttg     450
Leu Asp Pro Asp Ala Pro Arg Ile Lys Lys Ile Val Gln Lys Lys Leu
            75                  80                  85

gca ggt gat gaa tct gct gat taatttgttc tgtttctgcc aaacttcttt          501
Ala Gly Asp Glu Ser Ala Asp
        90

aactcccagg aagggtagaa ttttgaaacc ttgattttct agttctcatt tattcaggat     561
acctattctt actgtattaa aatttggata tgtgtttcaa aaaaaaaaaa aaaa           615
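
The feature coordinates given for a record fix how the residue numbers under the sequence are laid out. As a minimal illustrative sketch (the helper name `split_cds` is hypothetical and not part of the listing), the signal peptide length, the mature protein length, and the number assigned to the initiator methionine can be derived from the 1-based, inclusive CDS and sig_peptide ranges. It assumes that the CDS range excludes the stop codon and that the signal peptide starts at the first codon of the CDS, which holds for SEQ ID NO: 1.

```python
def split_cds(cds, sig_peptide):
    """Derive residue counts from 1-based, inclusive nucleotide ranges.

    cds and sig_peptide are (start, end) tuples copied from <222> lines.
    Assumption: the CDS range excludes the stop codon and the signal
    peptide begins at the first codon of the CDS.
    """
    cds_nt = cds[1] - cds[0] + 1
    sig_nt = sig_peptide[1] - sig_peptide[0] + 1
    if cds[0] != sig_peptide[0]:
        raise ValueError("signal peptide must start at the CDS start")
    if cds_nt % 3 or sig_nt % 3:
        raise ValueError("ranges must cover whole codons")
    signal_len = sig_nt // 3
    mature_len = cds_nt // 3 - signal_len
    # Signal peptide residues carry negative numbers, so the initiator
    # methionine is residue -signal_len and the mature protein starts at 1.
    return signal_len, mature_len, -signal_len


# SEQ ID NO: 1: CDS 88..471, sig_peptide 88..189
print(split_cds((88, 471), (88, 189)))  # (34, 94, -34)
```

For SEQ ID NO: 1 this gives a 34-residue signal peptide and a 94-residue mature protein, consistent with the numbering running from -34 at the initiator methionine to 94 at the final Asp.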

<210> 2 

<211> 539 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 39..518

<220>
<221> sig_peptide
<222> 39..92
<223> Von Heijne matrix
      score 12.2866240999866
      seq LTLAVLFLTGSQA/RH

<220>
<221> polyA_site
<222> 524..539

<220>
<221> polyA_signal
<222> 507..512

<400> 2 

agagactgcg agaaggaggt cccccacggc ccttcagg atg aaa gct gcg gtg ctg      56
                                          Met Lys Ala Ala Val Leu
                                                      -15

acc ttg gcc gtg ctc ttc ctg acg ggg ... ... gct cgg cat ttc tgg     104
Thr Leu Ala Val Leu Phe Leu Thr Gly Ser Gln Ala Arg His Phe Trp
        -10                 -5                  1

cag ... gat gaa ... ... ... agc ... tgg gat ... gtg aag ... ctg     152
Gln Gln Asp Glu Pro Pro Gln Ser Pro Trp Asp Arg Val Lys Asp Leu
5                   10                  15                  20

... act gtg tac gtg gat gtg ctc aaa gac agc ggc aga gac tat gtg     200
Ala Thr Val Tyr Val Asp Val Leu Lys Asp Ser Gly Arg Asp Tyr Val
                25                  30                  35

tcc cag ttt gaa ggc tcc gcc ttg gga aaa cag cta aac cta aag ctc     248
Ser Gln Phe Glu Gly Ser Ala Leu Gly Lys Gln Leu Asn Leu Lys Leu
            40                  45                  50

ctt gac aac tgg gac agc gtg acc tcc gcc ttc agc aat ctg cgc gaa     296
Leu Asp Asn Trp Asp Ser Val Thr Ser Ala Phe Ser Asn Leu Arg Glu
        55                  60                  65

cag ... ggc cct gtg acc cag gag tty tgg gat ... ctg ... aag gag     344
Gln Leu Gly Pro Val Thr Gln Glu Phe Trp Asp Asn Leu Glu Lys Glu
    70                  75                  80

aca gag ggc ctg agg cag gag atg agc aag gat ctg gag gag gtg aag     392
Thr Glu Gly Leu Arg Gln Glu Met Ser Lys Asp Leu Glu Glu Val Lys
85                  90                  95                  100

gcc aag gtg cag ccc tac ctg gac gac ttc cag aag aag tgg cag gag     440
Ala Lys Val Gln Pro Tyr Leu Asp Asp Phe Gln Lys Lys Trp Gln Glu
                105                 110                 115

gag atg gag ctc tac cgc cag aag gca gct ttc tta act atc cta aca     488
Glu Met Glu Leu Tyr Arg Gln Lys Ala Ala Phe Leu Thr Ile Leu Thr
            120                 125                 130

... ctt gga cca aat gga aat aaa gct ttt tgatgaaaaa aaaaaaaaaa a     539
Ser Leu Gly Pro Asn Gly Asn Lys Ala Phe
        135                 140



<210> 3
<211> 796
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 134..373

<220>
<221> sig_peptide
<222> 134..211
<223> Von Heijne matrix
      score 12.0462155590967
      seq LLLALLLPTQIYS/SE

<400> 3

atttgtggct ttcctggtat ataaggtctc gccggctcgc cgcgctcccc accttgcctg 60 
cgcccgcccg gagccagcgg ttctccaagc acccagcatc ctgctagacg cgccgcgcac 120 
cgacggaggg gac atg ggc aga gca atg gtg gcc agg ctc ggg ctg ggg 169
               Met Gly Arg Ala Met Val Ala Arg Leu Gly Leu Gly
                   -25                 -20                 -15

ctg ctg ctg ctg gca ctg ctc cta ccc acg cag att tat tcc agt gaa 217
Leu Leu Leu Leu Ala Leu Leu Leu Pro Thr Gln Ile Tyr Ser Ser Glu
                -10                 -5                  1

    aca act gga act tca agt aac tcc tcc cag agt act         tct 265
Thr Thr Thr Gly Thr Ser Ser Asn Ser Ser Gln Ser Thr Ser Asn Ser
        5                   10                  15

ggg ttg         aat     act aat     acc acc aag gtg gct ggt ggt 313
Gly Leu Ala Pro Asn Pro Thr Asn Ala Thr Thr Lys Val Ala Gly Gly
    20                  25                  30

gcc ctg cag tca aca gcc agt ctc ttc gtg gtc tca ctc tct ctt ctg 361
Ala Leu Gln Ser Thr Ala Ser Leu Phe Val Val Ser Leu Ser Leu Leu
35                  40                  45                  50

cay cyc tac tct taagagactc aggccaagaa acgtyttcta aatttcccca 413
His Xaa Tyr Ser

tcttctaaac ccaatccaaa tggcgtctgg aagtccaatg tggcaaggaa aaacaggtct 473

tcatcgaatc tactaattcc acacctttta ttgacacaga aaatgttgag aatcccaaat 533

ttgattgatt tgaagaacat gtgagaggtt tgactagatg atggatgcca atattaaatc 593

tgctggagtt tcatgtacaa gatgaaggag aggcaacatc caaaatagtt aagacatgat 653

ttccttgaat gtggcttgag aaatatggac acttaatact accttgaaaa taagaataga 713 

aataaaggat gggattgtgg aatggagatt cagttttcat ttggttcatt aattctataa 773 

ggccataaaa caggtaatat aaa 796 



<210> 4
<211> 528
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 75..377

<220>
<221> sig_peptide
<222> 75..155
<223> Von Heijne matrix
      score 11.8486806626293
      seq LFLGLLLLPLVVA/FA

<220>
<221> polyA_site
<222> 513..528

<220>
<221> polyA_signal
<222> 498..503

<400> 4

attggccaca gagacccagc ccgagtttcc catcgcactg agcactgaga tcctgctgga 60 



agctctgccg cagc atg agc tcc gca gcc ggg ttc tgc gcc tca cgc ccc 110
                Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro
                        -25                 -20

ggg ctg ctg ttc ctg ggg ttg ctg ctc ctg cca ctt gtg gtc gcc ttc 158
Gly Leu Leu Phe Leu Gly Leu Leu Leu Leu Pro Leu Val Val Ala Phe
-15                 -10                 -5                  1

gcc agc gct gaa gct gaa gaa gat ggg gac ctg cag tgc ctg tgt gtg 206
Ala Ser Ala Glu Ala Glu Glu Asp Gly Asp Leu Gln Cys Leu Cys Val
            5                   10                  15

aag acc acc tcc cag gtc cgt ccc agg cac atc acc agc ctg gag gtg 254
Lys Thr Thr Ser Gln Val Arg Pro Arg His Ile Thr Ser Leu Glu Val
        20                  25                  30

atc aag gcc gga ccc cac tgc ccc act gcc caa ctg ata gcc acg ctg 302
Ile Lys Ala Gly Pro His Cys Pro Thr Ala Gln Leu Ile Ala Thr Leu
    35                  40                  45

aag aat gga agg aaa att     ttg gac         gcc     ctg tac aag 350
Lys Asn Gly Arg Lys Ile Cys Leu Asp Leu Gln Ala Pro Leu Tyr Lys
50                  55                  60                  65

    ata att         ctt ttg gag agt tagctactag ctgcctacgt 397
Lys Ile Ile Lys Lys Leu Leu Glu Ser
                70



















gtgtgcattt gctatatagc atacttcttt tttccagttt caatctaact gtgaaagaac 457
ttctgatatt tgtgttatcc ttatgatttt aaataaacaa aataaatcaa gttgcaaaaa 517 
aaaaaaaaaa a 528 



<210> 5
<211> 522
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 76..378

<220>
<221> sig_peptide
<222> 76..156
<223> Von Heijne matrix






WO 02/083898 PCT/IB01/00914

      score 11.8486806626293
      seq LFLGLLLLPLVVA/FA

<220>
<221> polyA_site
<222> 508..522

<220>
<221> polyA_signal
<222> 499..504

<400> 5

aattggccac agagacccag cccgagtttc ccatcgcact gagcactgag atcctgctgg 60 
aagctctgcc gcagc atg agc tcc gca gcc ggg ttc tgc gcc tca cgc ccc 111
                 Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro
                         -25                 -20

ggg ctg ctg ttc ctg ggg ttg ctg ctc ctg cca ctt gtg gtc gcc ttc 159
Gly Leu Leu Phe Leu Gly Leu Leu Leu Leu Pro Leu Val Val Ala Phe
-15                 -10                 -5                  1

gcc agc gct gaa     gaa gaa gat ggg gac ctg cag tgc ctg tgt gtg 207
Ala Ser Ala Glu Ala Glu Glu Asp Gly Asp Leu Gln Cys Leu Cys Val
            5                   10                  15

aag acc acc tcc cag gtc cgt ccc agg cac atc acc agc ctg gag gtg 255
Lys Thr Thr Ser Gln Val Arg Pro Arg His Ile Thr Ser Leu Glu Val
        20                  25                  30

atc aag gcc gga ccc cac tgc ccc act gcc caa ctg ata gcc acg     303
Ile Lys Ala Gly Pro His Cys Pro Thr Ala Gln Leu Ile Ala Thr Leu
    35                  40                  45

aag aat gga agg aaa att tgc ttg gac ctg caa gcc ccg ctg tac aag 351
Lys Asn Gly Arg Lys Ile Cys Leu Asp Leu Gln Ala Pro Leu Tyr Lys
50                  55                  60                  65

aaa ata att aag aaa ctt ttg gag agt tagctactag ctgcctacgt 398
Lys Ile Ile Lys Lys Leu Leu Glu Ser
                70



















gtgtgcattt gctatatagc atacttcttt tttccagttt caatctaact gtgaaagaac 458 
ttctgatatt tgtgttatcc ttatgatttt aaataaacaa aataaatcta aaaaaaaaaa 518 
aaaa 522 



<210> 6
<211> 652
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 75..332

<220>
<221> sig_peptide
<222> 75..128
<223> Von Heijne matrix
      score 11.7816699306029
      seq VLLLLLLVEQAAA/LG

<220>
<221> polyA_site
<222> 637..652

<220>
<221> polyA_signal
<222> 616..621

<400> 6




gactctttgg tcagggaact gctcgctgag cacagctgca cagtgctgtc agaacggccg 60 



atctccagcc <:aag atg att cca gca gtg gtc ttg ctc tta ctc ctt ttg 110
                 Met Ile Pro Ala Val Val Leu Leu Leu Leu Leu Leu
                             -15                 -10

gtt gaa caa gca gcg gcc ctg gga gag cct cag ctc tgc tat atc ctg 158
Val Glu Gln Ala Ala Ala Leu Gly Glu Pro Gln Leu Cys Tyr Ile Leu
    -5                  1               5                   10

gat gcc atc ctg ttt ctg tat gga att gtc ctc acc ctc ctc tac tgt 206
Asp Ala Ile Leu Phe Leu Tyr Gly Ile Val Leu Thr Leu Leu Tyr Cys
                15                  20                  25

    ctg aag         gtg     aag     gct             tat gag
Arg Leu Lys Ile Gln Val     Lys Ala Ala Ile Thr Ser Tyr Glu
            30                  35                  40

tca gat ggt gtt tac acg ggc ctg agc acc agg aac cag gag act tac 302
Ser Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gln Glu Thr Tyr
        45                  50                  55

gag act ctg aag cat gag     cca ccm cag tagccttaga atagatgcgg 352
Glu Thr Leu Lys His Glu     Pro Pro Gln
    60                  65



tcatattctt ctttggcttc tggttcttcc agccctcatg tttggcatca catatgcctg 412

catgccatta acaccagctg gccctacccc tataatgatc ctgtgtccta aattaatata 472

caccagtggt tcctcctccc tgttaaagac taatgctcag atgctgttta cggatattta 532

tattctagtc tcactctctt gtcccaccct tcttctcttc cccattccca actccagcta 592

aaatatggga agggagaacc cccaataaaa ctgccatgga ctggaaaaaa aaaaaaaaaa 652

<210> 7
<211> 743
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 112..387

<220>
<221> sig_peptide
<222> 112..183
<223> Von Heijne matrix
      score 11.7298925418815
      seq FVLGLGLTPPTLA/QD

<220>
<221> polyA_site
<222> 728..743

<220>
<221> polyA_signal
<222> 705..710

<400> 7

acacaactgg aacccatctc caggaacaaa cagctggaac ccatctcccg ttgaagggaa 60
actgccagat ttttgtaaga ttcttcctcc tgggagcctg tgttggaaga g atg gtg 117
                                                         Met Val

atg ggc ctg ggc gtt ttg ttg ttg gtc ttc gtg ctg ggt ctg ggt ctg 165
Met Gly Leu Gly Val Leu Leu Leu Val Phe Val Leu Gly Leu Gly Leu
        -20                 -15                 -10

acc cca ccg acc ctg gct cag gat aac tcc agg tac aca cac ttc ctg 213
Thr Pro Pro Thr Leu Ala Gln Asp Asn Ser Arg Tyr Thr His Phe Leu
    -5                  1               5                   10

acc cag cac tat gat gcc aaa cca cag ggc cgg gat gac aga tac tgt 261
Thr Gln His Tyr Asp Ala Lys Pro Gln Gly Arg Asp Asp Arg Tyr Cys
                15                  20                  25

gaa agc atc atg agg aga cgg ggc ctg acc tca ccc tgc aaa gac atc 309
Glu Ser Ile Met Arg Arg Arg Gly Leu Thr Ser Pro Cys Lys Asp Ile
            30                  35                  40

aac aca ttt att cat ggc aac aag cgc asa tca agg cca tct gtg aaa 357
Asn Thr Phe Ile His Gly Asn Lys Arg Xaa Ser Arg Pro Ser Val Lys
        45                  50                  55

aca aga atg gaa acc ctc aca gag aaa acc taagaataag caagtcttct 407
Thr Arg Met Glu Thr Leu Thr Glu Lys Thr
    60                  65

ttccaggtca ccacttgcaa gctacatgga ggttccccct ggcctccatg ccagtaccga 467
gccacagcgg ggttcagaaa cgttgttgtt gcttgtgaaa atggcttacc tgtccacttg 527
gatcagtcaa ttttccgtcg tccgtaacca gcgggcccct ggtcaagtgc tggctctgct 587
gtccttgcct tccatttccc ctctgcaccc agaacagtgg tggcaacatt cattgccaag 647
ggcccaaaga aagagctacc tggacctttt gttttctgtt tgacaacatg tttaataaat 707
aaaaatgtct tgatatcagc aaaaaaaaaa aaaaaa 743



<210> 8
<211> 502
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 39..371

<220>
<221> sig_peptide
<222> 39..86
<223> Von Heijne matrix
      score 11.56948973398
      seq ILLSVALLAFSSA/QD

<220>
<221> polyA_site
<222> 487..502

<220>
<221> polyA_signal
<222> 467..472

<400> 8

agcataaagt tgggagtgac accagagcct tctgcaag 



tca gtg gcc ctg ctg gcc ttc agc tca gct cag gac tta gat gaa gat
Ser Val Ala Leu Leu Ala Phe Ser Ser Ala Gln Asp Leu Asp Glu Asp
-10                 -5                  1               5

gtc agc caa gaa gac gtt ccc ttg gta ata tca gat gga gga gac tct
Val Ser Gln Glu Asp Val Pro Leu Val Ile Ser Asp Gly Gly Asp Ser
            10                  15                  20

gag cag ttc ata gat gag gag cgt cag gga cca cct ttg gga gga cag
Glu Gln Phe Ile Asp Glu Glu Arg Gln Gly Pro Pro Leu Gly Gly Gln
        25                  30                  35

caa tct caa ccc tct gct ggt gat ggg aac cag aat gat ggc cct cag
Gln Ser Gln Pro Ser Ala Gly Asp Gly Asn Gln Asn Asp Gly Pro Gln
    40                  45                  50

cag gga cca ccc caa caa gga ggc cag cag caa caa ggt cca cca cct
Gln Gly Pro Pro Gln Gln Gly Gly Gln Gln Gln Gln Gly Pro Pro Pro
55                  60                  65                  70

cct cag gga aag cca caa gga cca cct ccc caa ggg ggc cgc cca caa
Pro Gln Gly Lys Pro Gln Gly Pro Pro Pro Gln Gly Gly Arg Pro Gln
                75                  80                  85

gga cct cca cag ggg cag tct cct cag taatctagga ttcaatgaca
Gly Pro Pro Gln Gly Gln Ser Pro Gln
            90                  95
ggaagtgaat aagaagatga cagtgtttca aatgccttga aacataatgt gatcatgctc 451 
taacttcaat ataccaataa aataatcagc ttgctaaaaa aaaaaaaaaa a 502 

<210> 9
<211> 401
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 36..353

<220>
<221> sig_peptide
<222> 36..83
<223> Von Heijne matrix
      score 11.56948973398
      seq ILLSVALLAFSSA/QD

<220>
<221> polyA_site
<222> 386..401

<400> 9

ataaagttgg gagtgacacc agagccttct gcaag atg ctt ctg att ctg ctg 53
                                       Met Leu Leu Ile Leu Leu
                                           -15

tca gtg gcc ctg ctg gcc ttc agc tca gct cag gac tta gat gaa gat 101
Ser Val Ala Leu Leu Ala Phe Ser Ser Ala Gln Asp Leu Asp Glu Asp
-10                 -5                  1               5

gtc agc caa gaa gac gtt ccc ttg gta ata tca gat gga gga gac tct 149
Val Ser Gln Glu Asp Val Pro Leu Val Ile Ser Asp Gly Gly Asp Ser
            10                  15                  20

gag cag ttc ata gat gag gag cgt cag gga cca cct ttg gga gga cag 197
Glu Gln Phe Ile Asp Glu Glu Arg Gln Gly Pro Pro Leu Gly Gly Gln
        25                  30                  35

caa tct caa     tct gct ggt gat ggg aac cag gat gat ggc cct cag 245
Gln Ser Gln Pro Ser Ala Gly Asp Gly Asn Gln Asp Asp Gly Pro Gln
    40                  45                  50

cag gga cca ccc caa caa gga ggc cag cag caa caa ggt cca cca cct 293
Gln Gly Pro Pro Gln Gln Gly Gly Gln Gln Gln Gln Gly Pro Pro Pro
55                  60                  65                  70

cct cag gga aag cca caa gga         caa cag gga ggc cag tct tgt 341
Pro Gln Gly Lys Pro Gln Gly Pro Pro Gln Gln Gly Gly Gln Ser Cys
                75                  80                  85

tgc tgt gac aag taacagcctt ttttttaaat tgttttttat acaaaaaaaa 393
Cys Cys Asp Lys
            90





























aaaaaaaa 401 



<210> 10
<211> 961
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 45..467

<220>
<221> sig_peptide
<222> 45..116
<223> Von Heijne matrix
      score 10.983328612314
      seq LAGLCCLVPVSLA/ED

<220>
<221> polyA_site
<222> 946..961

<220>
<221> polyA_signal
<222> 927..932

<400> 10

ctcagcttca ggcaccacca ctgacctggg acagtgaatc gaca atg ccg tct tct
                                                 Met Pro Ser Ser

gtc tcg tgg ggc atc ctc ctg ctg gca ggc ctg tgc tgc ctg gtc cct
Val Ser Trp Gly Ile Leu Leu Leu Ala Gly Leu Cys Cys Leu Val Pro
-20                 -15                 -10                 -5

gtc tcc ctg     gag gat ccc cag gga gat gct gcc cag aag aca gat 152
Val Ser Leu Ala Glu Asp Pro Gln Gly Asp Ala Ala Gln Lys Thr Asp
                1               5                   10

aca tcc cac cat gat cag gat cac cca acc ttc aac aag atc acc ccc 200
Thr Ser His His Asp Gln Asp His Pro Thr Phe Asn Lys Ile Thr Pro
        15                  20                  25

aac ctg gct gag ttc gcc ttc agc cta tac cgc cag ctg gca cac cag 248
Asn Leu Ala Glu Phe Ala Phe Ser Leu Tyr Arg Gln Leu Ala His Gln
    30                  35                  40

tcc aac agc acc aat atc ttc ttc     cca gtg agc atc gct aca gcc 296
Ser Asn Ser Thr Asn Ile Phe Phe Ser Pro Val Ser Ile Ala Thr Ala
45                  50                  55                  60

ttt gca atg ctc tcc ctg ggg acc aag gct gac act     gat gaa atc 344
Phe Ala Met Leu Ser Leu Gly Thr Lys Ala Asp Thr His Asp Glu Ile
                65                  70                  75

ctg gag ggc ctg aat ttc aac ctc acg gag att ccg gag gct cag atc 392
Leu Glu Gly Leu Asn Phe Asn Leu Thr Glu Ile Pro Glu Ala Gln Ile
            80                  85                  90

cat gaa ggc ttc cag gaa ctc ctc cgt acc ctc aac cag cca gac agc 440
His Glu Gly Phe Gln Glu Leu Leu Arg Thr Leu Asn Gln Pro Asp Ser
        95                  100                 105

cag ctc cag ctg acc acc ggc aag aat taatgctttc ttaactaagt 487
Gln Leu Gln Leu Thr Thr Gly Lys Asn
    110                 115

tactttgagt attaacaagt agcttatact gaggtgtgct tcaagggagc attatagggc 547 

agaaaacaaa ttaaaatgaa atgttgttgt tatagccatg cattgaccat taaatgcatt 607 

ttaatatctc ggcaaaatga gatttgaaac tcattagttt aactcttaaa acatagtgtt 667

gttaaacagt attttttggg ggcgggtgga tagcttttts ttctatcttg tacagaaaac 727

agtaatggaa tcactcatgt tcagaacctt gtaaatatgt ctgtaatatt aatgcttccc 787

tttattaaca caaatataca ttcaaaaagg gagattttat ttagacacac attttctttt 847

caccctaatc tatttagatt catacttatt gagacgggaa aaactcgctg taaaataatg 907

ccaacctaga taatgctata ataaattatt ttgtacacaa aaaaaaaaaa aaaa 961



<210> 11
<211> 643
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 91..462

<220>
<221> sig_peptide
<222> 91..180
<223> Von Heijne matrix
      score 9.99993961154359
      seq LLFGFTLVSGTGA/EK

<220>
<221> polyA_site
<222> 628..643

<220>
<221> polyA_signal
<222> 607..612

<400> 11

acccctaccc cacgccccct cccgcgcgcg cggttaaatc cccgcacctg agcatcggct 60 
cacacctgca ccccgcccgg gcatagcacc atg cct gct tgt cgc cta ggc ccg 114
                                 Met Pro Ala Cys Arg Leu Gly Pro
                                 -30                 -25

cta gcc gcc gcc ctc ctc ctc agc ctg     ctg ttc ggc ttc acc cta 162
Leu Ala Ala Ala Leu Leu Leu Ser Leu Leu Leu Phe Gly Phe Thr Leu
        -20                 -15                 -10

gtc tca ggc aca gga     gag aag     ggc gtg tgc ccc gag ctc cag 210
Val Ser Gly Thr Gly Ala Glu Lys Thr Gly Val Cys Pro Glu Leu Gln
    -5                  1               5                   10

gct gac cag aac tgc acg caa gag tgc gtc tcg gac agc gaa tgc     258
Ala Asp Gln Asn Cys Thr Gln Glu Cys Val Ser Asp Ser Glu Cys Ala
                15                  20                  25

gac         aag tgc tgc agc gcg ggc tgt gcc acc ttc tgc tct ctg 306
Asp Asn Leu Lys Cys Cys Ser Ala Gly Cys Ala Thr Phe Cys Ser Leu
            30                  35                  40

ccc aat gat aag gag ggt tcc tgc ccc cag gtg aac att aac ttt ccc 354
Pro Asn Asp Lys Glu Gly Ser Cys Pro Gln Val Asn Ile Asn Phe Pro
        45                  50                  55

cag ctc ggc ctc tgt cgg gac cag tgc cag gtg gac agc cag tgt cct 402
Gln Leu Gly Leu Cys Arg Asp Gln Cys Gln Val Asp Ser Gln Cys Pro
    60                  65                  70

ggc cag atg aaa tgc tgc cgc aat ggc tgt ggg aag gtg tcc tgt gtc 450
Gly Gln Met Lys Cys Cys Arg Asn Gly Cys Gly Lys Val Ser Cys Val
75                  80                  85                  90

act     aat ttc :ag ccaccaccag gctgagcagt gaggagagaa 502
Thr Pro Asn Phe





























agtttctgcc tggccctgca tctggttcca gcccacctgc cctccccttt ttcgggactc 562 
tgtattccct cttgggctga ccacagcttc tccctttccc aaccaataaa gtaaccactt 622
tcagcaaaaa aaaaaaaaaa a 643

<210> 12
<211> 577
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 28..399

<220>
<221> sig_peptide
<222> 28..117
<223> Von Heijne matrix
      score 9.99993961154359
      seq LLFGFTLVSGTGA/EK

<220>
<221> polyA_site
<222> 562..577

<220>
<221> polyA_signal
<222> 540..545

<400> 12

acctgcaccc cgcccgggca tagcacc atg cct gct tgt cgc cta ggc ccg cta 54
                              Met Pro Ala Cys Arg Leu Gly Pro Leu
                              -30                 -25

gcc gcc gcc ctc ctc ctc agc ctg ctg ctg ttc ggc ttc acc cta gtc 102
Ala Ala Ala Leu Leu Leu Ser Leu Leu Leu Phe Gly Phe Thr Leu Val
    -20                 -15                 -10

tca ggc aca gga gca gag aag     ggc gtg tgc ccc gag ctc     gct 150
Ser Gly Thr Gly Ala Glu Lys Thr Gly Val Cys Pro Glu Leu Gln Ala
-5                  1               5                   10

gac cag aac tgc acg caa gag tgc gtc tcg gac agc gaa tgc gcc gac 198
Asp Gln Asn Cys Thr Gln Glu Cys Val Ser Asp Ser Glu Cys Ala Asp
            15                  20                  25

aac atc aag tgc tgc agc gcg ggc tgt gcc     ttc tgc tct ctg ccc 246
Asn Ile Lys Cys Cys Ser Ala Gly Cys Ala Thr Phe Cys Ser Leu Pro
        30                  35                  40

aat gat aag gag ggt tcc tgc ccc cag gtg aac att aac ttt ccc cag 294
Asn Asp Lys Glu Gly Ser Cys Pro Gln Val Asn Ile Asn Phe Pro Gln
    45                  50                  55

ctc ggc ctc tgt cgg gac cag tgc cag gtg gac agc cag tgt cct ggc 342
Leu Gly Leu Cys Arg Asp Gln Cys Gln Val Asp Ser Gln Cys Pro Gly
60                  65                  70                  75

cag atg aaa tgc tgc cgc aat ggc tgt ggg aag gtg tcc tgt gtc act 390
Gln Met Lys Cys Cys Arg Asn Gly Cys Gly Lys Val Ser Cys Val Thr
                80                  85                  90





ccc aat ttc tgagctccag ccacaggctg agcagtgagg agagaaagtt 439 
Pro Asn Phe 

tctgcctggc cctgcatctg gttccagccc acctgccctc ccctttttcg ggactctgta 499 

ttccctcttg ggctgaccac agcttctccc tttcccaacc aataaagtaa ccactttcag 559 

ctaaaaaaaa aaaaaaaa 577 



<210> 13
<211> 785
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 10..216

<220>
<221> sig_peptide
<222> 10..60
<223> Von Heijne matrix
      score 9.5148977432302
      seq FVFLLFVISLAAA/AH

<400> 13

aacccactt atg ttt cct ggc ttt gtt ttt ttg ctc ttt gta ata agt ctt 51
          Met Phe Pro Gly Phe Val Phe Leu Leu Phe Val Ile Ser Leu
                  -15                 -10                 -5

gct gct gct gct cat ctt tgg gtc ctg gcc gcc ttt atg ggc cgt atc 99
Ala Ala Ala Ala His Leu Trp Val Leu Ala Ala Phe Met Gly Arg Ile
            1               5                   10

acc gtg aag gtc tgc agc ttc act cct gag gcc agc aag acc gtg agc 147
Thr Val Lys Val Cys Ser Phe Thr Pro Glu Ala Ser Lys Thr Val Ser
    15                  20                  25

cca cca gaa gga gcg aac aac tcg aga cgc act gct ttt aag agc tgt 195
Pro Pro Glu Gly Ala Asn Asn Ser Arg Arg Thr Ala Phe Lys Ser Cys
30                  35                  40                  45

aac act cac cac aaa ggt ctg tagcatcact cctgaagtca gcgggaccac 246
Asn Thr His His Lys Gly Leu
                50

gaacctacca gaaagaagaa actccataca gatctgaaaa tctgaaggaa caaactccgg 306 

acacaccatc tttaagaact gtaacactca ctgcgagggt ccacggcttc atttttgaag 366 

tcagccagac caagaaccca ccaattccgc acacacttct gctgtttcgt tccattctca 426 

aggaaaaact gtgatgggtc catttttaat cacatgcaaa ttcatgggca aatctttgct 486 

atggttatgt actttgattt agtagagttg tatcacatgt tcactcatgt gcccacaata 546

gcagggtata ccaataggct gcttcaaaac aatcatgaaa agtaggaggt ggtgtttcca 606 

aaggtagtgg tcttgccaga aaaaaatgga aatactgtga agataaaaat aaacatctat 666 

taacacaaaa atagaaaaat aatataggta cacatacaca tacttttcta catatttttt 726 

gcagtcatat cagaggaaca atacagagga acaaaagtgt aaatcaaaaa aaaaaaaaa 785 



<210> 14
<211> 1154
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 88..318

<220>
<221> sig_peptide
<222> 88..186
<223> Von Heijne matrix
      score 9.33181480765342
      seq MVLLSFALTLCSA/FW

<220>
<221> polyA_site
<222> 1139..1154

<400> 14

ctgtgggtgc ccaggaaggg actacacctc ttcgcagtgt tttatacctt tggtaatatc 60 
gcatcaattg ggagtaccat cttcctc atg gga cca gtg aaa cag ctg aag cga 114
                              Met Gly Pro Val Lys Gln Leu Lys Arg
                                          -30                 -25

atg ttt gag cct act cgt ttg att gca act atc atg gtg ctg ttg agt 162
Met Phe Glu Pro Thr Arg Leu Ile Ala Thr Ile Met Val Leu Leu Ser
                -20                 -15                 -10

ttt gca ctt acc ctg tgt tct gcc ttt tgg tgg cat aac awg gga ctt 210
Phe Ala Leu Thr Leu Cys Ser Ala Phe Trp Trp His Asn Xaa Gly Leu
            -5                  1               5

gca ctt atc ttc tgc att ttg cag tct ttg gca ttg acg tgg tac agc 258
Ala Leu Ile Phe Cys Ile Leu Gln Ser Leu Ala Leu Thr Trp Tyr Ser
    10                  15                  20

ctt tcc ttc ata cca ttt gca agg gat gct gtg aag aag tgt ttt gcc 306
Leu Ser Phe Ile Pro Phe Ala Arg Asp Ala Val Lys Lys Cys Phe Ala
25                  30                  35                  40





gtg tgt ctt gca taattcatgg ccagttttat gaagctttgg aaggcactat 358 
Val Cys Leu Ala 

ggacagaagc tggtggacag ttttgtaact atcttcgaaa cctctgtctt acagacatgt 418 

gccttttatc ttgcagcaat gtgttgcttg tgattcgaac atttgagggt tacttttgga 478

agcaacaata cattctcgaa cctgaatgtc agtagcacag gatgagaagt gggttctgta 538

tcttgtggag tggaatcttc ctcatgtacc tgtttcctct ctggatgttg tcccactgaa 598 

ttcccatgaa tacaaaccta ttcagcaaca gcacataagc cttgggtgca agtgattccc 658 

aggtggcaaa aggcagcccc atcagagatc acgggagcaa cagtaaggga cagagttttg 718 

gggtccactt gtccctcagc atggaagcca tcaccgtggt cctgcataga gtgagtctac 778 

ttctactctg gcatctgaga acaagtgact ctgctttaga caagcccctg gagagcctgg 838 

ccatggagtg aggtagaaaa gaagcacttt ttggtggtat atgctgtttc tgaatttgaa 898 
tctaagcatt cttggtgttc tgattctgcg cctgactgtg gaatgtattt tctgacatta 958

ggtgcagact ctttttctct tgggttttat ttcccatccc tccctccctc tccctgcata 1018 

tctttcttta tgacctcctg gcactagatg gggaaagaac agagcagcag cagcagaaaa 1078 

tgctgctgtt ttttttggac gagagcccat aggagttgaa aaatcctgct gctctcgtcc 1138 

aaaaaaaaaa aaaaaa 1154 

<210> 15
<211> 766
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 84..521

<220>
<221> sig_peptide
<222> 84..161
<223> Von Heijne matrix
      score 8.99656184291183
      seq GLLWAFCAPGARA/EE

<220>
<221> polyA_site
<222> 751..766

<220>
<221> polyA_signal
<222> 728..733

<400> 15

gaggaagagc gttttgggga cgggggctgg tgaggctcac gttggagggc ttcgcgtctg 60 
cttcggagac cgtaaggata ttg atg acc atg aga tcc ctg ctc aga acc ccc 113
                          Met Thr Met Arg Ser Leu Leu Arg Thr Pro
                              -25                 -20

ttc ctg tgt ggc ctg ctc tgg gcc ttt tgt gcc cca ggc     agg     161
Phe Leu Cys Gly Leu Leu Trp Ala Phe Cys Ala Pro Gly Ala Arg Ala
    -15                 -10                 -5

gag gag cct gca gcc agc ttc     caa     ggc     atg ggc ctg gat 209
Glu Glu Pro Ala Ala Ser Phe Ser Gln Pro Gly Ser Met Gly Leu Asp
1               5                   10                  15

aag aac aca gtg cac gac     gag cat atc atg gag cat cta gaa ggt 257
Lys Asn Thr Val His Asp Gln Glu His Ile Met Glu His Leu Glu Gly
            20                  25                  30

gtc atc aac aaa     gag gcg gag atg tcg cca caa gaa ttg cag ctc 305
Val Ile Asn Lys Pro Glu Ala Glu Met Ser Pro Gln Glu Leu Gln Leu
        35                  40                  45

cat tac ttc aaa atg cat gat tat gat ggc aat aat ttg ctt gat ggc 353
His Tyr Phe Lys Met His Asp Tyr Asp Gly Asn Asn Leu Leu Asp Gly
    50                  55                  60

tta gaa ctc tcc aca     atc act     gtc cat aag gag gaa ggg agt 401
Leu Glu Leu Ser Thr Ala Ile Thr His Val His Lys Glu Glu Gly Ser
65                  70                  75                  80

gaa cag gca cca cta atg agt gaa gat gaa ctg att aac ata ata gat 449
Glu Gln Ala Pro Leu Met Ser Glu Asp Glu Leu Ile Asn Ile Ile Asp
                85                  90                  95

ggt gtt ttg aga gat gat gac aag aac aat gat gga tac att gac tat 497
Gly Val Leu Arg Asp Asp Asp Lys Asn Asn Asp Gly Tyr Ile Asp Tyr
            100                 105                 110

gct gaa ttt gca         ctg cag tagatgttat ttggccatct cctggttata 551
Ala Glu Phe Ala Lys Ser Leu Gln
        115                 120
tacaaatgtg acccgtgata atgtgattga acactttagt aatgcgaaat aactcatttc 611 
caactactgc tgcagcattt tggtaaaaac ctgtagcgat tcgttacact ggggtgagaa 671

gagataagag aaatgaaaga gaagagaaat gggacatcta atagtcccta agtgstatta 731 
aataccttat tggacaagca aaaaaaaaaa aaaaa 766 

<210> 16
<211> 808
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 155..340

<220>
<221> sig_peptide
<222> 155..292
<223> Von Heijne matrix
      score 8.64329745298384
      seq AVLLLILFAIVFG/LL

<220>
<221> polyA_site
<222> 754..808

<220>
<221> polyA_signal
<222> 728..733

<400> 16

cttttcctct caacagttgc ttctttgagt cagggtgcag ctctggtcac ctggcggcct 60 
cttcagctca gccctccaca aagtgtgagc ctgaaggacc accctgaatt gcccttgtag 120 
gacccagaac agctaccagc agaatcagat tctc atg gac caa ctg gta ttc aaa 175
                                      Met Asp Gln Leu Val Phe Lys
                                          -45                 -40

gag aca atc tgg aat gat gcg ttc tgg cag aac ccc tgg gac cag ggg 223
Glu Thr Ile Trp Asn Asp Ala Phe Trp Gln Asn Pro Trp Asp Gln Gly
                -35                 -30                 -25

ggc ctg gca gtg att atc tta ttc atc acc gct gtc ctg ctt ctc atc 271
Gly Leu Ala Val Ile Ile Leu Phe Ile Thr Ala Val Leu Leu Leu Ile
            -20                 -15                 -10

tta ttt gcc atc gtg ttt ggt tta ctc act tcc aca gaa aac act cag 319
Leu Phe Ala Ile Val Phe Gly Leu Leu Thr Ser Thr Glu Asn Thr Gln
        -5                  1               5

tgt gaa gcg ggt gaa gag gag tgacctgact tgctggggac tgagatggca 370
Cys Glu Ala Gly Glu Glu Glu
10                  15

gcaggggagg cgagctgacc tgcccccatt ccagtggtgg gccccttcgc ggttccctct 430 
ggctcagggg ccaagccctg gtgtcttcct ttcccaccag gaaaaagtct agtaaaatac 490 
tgtatctggc ttagggttgg tcagactagt aagatgggga ggctggtctg agaccaattc 550
tggctccttg accctattgt ttttagggtt ccccgaccag aaccctaaaa gcacatggag 610
aggatggctc cactgcctca ggtggaagga gctatggcta acaaggttct ctaacaggct 670
cacaggccca gccagcaatt tcacaaatcc ttgacagaga aagacacaac caaatgaaat 730
aaaaattcct tttcaaatct gctaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaramaaaa 790 
aaaaaaaaaa aaaaaaaa 808 



<210> 17 
<211> 772 

<212> DNA 

<213> Homo sapiei 



<220> 
<221> CDS 
<222> 155. .340 
<220>

<221> sig_peptide 

<222> 155. .292 

<223> Von Heijne matrix 

score 8.64329745298384 

AVLLLILFAIVFG/LL 

<220> 

<221> polyA_site 
<222> 751. .772 

<220> 

<221> polyA_signal 
<222> 728. .733 

<400> 17 

cttttcctct caacagttgc ttctttgagt cagggtgcag ctctggtcac ctggcggcct 60
cttcagctca gccctccaca aagtgtgagc ctgaaggacc accctgaatt gcccttgtag 120
gacccagaac agcaaccagc agaatcagat tctc atg gac caa ttg gta ttc aaa 175
Met Asp Gln Leu Val Phe Lys
-45 -40
gag aca atc tgg aat gat gcg ttc tgg cag aac ccc tgg gac cag ggg 223
Glu Thr Ile Trp Asn Asp Ala Phe Trp Gln Asn Pro Trp Asp Gln Gly
-35 -30 -25
ggc ctg gca gtg att atc tta ttc atc acc gct gtc ctg ctt ctc atc 271
Gly Leu Ala Val Ile Ile Leu Phe Ile Thr Ala Val Leu Leu Leu Ile
-20 -15 -10
tta ttt gcc atc gtg ttt ggt tta ctc act tcc aca gaa aac act cag 319
Leu Phe Ala Ile Val Phe Gly Leu Leu Thr Ser Thr Glu Asn Thr Gln
-5 1 5
tgt gaa gcg ggt gaa gag gag tgacctgact tgctggggac tgagatggca 370
Cys Glu Ala Gly Glu Glu Glu
10 15

gcaggggagg cgagctgacc tgcccccatt ccagtggtgg gccccttcgc ggttccctct 430
ggctcagggg ccaagccctg gtgtcttcct ttcccaccag gaaaaagtct agtaaaatac 490
tgtatctggc ttagggttgg tcagactagt aagatgggga ggctggtctg agaccaattc 550
tggctccttg accctattgt ttttagggtt ccccgaccag aaccctaaaa gcacatggag 610
aggatggctc cactgcctca ggtggaagga gctatggcta acaaggttct ctaacaggct 670
cacaggccca gccagcaatt tcacaaatcc ttgacagaga aagacacaac caaatgaaat 730
aaaaattcct tttcaaatct aaaaaaaaaa aaaaatgcga aa 772

<210> 18 

<211> 546 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 13. .498 

<220> 

<221> sig_peptide 

<222> 13. .153 

<223> Von Heijne matrix 

score 4.65474235719409 

seq LALSSLRSLLLFA/GM 



<220> 
<221> polyA_signal
<222> 514. .519 

<400> 18 

aagcgctgac gc atg cgc ata gct aac cgc acc cgg ttc agc tcg cct ttc 51
Met Arg Ile Ala Asn Arg Thr Arg Phe Ser Ser Pro Phe
-45 -40 -35 



ttg 




aga 


ggc 


gcc ggt 


tgg 


act 


cac 


ggg 


egg 


ggc 


atg 


atg 


gtg 


gtg 


99 


Leu 


Ala 


Arg 


Gly 


Ala 
-30 


Gly 


Trp 


Thr 


His 


Gly 
-25 


Arg 


Gly 


Met 


Met 


Val 
-20 


Val 




ggt 


acg 


ggc 


acc 


teg 


ctg 


gcg 


etc 


tec 


tec 


etc 


egg 


tee 


ctg 


ctg 


etc 


147 


Gly 


Thr 


Gly 


Thr 
-15 


Ser 


Leu 


Ala 


Leu 


Ser 
-10 


Ser 


Leu 


Arg 


Ser 


Leu 
-5 


Leu 


Leu 




ttt 


get 


ggg 


atg 


eag 


atg 


tac 


age 


egt 


eag 


etg 


gcc 


tec 


ace 


gag 


tgg 


195 


Phe 


Ala 


Gly Met 


Gin 


Met 


Tyr 


Ser 


Arg 


Gin 


Leu 


Ala 


Ser 


Thr 


Glu 


Trp 








1 








5 










10 












etc 


acc 


ate 


eag 


ggc 


ggc 


etg 


ett 


ggt 


teg 


ggt 


etc 


ttc 


gtg 


ttc 


teg 


243 


Leu 


Thr 


lie 


Gin 


Gly 


Gly 


Leu 


Leu 


Gly 


Ser 


Gly 


Leu 


Phe 


Val 


Phe 


Ser 




15 










20 










25 










30 




etc 


act 


gcc 


ttc 


aat 




ctg 


gag 


aat 


ett 


gtc 


ttt 


ggc 


aaa 


gga 


ttc 


291 


Leu 


Thr 


Ala 


Phe 


Asn 
35 


Asn 


Leu 


Glu 


Asn 


Leu 
40 


Val 


Phe 


Gly 


Lys 


Gly 
45 


Phe 




caa 




aag 


ate 


ttc 


cct 


gag 


att 






tge 


etc 


ctg 


ttg 


get 


etc 


339 


Gin 


Ala 


Lys 


He 
50 


Phe 


Pro 


Glu 


He 


Leu 

55 


Leu 


Cys 


Leu 


Leu 


Leu 
60 


Ala 


Leu 




ttt 


gea 


tet 


ggc 


etc 


ate 


cac 


cga 


gtc 


tgt 


gtc 


ace 


acy 


tge 


ttc 


ate 


387 


Phe 


Ala 


Ser Gly 


Leu 


He 


His 


Arg 


Val 


Cys 


Val 


Thr 


Thr 


Cys 


Phe 


He 








65 










70 










75 










ttc 


tec 


atg 


gtt 


ggt 


etg 


tac 


tae 


ate 


aac 


aag 


ate 


tee 


tee 


ace 


etg 


435 


Phe 


Ser 
80 


Met 


Val 


Gly 


Leu 


Tyr 
85 


Tyr 


He 


Asn 


Lys 


He 
90 


Ser 


Ser 


Thr 


Leu 




tac 


eag 


gea 


gea 


get 


eea 


gtc 


etc 


aca 


eea 


gee 


aag 


gte 


aca 


ggc 


aag 


483 


Tyr 


Gin 


Ala 


Ala 


Ala 


Pro 


Val 


Leu 


Thr 


Pro 


Ala 


Lys 


Val 


Thr 


Gly 


Lys 




95 










100 










105 










110 





agc aag aag aga aac tgaccctgaa tgttcaataa agttgattct ttgcaaaaaa 538
Ser Lys Lys Arg Asn 
115 

aaaaaaaa 546 

<210> 19 
<211> 767 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 132. .401 

<220> 

<221> sig_peptide 

<222> 132. .236 

<223> Von Heijne matrix 

score 8.35677668307059 

seq LGLGLVTLTASWA/AL 

<220> 

<221> polyA_site 
<222> 752.. 767 

<220> 

<221> polyA_signal
<222> 741..746
<400> 19

ccttcagcgc ccggccgaga gcccggtggc tgcgactgag agcctggtta ctgccggcac 60 

ctttggatcc tcggccaatt ctctgcttcg cctacacttg gcgtgcggtg taccagcctc 120 

cccgcgtcac c atg gaa aca gga gcc tct gca tcc atc cca gag ctg atc 170

Met Glu Thr Gly Ala Ser Ala Ser Ile Pro Glu Leu Ile

-35 -30 -25 



tgt 


gaa 


get 


atg 


aga 


aga 


ate 


tgg 


age 




gga 


ttg 


ggg 


ttg gtg 


act 


218 




















Leu 








Leu Val 


Thr 








-20 










-15 










-10 








etg 


aeg 


gee 


age 


tgg 


gca 






tte 




gat 


ggc 


ttt 


gcg gtt 


ctt 


266 


Leu 


Thr 


Ala 


Ser 


Trp 


Ala 


Ala 




Phe 


His 


Asp 


Gly 


Phe 


Ala Val 


Leu 






-5 










1 








5 








10 




gga 


gga 


aac 


att 


gtg 


age 


gat 


etc 


age 


aca 


gta 


aga 


ttt 


gtt gca 


caa 


314 


Gly 


Gly 


Asn 


He 


Val 


Ser 


Asp 


Leu 


Ser 


Thr 


Val 


Arg 


Phe 


Val Ala 


Gin 












15 










20 








25 












ttc 


cag 


etc 


ctt 


gac 


gtg 


tgg 




agg 


aat 


ttc egg 


aag 


362 


Gin 


Gin 


His 


Phe 


Gin 


Leu 


Leu 


Asp 


Val 


Trp 


Thr 


Arg 


Asn 


Phe Arg 


Lys 










30 










35 










40 






cca 


etg 


ggc 


age 


atg 


tgc 


ttt 


gtt 


ttc 


ttg 


ttg 


etc 


cca 


taatcaatgt 


411 


Pro 


Leu 


Gly 


Ser 


Met 


Cys 


Phe 


Val 


Phe 


Leu 


Leu 


Leu 


Pro 









45 50 55 



tgggcatcaa gatctaaccc ttgaaccttc cacgaaccct gttgtcaata cctctgggtt 471 

tccgccggtt acgcttaatt ttgacatatc ggtcagactg gtgccggata aacttcttgg 531

ttctcttttt gacgatcttg ggcttcccaa gggttctgag gacggtttat aaaagtgcac 591 

tgaggggagg ctgagacagg agaattgctt gaacccggga ggcggaggtt gcagtgagcc 651 

gagatcgtgc ccatcctagg cgacagagag agactctatg tcaaaaataa ataaataaat 711 

aaataaataa ataaataaat aaataaataa ataaaatgcc aaaaaaaaaa aaaaaa 767

<210> 20 

<211> 596 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 41. .478 

<220> 

<221> sig_peptide 

<222> 41. .115 

<223> Von Heijne matrix 

score 8.18796433724836 

seq ATLLLVLCLQLGA/NK 

<220> 

<221> polyA_site 
<222> 581. .596 

<220> 

<221> polyA_signal 
<222> 560. .565 

<400> 20 

accacttctc tgggacacat tgccttctgt tttctccagc atg cgc ttg ctc cag 55

Met Arg Leu Leu Gln
-25

ctc ctg ttc agg gcc agc cct gcc acc ctg ctc ctg gtt ctc tgc ctg 103

Leu Leu Phe Arg Ala Ser Pro Ala Thr Leu Leu Leu Val Leu Cys Leu

-20 -15 -10 -5

cag ttg ggg gcc aac aaa gct cag gac aac act cgg aag atc ata ata 151

Gln Leu Gly Ala Asn Lys Ala Gln Asp Asn Thr Arg Lys Ile Ile Ile

1 5 10
aag aat ttt gac att ccc aag tca gta cgt cca aat gac gaa gtc act 199
Gln
35


Glu 
Lys


Glu 


Cys 
40 


Met 


Val Val 
att


age 


age 


ate 


ect 


eta 


eaa 


ggk 


gem 


ttt 


aae tat 


arg 


295 


Thr 


Tyr 


Leu 


He 


Ser 


Ser 


He 


Pro 


Leu 


Gin 


Gly 


Ala 


Phe 


Asn Tyr Xaa 




45 










50 










55 








60 






act 


gcc 


tgc 


eta 


tgt 


gac 


gac 


aat 


cca 


aaa 


acc 




tac tgg 


gac 


343 


Tyr 


Thr 


Ala 


Cys 


Leu 


Cys 


Asp 


Asp 


Asn 


Pro 


Lys 


Thr 


Phe 


Tyr Trp Asp 












65 










70 








75 






ttt 


tac 


acc 


aac 


aga 


act 


gtg 


caa 


att 


gca 


gcc 


gtc 


gtt 


gat gtt 


att 


391 


Phe 


Tyr 


Thr 


Asn 
80 


Arg 


Thr 


Val 


Gin 


He 
85 


Ala 


Ala 


Val 


Val 


Asp Val 
90 


He 




egg 


gaa 


tta 


ggc 


ate 


tgc 


ect 


gat 


gat 


get 


get 


gta 


ate 


eee ate 


aaa 


439 


Arg 


Glu 


95 


Gly 


He 


Cys 


Pro 


Asp 
100 


Asp 


Ala 


Ala 


Val 


He 
105 


Pro He 


Lys 




aac 


aac 


egg 


ttt 


tat 


act 


att 


gaa 


ate 


eta 


aag 


gta 


gaa 


taatggaagc 


488 


Asn 


Asn 
110 


Arg 


Phe 


Tyr 


Thr 


He 
115 


Glu 


He 


Leu 


Lys 


Val 
120 


Glu 









cctgtctgtt tgccacaccc aggtgatttc ctctaaagaa acttggctgg aatttctgct 548
gtggtctata aaataaactt cttaacatgc tcaaaaaaaa aaaaaaaa 596

<210> 21 

<211> 672 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 7. .366 

<220> 

<221> sig_peptide

<222> 7..102

<223> Von Heijne matrix 

score 7.64815044221151 

VTLGIGFFALASA/LW 

<220> 

<221> polyA_site 
<222> 641. .672 



<400> 21 

agtcat atg gct tca cct aaa ggt ttt ttt aat tac ttg acc tat ttt 48
Met Ala Ser Pro Lys Gly Phe Phe Asn Tyr Leu Thr Tyr Phe 
agg
tca
Ser


Lys 
Ile


Asp 


Glu 


Arg 


Cys 


Arg 


Gin 


Arg 


Pro 


Ser 


Met 




15 
25










30 




geg 


aaa 


att 
tct


eat 
tgt


gtt 


ttt 


att 


tet 


cga 


aat 


ttt 


240 


Ala 


Lys 


He 


Lys 


Ser 


His 


Ser 


Gin 


Cys 


Val 


Phe 


He 


Ser 


Arg 


Asn 


Phe 












35 










40 










45 






eat 


act 


ggg 


aga 


ttc 


caa 


tta 


cag 


caa 


tta 


aag 


ate 


att 


eta 


aag 


atg 


288 
Thr
Arg


Phe 


Gin 


Leu 


Gin 


Gin 


Leu 


Lys 


He 


He 


Leu 


Lys 


Met 










50 
cag
336
Asn Pro Asn Leu Gln Gln Lys Ile Ser Phe Val Ile Pro Gln Arg Pro

65 70 75

gct cca caa caa atc gca gca gtg tta cat taagcttatc aacattacca 386
Ala Pro Gln Gln Ile Ala Ala Val Leu His

80 85 
tctgattctt attacagcca aagtatagaa gcagctgatg actggttttc tgatgattct 446 
ctagtgaaaa ggaactctcc aatgccttct ctcggggaac ctctaatgga aaaagtattt 506 
tcatacctgt caaccatttc attagaagag ggtactgaaa gtgtactgaa tgacacttta 566 
tgaccatcaa aaagatgact acattaaggg aaaatgttca tgaagaaaca cagaggttga 626 
aatataaaac cttcaacaga aaaaaaaaaa aaaatgcgaa aagctt 672 

<210> 22 

<211> 438 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 84. .317 

<220> 

<221> sig_peptide 

<222> 84 . .140 

<223> Von Heijne matrix 

score 7.64030745849671 

seq ALVLALMISMISA/DS 

<220> 

<221> polyA_site 
<222> 421. .438 

<220> 

<221> polyA_signal 
<222> 397. .402 

<400> 22 

atagaaaagg acatctcttg agacttcact tcagcttcac tgacttcttg actctcctct 60 
tgagtaaaag gactcagcca act atg aag ttt ttt gtc ttt gct tta gtc ttg 113
Met Lys Phe Phe Val Phe Ala Leu Val Leu 
-15 -10 



get 
att
att
tca cat


gaa 
cat


161 
Met


He 
Met
Asp
Glu
5


Arg 


His 
tat


aga 
gaa


aag 
tac
atc


209 


His 


Gly 


Tyr 


Arg 


Arg 


Lys 


Phe 


His 


Glu 


Lys 


His His 


Ser 


Tyr 


His 


He 








10 










15 








20 










aca 


eta 


eta 


cca 


ctt 


ttt 


gaa 


gaa 


tea 


tea 


aag age 


aat 


gca 


aat 


gaa 


257 


Thr 


Leu 


Leu 


Pro 


Leu 


Phe 


Glu 


Glu 


Ser 


Ser 


Lys Ser 


Asn 


Ala 


Asn 


Glu 






25 










30 








35 












aaa 


cac 


tat 


aat 


tta 


ctg 


tat 


act 


ctt 


tgt 


ttc agg 


ata 


ctt 


gee 


ttt 


305 


Lys 


His 


Tyr 


Asn 


Leu 


Leu 


Tyr 


Thr 


Leu 


Cys 


Phe Arg 


He 


Leu 


Ala 


Phe 




40 










45 










50 








55 




tea 


att 


gtc 


act 


tgatgatata attgc 


:aattt as 


lactgttaa gctgtgttca 




357 


Ser 


He 


Val 


Thr 



























gtactgtttc tgaataatag aaatcacttc tctaaaagca ataaatttca agcacatttt 417 
taaataaaaa aaawawaaaa a 438 

<210> 23 

<211> 617 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS
<222> 141. .401 

<220> 

<221> sig_peptide 

<222> 141. .200 

<223> Von Heijne matrix 

score 7.42708462116258 

LLVFLAGFPVLDA/ND 

<220> 

<221> polyA_site 
<222> 603. .617 

<400> 23 

gatttctccc ggaacctctg ctcagcctgg tgaaccacac aggcccgagt ttcacccagt 60 
ccccactcca cggtgcagct gcggcttatc tctcagccca gcgagatgcc agccttcctg 120 

tcccgggcca gcgctctgac atg cag aag gtg acc ctg ggc ctg ctt gtg ttc 173 

Met Gln Lys Val Thr Leu Gly Leu Leu Val Phe
-20 -15 -10
ttt


cct 
ctg


gac 
aat
Ile
gtc atg


agt 
317


Cys 


Ala 


Gly 


Val 


Leu 


Cys 


Ala 
Gly


He 


He 


He 


Val Met 
Ala
35












tgc 


aaa 


tgc 


aag 


ttt 


ggc 


cag 


aag 


tec 


ggt 


eae 


eat cca 


ggg 


gag 


365 


Lys 


Cys 


Lys 
Phe


Gly 


Gin 


Lys 
His Pro


Gly 


Glu 
45










50 
atc
tca


gee 


caa 
411


Thr 


Pro 


Pro 


Leu 


He 


Thr 


Pro 


Gly 


Ser 


Ala 


Gin 


Ser 


















60 










65 















agaccagctg aaattgggtg gaggaccgtt ctctgtcccc aggtcctgtc tctgcacaga 471

aacttgaact ccaggatgga attcttcctc ctctgctggg actcctttgc atggcagggc 531

ctcatctcac ctctcgcaag agggtctctt tgttcaattt ttttttatct aaaatgattg 591

tgcctctgcc caaaaaaaaa aaaaaa 617



<210> 24 

<211> 673 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 69. .383 

<220> 

<221> sig_peptide 

<222> 69. . 125 

<223> Von Heijne matrix 

score 7.20796835452081 

VSIMLLLVTVSDC/AV 

<400> 24 

acaaggctga gcgggaggaa gcgagaggca tctaagcagg cagtgttttg ccttcacccc 60 
aagtgacc atg aga ggt gcc acg cga gtc tca atc atg ctc ctc cta gta 110
Met Arg Gly Ala Thr Arg Val Ser Ile Met Leu Leu Leu Val
-15 -10
act gtg tct gac tgt gct gtg atc aca ggg gcc tgt gag cga gat gtc 158
Thr Val Ser Asp Cys Ala Val Ile Thr Gly Ala Cys Glu Arg Asp Val
45
Phe Arg
55


His 


His 
Cys
ttg
tgc tcc
ttc
agg


tac 


350 


Pro 


Cys 


Leu 


Pro 


Asn 


Leu 


Leu 


Cys Ser 


Arg 


Phe 


Pro 


Asp 


Gly 




Tyr 




60 










65 








70 










75 




cgc 


tgc 


tec 


atg 


gac 


ttg 


aag 


aac ate 


aat 


ttt 


taggcgcttg ectggtctca 


403 


Arg 


Cys 


Ser 


Met 


Asp 


Leu 


Lys 


Asn He 


Asn 


Phe 















80 85 



ggatacccac catccttttc ctgagcacag cctggatttt tatttctgcc atgaaaccca 463

gctcccatga ctctcccagt ccctasactg actaccctga tctctcttgt ctagtacgca 523

catatgcaca caggcagaca tacctcccat catgacatgg tccccaggct ggcctgagga 583

tgtcacagct tgaggctgtg gtgtgaaagg tggccagcct ggttctcttc cctgctcagg 643

ctgcactcaa aaaaaaaaaw aaatgcgaaa 673 

<210> 25 

<211> 678 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 69..383

<220>

<221> sig_peptide

<222> 69..125

<223> Von Heijne matrix 

score 7.20796835452081 

VSIMLLLVTVSDC/AV 

<220> 

<221> polyA_site 
<222> 652. .678 

<400> 25 

acaaggctga gcgggaggaa gcgagaggca tctaagcagg cagtgttttg ccttcacccc 60 
aagtgacc atg aga ggt gcc acg cga gtc tca atc atg ctc ctc cta gta 110
Met Arg Gly Ala Thr Arg Val Ser Ile Met Leu Leu Leu Val
-15 -10 
403
Arg Cys Ser Met Asp Leu Lys Asn Ile Asn Phe

80 85 

ggatacccac catccttttc ctgagcacag cctggatttt tatttctgcc atgaaaccca 463 

gctcccatga ctctcccagt ccctacactg actaccctga tctctcttgt ctagtacgca 523 

catatgcaca caggcagaca tacctcccat catgacatgg tccccaggct ggcctgagga 583 

tgtcacagct tgaggctgtg gtgtgaaagg tggccagcct ggttctcttc cctgctcagg 643 

ctgcactcaa aaaaaaaaaa aaatgcgaaa agctt 678 

<210> 26

<211> 860 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 129. .434 

<220>

<221> sig_peptide 

<222> 129. .236 

<223> Von Heijne matrix 

score 7.04760006506154 

seq PFFLITLVGVWA/VV 

<220> 

<221> polyA_site 
<222> 845. .860 

<220> 

<221> polyA_signal 
<222> 826. .831 

<400> 26 

tccggggccc gggagccaac cgagggcgtt cctgtcgggg ctgcagcggc gggagggagc 60 
ccagtggagg cgccctcccg aagcgccact gcccatgctg accacccagc cctccggctg 120 
ctgatgtc atg agt aac acc act gtg ccc aat gcc ccc cag gcc aac agc 170
Met Ser Asn Thr Thr Val Pro Asn Ala Pro Gln Ala Asn Ser
-35 -30 -25 



gac 


tec 


atg 


gtg 


ggc 


tat 


gtg 


ttg 


ggg 




ttc 


ttc 


etc 


ate 


aee 




218 


Asp 


Ser 


Met 

-20 


Val 


Gly 


Tyr 


Val 


Leu 
-15 


Gly 


Pro 


Phe 


Phe 


Leu 
-10 


He 


Thr 


Leu 




gtc 


ggg 


gtg 


gtg 


gtg 


get 


gtg 


gta 


atg 


tat 


gta 


cag 


aag 




aag 


egg 


266 


Val 


Gly 
-5 


Val 


Val 


Val 


Ala 


Val 
1 


Val 


Met 


Tyr 


Val 
5 


Gin 


Lys 


Lys 


Lys 


Arg 
10 




gtg 


gac 


egg 


ctg 




cat 


cac 


ctg 


etc 




atk 


tac 


age 


tat 


gac 


cca 


314 


Val 


Asp 


Arg 


Leu 


Arg 
15 


His 


His 




Leu 


Pro 
20 


Xaa 


Tyr 


Ser 


Tyr 


Asp 
25 


Pro 




get 


gag 


gaa 


ctg 


cat 


gag 


get 


gag 


ear 


cag 


ctg 


etc 


tct 


gac 


atg 


gga 


362 


Ala 


Glu 


Glu 


30 


His 


Glu 


Ala 


Glu 


Gin 
35 


Gin 


Leu 


Leu 


Ser 


Asp 
40 


Met 


Gly 




gac 


ccc 


aag 


gtg 


gta 


cat 


ggc 


tgg 


cag 


akt 


ggc 


tac 


cag 


cac 


aag 


egg 


410 


Asp 


Pro 


Lys 
45 


Val 


Val 


His 


Gly 


Trp 
50 


Gin 


Xaa 


Gly 


Tyr 


Gin 
55 


His 


Lys 


Arg 




atg 


cca 


ctg 


ctg 


gat 


gtc 


aag 




tgacctgacc cccttgcccc acccttcaga 


4 64 


Met 


Pro 
60 






Asp 


Val 


65 


Thr 





















gcctggggtc ctggactgcc tggggccctg ccatctgctt cccctgctgt cacctggctc 524

cccctgctgg gtgctgggtc tccatttctc cctccaccca ccctcagcag catctgcttc 584

ccatgccctc accatcacct cactgccccc aggccttctg ccctttgtgg gtgttgagct 644

caccgcccac ccacaggcac tcataggaag aggctttcct tctgggatgg cggcggctgg 704

tagacacctt tgctttctct agccctcctg ggctgggctt gggcacaaat ccccaggcag 764

gctttggagt tgtttccatg gtgatggggc cagatgtata gtattcagta tatattttgt 824 

aaataaaatg ttttgtggct aaaaaaaaaa aaaaaa 860 
<210> 27

<211> 1443 

<212> DNA 

<213> Homo sapiens 






<220> 
<221> CDS 
<222> 30. .440 

<220> 

<221> sig_peptide 

<222> 30. .83 

<223> Von Heijne matrix 

score 7.06387130837154 

seq VLGLVLLSVTVQG/KV 

<220> 

<221> polyA_site 
<222> 1428. .1443 

<220> 

<221> polyA_signal 
<222> 1409. .1414 

<400> 27 

cagcctagca ctctgaccta gcagtcaac atg aag gct ctc att gtt ctg ggg 53
Met Lys Ala Leu Ile Val Leu Gly
-15

ctt gtc ctc ctt tct gtt acg gtc cag ggc aag gtc ttt gaa agg tgt 101
Leu Val Leu Leu Ser Val Thr Val Gln Gly Lys Val Phe Glu Arg Cys
-10 -5 1 5

gag ttg gcc aga act ctg aaa aga ttg gga atg gat ggc tac agg gga 149
Glu Leu Ala Arg Thr Leu Lys Arg Leu Gly Met Asp Gly Tyr Arg Gly
10 15 20

atc agc cta gca aac tgg atg tgt ttg gcc aaa tgg gag agt ggt tac 197
Ile Ser Leu Ala Asn Trp Met Cys Leu Ala Lys Trp Glu Ser Gly Tyr
25 30 35

aac aca cga gct aca aac tac aat gct gga gac aga agc act gat tat 245
Asn Thr Arg Ala Thr Asn Tyr Asn Ala Gly Asp Arg Ser Thr Asp Tyr
40 45 50

ggg ata ttt cag atc aat agc cgc tac tgg tgt aat gat ggc aaa acc 293
Gly Ile Phe Gln Ile Asn Ser Arg Tyr Trp Cys Asn Asp Gly Lys Thr
55 60 65 70

cca gga gca gtt aat gcc tgt cat tta tcc tgc agt ggg tgg cat gga 341
Pro Gly Ala Val Asn Ala Cys His Leu Ser Cys Ser Gly Trp His Gly
75 80 85

gaa atc gtt gtc aaa aca gag atg tcc gtc agt atg ttc aag gtt gtg 389
Glu Ile Val Val Lys Thr Glu Met Ser Val Ser Met Phe Lys Val Val
90 95 100

gag tgt aac tcc aga att ttc ctt ctt cag ctc att ttg tct ctc tca 437
Glu Cys Asn Ser Arg Ile Phe Leu Leu Gln Leu Ile Leu Ser Leu Ser
105 110 115

cat taagggagta ggaattaagt gaaaggtcac actaccatta ttttcccctt 490
His

caaacaaata atatttttac agaagcagga gcaaaatatg gcctttcttc taagagatat 550
aatgttcact aatgtggtta ttttatatta agcctacaac atttttcagt ttgcaaatag 610
aactaatact ggtgaaaatt tacctaaaac cttggttatc aaatacatct ccagtacatt 670
ccgttctttt tttttttttt tttttttttt ttgagacagt ctcgctctgt cgcccaggct 730
ggagtgcagt ggcgcaatct cggctcactg caacctccac ctcccgggtt cacgccattc 790
tcctgcctca gcctcccgag tagctgggat tacgggcgcc cgccaccacg cccggctaat 850
tttttgtatt tttagtagag acagggtttc accgttttag ccaggatggt ctcgatctcc 910
tgaccttgtg atccacccac ctcggcctcc caaagtgctg cgattacagg cgtgagccac 970
tgcgcccggc cacattcagt tyttatcaaa gaaataaccc agacttaatc ttgaatgata 1030

cgattatgcc caatattaag taaaaaatat aagaaaaggt katcttaaat agatcttagg 1090 

caaaatacca gctgatgaag gcatctgatg ccttcatctg ttcagtcatc tccaaaaaca 1150 

gtaaaaataa ccactttttg ttgggcaata tgaaattttt aaaggagtag aataccaaat 1210 

gatagaaaca gactgcctga attgagaatt ttgatttttt aaagtgtgtt tctttctaaa 1270 

ttgctgttcc ttaatttgat taatttaatt catgtattat gattaaatct gakgcakatg 1330 

agcttacaag tattgaaata attactaatt aatcacaaat gtgaagttat gcatgatgta 1390 

aaaaatacaa acattctaat taaaggcttt gcaacacaaa aaaaaaaaaa aaa 1443 



<210> 28 

<211> 622 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 82. .384 



<220> 

<221> sig_peptide 

<222> 82. .168 

<223> Von Heijne matrix 

score 6.9066614461057 
seq VCMCVFSTQGALG/EM 



<220> 

<221> polyA_site 
<222> 607. .622 



<220> 

<221> polyA_signal 
<222> 570. .575 



<400> 28 

gctacctcct ggtgtctgtt gcctggtgtg tgggaaaccc ccacacgctg gaccacagaa 60 



gtcttctgtg ttgactgttg a 


atg 


agg 


gaa 


gag 




aag 




ttt 


gag 


aga 


111 














Met 


Arg 


Glu 


Glu 


Lys 
-25 


Lys 


Pro 


Phe 


Glu 


Arg 
-20 




gag 


aga 


gag 


agt 


gtg 


tgt 


gtg 


tgt 


atg 


tgt 


gtg 


ttt 


tec 


act 


caa 


gga 


159 


Glu 


Arg 


Glu 


Ser 


Val 
-15 


Cys 


Val 


Cys 


Met 


Cys 
-10 


Val 


Phe 


Ser 


Thr 


Gin 
-5 


Gly 




get 


ttg 


ggg 


gaa 


atg 


get 


gca 


cac 


ttc 


ata 


gat 


gaa 


aag 


ctg 


agg 




207 


Ala 


Leu 


Gly 


Glu 
1 


Met 


Ala 


Ala 


His 
5 


Phe 


He 


Asp 


Glu 


Lys 
10 


Leu 


Arg 


Pro 




agt 


gag 


ggg 


aat 


ggt 


cac 


aga 


ggg 


aea 


ctg 


gat 


age 


ctg 


agt 


teg 


gac 


255 


Ser 


Glu 
15 


Gly 


Asn 


Gly 


His 


Arg 
20 


Gly 


Thr 




Asp 


Ser 
25 


Leu 


Ser 


Ser 


Asp 




caa 


gag 


tec 


tac 


ate 


ccc 


tec 


ace 


gee 


gae 


cee 


aee 


eag 


get 


ggc 




303 


Gin 


Glu 


Ser 


Tyr 


He 


Pro 


Ser 


Thr 


Ala 


Asp 


Pro 


Thr 


Gin 


Ala 


Gly 


Pro 




30 










35 










40 










45 




gag 


ctg 


ctg 


cac 


aag 


aae 


ttg 


cee 


gtg 


aee 


tec 


ega 


tec 


eag 




ctg 


351 


Glu 






His 


Lys 
50 


Asn 




Pro 


Val 


Thr 
55 


Ser 


Arg 


Ser 


Gin 


Pro 
60 


Leu 






tct 


gac 


etc 


geg 


ate 


cea 


gee 


get 


gee 


etc 


tgacctcccg ateccagceg 


404 


Pro 


Ser 


Asp 


Leu 


Ala 


He 


Pro 


Ala 


Ala 


Ala 


Leu 















65 70 



ctgccctcgc tgagcttgga ccacgtgcct gagacccgtc tttcctcatt taaaccctgc 464

tcggggtggg gggagctcca caccggcggg gaacgtgctt gttcacagtg tgttcttagc 524

ctctagaaca gagcctcagg cacaggaggc acttcacatt tattgaataa atagatgaat 584

gaatcagtgt catcctccca acaaaaaaaa aaaaaaaa 622



<210> 29 
<211> 658 
<212> DNA

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114. .497 

<220> 

<221> sig_peptide 

<222> 114. .191 

<223> Von Heijne matrix 

score 6.8337397741578 

seq VFLSLFLIQLLIS/FS 

<400> 29 

ctcagcaccc agggcggtgg taggtcacag tctctgggcg ggtctcagtg tccaacactg 60 
tagctggtgc ctgccaggtt cccagtggct ggggtcacca ggtctgaaga gag atg 116 

Met 



tgc 


tgg 




egg 


gca 


tgg 


ggc 


cag 


ate 


etc 


etg 


eea 


gtt 


ttc 


etc 


tec 


164 


Cys 


Trp 


Leu 


Arg 


Ala 


Trp 


Gly 


Gin 


He 


Leu 


Leu 


Pro 


Val 


Phe 


Leu 


Ser 




-25 










-20 










-15 










-10 




etc 


ttt 


etc 


ate 


caa 


ttg 


ett 


ate 


age 


ttc 


tea 


gag 


aat 


ggt 


ttt 


ate 


212 


Leu 


Phe 


Leu 


He 


Gin 
-5 


Leu 


Leu 


He 


Ser 


Phe 
1 


Ser 


Glu 




Gly 
5 


Phe 


He 




cac 


age 




agg 




aat 


cag 


aaa 


eea 


aga 


gat 


ggg 


aat 


gaa 


gag 


gaa 


260 


His 


Ser 


Pro 


Arg 


Asn 


Asn 


Gin 


Lys 


Pro 


Arg 


Asp 


Gly 


Asn 


Glu 


Glu 


Glu 








10 










15 










20 










tgt 




gta 


aag 


aag 


agt 


tgt 


caa 


ttg 


tgc 


aea 


gaa 


gat 


aag 


aaa 


tgt 


308 


Cys 


Ala 


Val 


Lys 


Lys 


Ser 


Cys 


Gin 


Leu 


Cys 


Thr 


Glu 


Asp 


Lys 


Lys 


Cys 






25 










30 










35 












gtt 


tgg 


tgt 


agt 


gaa 


gaa 




gea 


tgc 




aaa 


tac 


tgt 


ttt 




tat 


356 


Val 


Trp 


Cys 


Ser 


Glu 


Glu 


Lys 


Ala 


Cys 


Xaa 


Lys 


Tyr 


Cys 


Phe 


Pro 


Tyr 




40 










45 










50 










55 




ttc 


ggt 


tgt 




ttc 


agt 


tet 


ata 


tat 


tgg 


tta 


aac 


tgt 


aaa 


gtt 


gae 


404 


Phe 


Gly 


Cys 


Arg 


Phe 


Ser 


Ser 


He 


Tyr 


Trp 


Leu 


Asn 


Cys 


Lys 


Val 


Asp 












60 










55 










70 






atg 


ttt 


gga 


ate 


atg 


atg 


ctt 


Gta 


etc 


att 


gca 


gta 


tta 


att 


aca 


gga 


452 


Met 


Phe 


Gly 


He 


Met 


Met 


Leu 


Leu 




He 


Ala 


Val 


Leu 


He 


Thr 


Gly 










75 










80 










85 








ttc 


gtt 


tgg 


tac 


tgc 


tgc 


gee 


tat 


cac 


ttt 


tac 


etg 


cag 


gat 


ata 




497 


Phe 


Val 


Trp 


Tyr 


Cys 


Cys 


Ala 


Tyr 


His 


Phe 


Tyr 


Leu 


Gin 


Asp 


He 










90 










95 










100 











tgatgaatag ataattgaaa agagatcctc cagaaagagc agaaggaagt ttcttcaatg 557 
gcttccttca ggattttaat catccttaca gcctctttga gaatgattga acttccaaat 617
tccctgaagt taaaatttta aattctatta aacatttttt c 658

<210> 30 

<211> 619 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 74. .499 

<220> 

<221> sig_peptide 

<222> 74. .145 

<223> Von Heijne matrix 

score 6.56258453552342 

seq LSICLSAVATATG/AE 



<220> 
<221> polyA_site
<222> 556. .619 

<400> 30 

agcgactagg acgcgcctgc gcagaggcgg cagcaccacc ggggttgact ccgggggcgc 60 
ggcgaggaga gac atg agg ctg agc tgg ttc cgg gtc ctg aca gta ctg 109
Met Arg Leu Ser Trp Phe Arg Val Leu Thr Val Leu 
-20 -15 



tec 


ate 


tge 


ctg 


age 


gee 


gtg 


gee 


aeg 


gee 


aeg 


ggg 




gag 


ggc 


aaa 


157 


Ser 


lie 


Cys 
-10 


Leu 


Ser 


Ala 


Val 


Ala 
-5 


Thr 


Ala 


Thr 


Gly 


1 


Glu 


Gly 


Lys 




agg 


aag 


ctg 


cag 


ate 


ggg 


gtc 


aag 


aag 


egg 


gtg 


gae 


cae 


tgt 


cec 


ate 


205 


Arg 


Lys 


Leu 


Gin 


lie 


Gly 


Val 


Lys 


Lys 


Arg 


Val 


Asp 


His 


Cys 


Pro 


He 




5 










10 










15 










20 




aaa 


teg 


cgc 


aaa 


ggg 


gat 


gtc 


ctg 


cae 


atg 


cae 


tae 


aeg 


ggt 


aag 


ctg 


253 


Lys 


Ser 


Arg 


Lys 


Gly 

25 


Asp 


Val 


Leu 


His 


Met 

30 


His 


Tyr 


Thr 


Gly 


Lys 

35 


Leu 




gaa 


gat 


ggg 


aca 


gag 


ttt 






age 


ctg 




cag 


aac 


cag 


cec 


ttt 


301 






Gly 


Thr 
40 


Glu 


Phe 


Asp 


Ser 


Ser 
45 


Leu 


Pro 


Gin 


Asn 


Gin 
50 


Pro 


Phe 




gtc 


ttc 


tec 


ctt 


ggc 




ggc 


cag 


gtc 


ate 


aag 


ggc 


tgg 


gae 


cag 


ggg 


349 


Val 


Phe 


Ser 
55 


Leu 


Gly 


Thr 


Gly 


Gin 
60 


Val 


He 


Lys 


Gly 


Trp 

65 


Asp 


Gin 


Gly 




ctg 


ctg 


ggg 


atg 


tgt 


gag 


ggg 


gaa 


aag 


cgc 


aas 


ykg 


gtg 


ate 




tee 


397 


Leu 


70 


Gly 


Met 


Cys 


Glu 


Gly 
75 


Glu 


Lys 




Xaa 


Xaa 
80 


Val 


He 


Pro 


Ser 




gag 


eta 


ggg 


tat 


gga 


gas 


egg 


gga 


get 


cec 


cca 


aag 


att 




ggc 


ggt 


445 


Glu 


Leu 


Gly 


Tyr 


Gly 


Xaa 


Arg 


Gly 


Ala 


Pro 


Pro 


Lys 


He 


Pro 


Gly 


Gly 




85 










90 










95 










100 




gca 


ace 


ctg 


gtg 


ttc 


gar 


gtg 


gag 


ctg 


etc 


aaa 


ata 


gag 


ega 


ega 


act 


493 


Ala 


Thr 


Leu 


Val 


Phe 
105 


Glu 


Val 


Glu 


Leu 


Leu 
110 


Lys 


He 


Glu 


Arg 


Arg 
115 


Thr 




gag 


ctg 


taaccagact ggggaggggc agggggagag gcccccatca 


gggaccagae 


549 



Glu Leu 

tgttccaaaa aaaaaacaaa aaacaaaaac aaacaaaaaa acacttaaaa gccgaaaaaa 609
aaaaaaaaaa 619 

<210> 31
<211> 573
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 65..274

<220>
<221> sig_peptide
<222> 65..136
<223> Von Heijne matrix
score 6.54728084640774
seq ILLTRPACLGSWA/EI

<220>
<221> polyA_site
<222> 556..573

<220>
<221> polyA_signal
<222> 540..545

<400> 31

cactgattct ttggccatcg ctactacaat tttagaattg ctggcaattt tttctgcaca 60
ctgg atg gct tct tcc acc agt gtc tca aca gga caa atc ttg ctg aca 109
     Met Ala Ser Ser Thr Ser Val Ser Thr Gly Gln Ile Leu Leu Thr
                     -20                 -15                 -10

aga cct gct tgc ttg ggg tcc tgg gct gag atc cgg tca ccg gtg agg 157
Arg Pro Ala Cys Leu Gly Ser Trp Ala Glu Ile Arg Ser Pro Val Arg
                -5                  1               5

acc atc tcc atc gcc agc gac ttc cca aca gca cgg gtg agt ctc tgg 205
Thr Ile Ser Ile Ala Ser Asp Phe Pro Thr Ala Arg Val Ser Leu Trp
        10                  15                  20

gtg ccg ccc gca cct ggg atg gtt cct att aag atc tcc ggc tgt gca 253
Val Pro Pro Ala Pro Gly Met Val Pro Ile Lys Ile Ser Gly Cys Ala
    25                  30                  35

    tgg     ttc tca ccg gca tagatgatat cacacatcc it ggcaagctcc 304
Asn Trp Ala Phe Ser Pro Ala
40                  45





















cagcccccgc caaaggcata gccattgaca gcagcgatga ctggattctt gacctgggtg 364 

aggtggtccc agtgcttcaa gaacttgctg gagtaacagt cctggaaact caggttctgc 424 

atttccttga tatcagctct agtgaaaagg gcggtagttg gtggtggaac cagaaacgga 484 

cgccggtgct tggagcggtt cttaaattat atttaaaaaa gtaacttttt gtataaataa 544 

aagaaaatgg gacaaaaaaa aaaaaaaaa 573 



<210> 32
<211> 497
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 51..389

<220>
<221> sig_peptide
<222> 51..119
<223> Von Heijne matrix
score 6.43297166361141
seq LGALSGAAALGFA/SY

<220>
<221> polyA_site
<222> 482..497

<220>
<221> polyA_signal
<222> 462..467

<400> 32

ctcttgggtg gcctgggcgg cgttggtccg gtgcgtcctg ttctacagct atg gcc 56
                                                       Met Ala

ggg cca gct gca gct ttc cgc cgc ttg ggc gcc ttg tcc gga gct gcg 104
Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly Ala Leu Ser Gly Ala Ala
    -20                 -15                 -10

gcc tta ggc ttc gct tcc tac ggg gcg cac ggc gcc maw ttc cca gat 152
Ala Leu Gly Phe Ala Ser Tyr Gly Ala His Gly Ala Xaa Phe Pro Asp
-5                  1               5                   10

gcc tac ggg aag gag ctg ttt gac aag gcc aac aaa cac cac ttc tta 200
Ala Tyr Gly Lys Glu Leu Phe Asp Lys Ala Asn Lys His His Phe Leu
            15                  20                  25

cac agc ctg gcc ctg tta ggg gtg ccc cat tgc aga aag cca ctc tgg 248
His Ser Leu Ala Leu Leu Gly Val Pro His Cys Arg Lys Pro Leu Trp
        30                  35                  40

gct ggg tta ttg cta gct tcc gga acg acc tta ttc tgc acc agc ttt 296
Ala Gly Leu Leu Leu Ala Ser Gly Thr Thr Leu Phe Cys Thr Ser Phe
    45                  50                  55

tac tac cag gct ctg agt gga gac ccc agc atc cag act ttg gcc cct 344
Tyr Tyr Gln Ala Leu Ser Gly Asp Pro Ser Ile Gln Thr Leu Ala Pro
60                  65                  70                  75

gcg gga ggg acc ctg cta ctc ttg ggc tgg ctt gcc ttg gct ctt 389
Ala Gly Gly Thr Leu Leu Leu Leu Gly Trp Leu Ala Leu Ala Leu
                80                  85                  90

tgagctccct tttgcttaat tactgggttt tctgcgcagt tttttttttt aaagagttgg 449
agtaagaaga ggattaaaaa ggaaaggcaa acaaaaaaaa aaaaaaaa 497
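
The feature coordinates and the residue numbering of a record can be cross-checked against each other. The following is a minimal sketch in Python, not part of the original listing; the helper names `parse_location`, `span_length` and `residue_counts` are hypothetical. It assumes that `<222>` locations are inclusive 1-based nucleotide ranges and that the CDS range excludes the stop codon, as the numbering of SEQ ID NO: 32 suggests.

```python
import re


def parse_location(text):
    """Parse a <222> location such as '51..389'.

    Whitespace is stripped first so that OCR-damaged forms like
    '51. .389' or '4 62.. 4 67' are still accepted.
    """
    compact = re.sub(r"\s+", "", text)
    match = re.fullmatch(r"(\d+)\.\.(\d+)", compact)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(location):
    """Length in nucleotides of an inclusive (start, end) range."""
    start, end = location
    return end - start + 1


def residue_counts(cds, sig_peptide):
    """Return total, signal-peptide and mature residue counts."""
    cds_len = span_length(cds)
    sig_len = span_length(sig_peptide)
    if cds_len % 3 or sig_len % 3:
        raise ValueError("feature length is not a whole number of codons")
    total = cds_len // 3
    signal = sig_len // 3
    return {"total": total, "signal": signal, "mature": total - signal}


# Coordinates as given for SEQ ID NO: 32 (CDS 51..389, sig_peptide 51..119).
counts = residue_counts(parse_location("51. .389"), parse_location("51. .119"))
print(counts)  # {'total': 113, 'signal': 23, 'mature': 90}
```

Under these assumptions the result agrees with the residue numbering of SEQ ID NO: 32, which runs from -23 (Met) to 90 (Leu).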

<210> 33
<211> 484
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 14..352

<220>
<221> sig_peptide
<222> 14..82
<223> Von Heijne matrix
score 6.43297166361141
seq LGALSGAAALGFA/SY

<220>
<221> polyA_site
<222> 469..484

<220>
<221> polyA_signal
<222> 443..448

<400> 33

ctgttctaca gct atg gcc ggg cca gct gca gct ttc cgc cgc ttg ggc 49
               Met Ala Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly
                           -20                 -15

gcc ttg tcc gga gct gcg gcc tta ggc ttc gct tcc tac ggg gcg cac 97
Ala Leu Ser Gly Ala Ala Ala Leu Gly Phe Ala Ser Tyr Gly Ala His
    -10                 -5                  1               5

ggc gcc maa ttc cca gat gcc tac ggg aag gag ctg ttt gac aag gcc 145
Gly Ala Xaa Phe Pro Asp Ala Tyr Gly Lys Glu Leu Phe Asp Lys Ala
                10                  15                  20

aac aaa         tty tta     agc ctg gcc ctg tta ggg gtg         193
Asn Lys His His Phe Leu His Ser Leu Ala Leu Leu Gly Val Pro His
            25                  30                  35

tgc aga aag cca ctc tgg gct ggg tta ttg cta gct tcc gga     acc 241
Cys Arg Lys Pro Leu Trp Ala Gly Leu Leu Leu Ala Ser Gly Thr Thr
        40                  45                  50

tta tty     acc agc ttt tac tac cag gct ytg agt gga             289
Leu Phe Cys Thr Ser Phe Tyr Tyr Gln Ala Leu Ser Gly Asp Pro Ser
    55                  60                  65

atc cag act ttg gcc cct gcg gga ggg acc ctg cta ctc ttg ggc tgg 337
Ile Gln Thr Leu Ala Pro Ala Gly Gly Thr Leu Leu Leu Leu Gly Trp
70                  75                  80                  85

ctt gcc ttg gct ctt tgagctccct tttgattaat tactgggttt tctgggcagt 392
Leu Ala Leu Ala Leu
                90



























tttttttttt taaagagttg gagtaagaag aggattaaaa aggaaaggca aataaacttt 452 
ggagtctttg ttcatcaaaa aaaaaaaaaa aa 484 



<210> 34
<211> 739
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 111..272

<220>
<221> sig_peptide
<222> 111..161
<223> Von Heijne matrix
score 6.48133858229514
seq LTFAQLLFATVLG/IA

<220>
<221> polyA_site
<222> 718..739

<220>
<221> polyA_signal
<222> 688..693

<400> 34

aattgtagac gtcagtgttg gtgttaacat ttcagatttg aagagatttc ctgggtgtgg 60 
agtgtgactt tccaaaacca gcttttcctt gagctgtatt tgttgcagca atg ttt 116 

                                                       Met Phe

agg aga ttg act ttt gca caa ctg ctt ttt gcc act gtc ctt gga att 164
Arg Arg Leu Thr Phe Ala Gln Leu Leu Phe Ala Thr Val Leu Gly Ile
-15                 -10                 -5                  1

gct gga gga gta tat att ttt caa cca gta ttt gaa cag tat gcc aaa 212
Ala Gly Gly Val Tyr Ile Phe Gln Pro Val Phe Glu Gln Tyr Ala Lys
            5                   10                  15

gat cag aag gaa tta aaa gaa aag atg cag ttg gta caa gaa tca gaa 260
Asp Gln Lys Glu Leu Lys Glu Lys Met Gln Leu Val Gln Glu Ser Glu
        20                  25                  30

gag aag aaa agt taatactaca tggagttagg cctggcgcag tggctcacgc 312
Glu Lys Lys Ser
    35

ctgtaatccc agcactttgg gaggccgagg cgggtggatc aggtggtcag gagttcaaga 372
ccagcctgac caacatggtg aaaccctgtc tctactaaaa atacaaaaat cagccaggct 432
tggtggcatg cgcctgtaat cccagctact tgggaggctg aggcaggagc atcacttgaa 492

cctgggaggc agaggtggca gtgagtcaag atcacgctgc tgcactccag cctgggtgac 552 
agagcgagac tccatctaaa aaaacaaaaa gcaaaaaaac ccacaaatac ctcatggaga 612 
tgaactgtaa taattgctta aagttccatt taattatgtt aactctaatc tagcaaaaac 672 
atagatgtac ttaaaaataa atcatggata atgatttttt aacccaaaaa aaaaaaaaaa 732 
tgtgaaa 739 

<210> 35
<211> 432
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 47..295

<220>
<221> sig_peptide
<222> 47..91
<223> Von Heijne matrix
score 6.32210467320472
seq LFLSLPVLVVVLS/IV

<220>
<221> polyA_site
<222> 420..432

<220>
<221> polyA_signal
<222> 395..400

<400> 35

aaccaagccc tccagcaagg attcagagtg cccctccggc ctcgcc atg agg ctc 55
                                                   Met Arg Leu
                                                   -15

ttc ctg     ctt ccg gtc ctg gtg gtg gtt ctg tcg atc gtc ttg gaa 103
Phe Leu Ser Leu Pro Val Leu Val Val Val Leu Ser Ile Val Leu Glu
        -10                 -5                  1

ggc cca gcc cca gcc cag ggg acc cca gac gtc tcc agt gcc ttg gat 151
Gly Pro Ala Pro Ala Gln Gly Thr Pro Asp Val Ser Ser Ala Leu Asp
5                   10                  15                  20

aag ctg aag gag ttt gga aac aca ctg gag gac aag gct cgg gaa ctc 199
Lys Leu Lys Glu Phe Gly Asn Thr Leu Glu Asp Lys Ala Arg Glu Leu
                25                  30                  35

atc agc cgc atc aaa cag agt gaa ctt tct gcc aag atg cgg gag tgg 247
Ile Ser Arg Ile Lys Gln Ser Glu Leu Ser Ala Lys Met Arg Glu Trp
            40                  45                  50

ttt tca gag aca ttt cag aaa gtg aag gat aaa ctc aag att gac tca 295
Phe Ser Glu Thr Phe Gln Lys Val Lys Asp Lys Leu Lys Ile Asp Ser
        55                  60                  65

tgaggacctg aagggtgaca tcccaggagg ggcctctgaa atttcccaca ccccagcgcc 355
tgtgctgagg actccctcca tgtggcccca ggtgccacca ataaaaatcc tacagaaaac 415
taawaaaaaa aaaaaaa 432 



<210> 36
<211> 839
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 66..359

<220>
<221> sig_peptide
<222> 66..194
<223> Von Heijne matrix
score 6.07207487250823
seq FLIYVALLRVSEC/LP

<220>
<221> polyA_site
<222> 824..839

<220>
<221> polyA_signal
<222> 805..810

<400> 36

gggcgcgagg cggccaccgt ggagagcaga gcgcggcggc tggaagctgc taagtcagag 60

ccgcg atg ttc cgg att gag ggc ctc gcg ccg aag ctg gac ccg gag gag 110
      Met Phe Arg Ile Glu Gly Leu Ala Pro Lys Leu Asp Pro Glu Glu
                  -40                 -35                 -30

atg aaa cgg aag atg cgc gag gat gtg atc tcc tcc ata cgg aac ttt 158
Met Lys Arg Lys Met Arg Glu Asp Val Ile Ser Ser Ile Arg Asn Phe
            -25                 -20                 -15

Val Ala Leu Leu Arg Val Se



etc 


ate 


tae 


Leu 


He 


Tyr 






-10 


gac 


tgt 


gat 


Asp 


Cys 




5 






agg 


ggt 


eat 


Arg 


Gly 


His 


gga 


ggc 




Gly 


Gly 


Ser 






ata 


His 


His 


He 



: ggg gag ete 
: Gly Glu Leu Th: 



gte 


agt gag 


tgt 


etc 


ccg 


ggc 


tgt 


Val 


Ser Glu 


Cys 


Leu 

1 


Pro 


Gly Cys 


acc 


gac ggg 


cac 


cec 


tta 




eta 


Thr 


Asp Gly 


His 


Pro 


Leu 


Thr 


Leu 




15 










20 


gag 




ggt 


age 


ggg 


gag 




Glu 


Leu Asn 


Gly 


Ser 


Gly 


Glu 


Gin 




30 








35 






gga cag 


cat 


atg 


aag 


aca 


gga 


He 


Gly Gin 


His 


Met 


Lys 


Thr 


Gly 


45 








50 







tgaatgcacg atatgaagag cctggttaca gtttcgactc 399 
55 

ctctctgcaa gtgaataggc ccagaaaggt gtaagagact ctttgaatgg acataaaatt 459

ctgcttgtta agaacaagtt tggctctggt aactgacctt caaagctaaa atataaaact 519

atttgggaag tatgaaacga tgtctcgtga tctggtgtac ccttatccct gtgacgtttg 579

gcctctgaca atactggtat aattgtaaat aatgtcaaac tccgttttct agcaagtatt 639

aagggagctg tgtctgaaat ggcactgtct tgtcagtcat ttctgtttac ctttttcttc 699

tgcccagagt gtatttgtga aragtctctt atattatgtt ttgtggaaat cagcacacaa 759

ccacaatgac atttaagcac aggatcatta ttagtctatg tttttaataa acatatcaat 819 

tatgaaaaaa aaaaaaaaaa 839 



<210> 37
<211> 1114
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 117..362

<220>
<221> sig_peptide
<222> 117..245
<223> Von Heijne matrix
score 5.65876793443964
seq VVSFALIATLVYA/LF

<220>
<221> polyA_site
<222> 1083..1114

<400> 37

ataaggggac gtctagtggg ttgcccggga ggggtggcgg gagcggtcct ggaaataatc 60 
tgtcctctgt cgccgggaac tggcgaggta gttccttcgc ggtggagaga cctgga atg 119 

Met 



gcc aaa tat caa ggt gaa gtt caa agt ttg     ctg gat gat gat tca 167
Ala Lys Tyr Gln Gly Glu Val Gln Ser Leu Lys Leu Asp Asp Asp Ser
        -40                 -35                 -30

gtt ata gaa gga gta agc gac     gta ctt gtg gca gtt gtg gtc agt 215
Val Ile Glu Gly Val Ser Asp Gln Val Leu Val Ala Val Val Val Ser
    -25                 -20                 -15

ttc gct ttg att gct acc ctg gta tat gca ctt ttc aga aat gta cat 263
Phe Ala Leu Ile Ala Thr Leu Val Tyr Ala Leu Phe Arg Asn Val His
-10                 -5                  1               5

caa aac att cac cca gaa     cag gag cta gta agg gta ctt cga gaa 311
Gln Asn Ile His Pro Glu Asn Gln Glu Leu Val Arg Val Leu Arg Glu
            10                  15                  20

    ctt caa aca gaa cag     gca     gct gac         gca gtt cta 359
Gln Leu Gln Thr Glu Gln Asp Ala Pro Ala Asp Ser Thr Ala Val Leu
        25                  30                  35
cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 412
His

ccaactgtgg acatcttttt tgtggtgcct gcattattgc ttactggcga tatggttcat 472
ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatggccac attgcattgt 532
ccagaacagc ttagaccatg acagttagca tcgaagccac ctgaggaggg aggcagtaac 592
cttactccta acagtatttg gtgaagatga tcagtctcag gatgttctga gattgcatca 652
ggatattaat gattataacc ggagattctc agggcaaccc agatctgtaa gtaatgctaa 712
agcatgttca aagttagagg aagacacatt tcttctcttt tgtaaagtga ggtttaccaa 772
caagtattct ttgactatga gaaatcttgg ccaggcacag tagctaacgc ctataatcct 832
agcactttgg gaggccaagg caggtggatc acttgagccc aggagtttga gaccagcctt 892
ggaaacatga tgaaacccca tctctagaaa aaacaccaaa aaattggaca agagtgttgg 952
cacatgcctg tagtccctgc ttcttgggag gctgaaatgg gaggatcacc tgagcccagg 1012
aggttgaggc tatagtgagc catgatcgca ctattgcact cccacctggg tggcagtgag 1072
acccttcctc aaaaaacaag aaaagaaaaa aaaaaaaaaa aa 1114

<210> 38
<211> 1112
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 117..362

<220>
<221> sig_peptide
<222> 117..245
<223> Von Heijne matrix
score 5.65876793443964
seq VVSFALIATLVYA/LF

<220>
<221> polyA_site
<222> 1097..1112

<220>
<221> polyA_signal
<222> 1089..1094

<400> 38

ataaggggac gtctagtggg ttgcccggga ggggtggcgg gagcggtcct ggaaataatc 60
tgtcctctgt cgccgggaac tggcgaggta gttccttcgc ggtggagaga cctgga atg 119
                                                              Met

gcc aaa tat caa ggt gaa gtt caa agt ttg     ctg gat gat gat tca 167
Ala Lys Tyr Gln Gly Glu Val Gln Ser Leu Lys Leu Asp Asp Asp Ser
        -40                 -35                 -30

gtt ata gaa gga gta agc gac caa gta ctt gtg gca gtt gtg gtc agt 215
Val Ile Glu Gly Val Ser Asp Gln Val Leu Val Ala Val Val Val Ser
    -25                 -20                 -15

ttc gct ttg att gct acc ctg gta tat gca ctt ttc aga aat gta cat 263
Phe Ala Leu Ile Ala Thr Leu Val Tyr Ala Leu Phe Arg Asn Val His
-10                 -5                  1               5

caa     att cac cca gaa aac cag gag cta gta agg gta ctt cga gaa 311
Gln Asn Ile His Pro Glu Asn Gln Glu Leu Val Arg Val Leu Arg Glu
            10                  15                  20

cag ctt caa aca gaa cag gat gca cct gct gac tcg aca gca gtt cta 359
Gln Leu Gln Thr Glu Gln Asp Ala Pro Ala Asp Ser Thr Ala Val Leu
        25                  30                  35

cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 412
His

ccaactgtgg acatcttttt tgtggtgcct gcattattgc ttactggcga tatggttcat 472
ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatggccac attgcattgt 532
ccagaacagc ttagaccatg acagttagca tcgaagccac ctgaggaggg aggcagtaac 592
cttactccta acagtatttg gtgaagatga tcagtctcag gatgttctga gattgcatca 652
ggatattaat gattataacc ggagattctc agggcaaccc agatctgtaa gtaatgctaa 712
agcatgttca aagttagagg aagacacatt tcttctcttt tgtaaagtga ggtttaccaa 772
caagtattct ttgactatga gaaatcttgg ccaggcacag tagtaacgcc tataatccta 832
gcactttggg aggccaaggc aggtggatca cttgagccca ggagtttgag accagccttg 892
gaaacatgat gaaaccccat ctctagaaaa aacaccaaaa aattggacaa gagtgttggc 952
acatgcctgt agtccctgct tcttgggagg ctgaaatggg aggatcacct gagcccagga 1012
ggttgaggct atagtgagcc atgatcgcac tattgcactc ccacctgggt ggcagtgaga 1072
cccttcctca aaaaacaaga aaagaaaaaa aaaaaaaaaa 1112



<210> 39
<211> 1112
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 117..362

<220>
<221> sig_peptide
<222> 117..245
<223> Von Heijne matrix
score 5.65876793443964
seq VVSFALIATLVYA/LF

<220>
<221> polyA_site
<222> 1097..1112

<220>
<221> polyA_signal
<222> 1089..1094

<400> 39

ataaggggac gtctagtggg ttgcccggga ggggtggcgg gagcggtcct ggaaataatc 60 
tgtcctctgt cgccgggaac tggcgaggta gttccttcgc ggtggagaga cctgga atg 119 

Met 



gcc aaa tat caa ggt gaa gtt caa agt ttg aaa ctg gat gat gat tca 167
Ala Lys Tyr Gln Gly Glu Val Gln Ser Leu Lys Leu Asp Asp Asp Ser
        -40                 -35                 -30

gtt ata gaa gga gta agc gac     gta ctt gtg gca gtt gtg gtc agt 215
Val Ile Glu Gly Val Ser Asp Gln Val Leu Val Ala Val Val Val Ser
    -25                 -20                 -15

ttc gct ttg att gct     ctg gta tat     ctt ttc aga aat gta cat 263
Phe Ala Leu Ile Ala Thr Leu Val Tyr Ala Leu Phe Arg Asn Val His
-10                 -5                  1               5

    aac att cac cca gaa aac cag gag cta gta agg gta ctt cga gaa 311
Gln Asn Ile His Pro Glu Asn Gln Glu Leu Val Arg Val Leu Arg Glu
            10                  15                  20

cag ctt caa aca gaa cag             gst     tcg         gtt cta 359
Gln Leu Gln Thr Glu Gln Asp Ala Pro Xaa Asp Ser Thr Ala Val Leu
        25                  30                  35

cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 412
His

ccaactgtgg acatcttttt tgtggtgcct gcattattgc ttactggcga tatggttcat 472

ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatggccac attgcattgt 532

ccagaacagc ttagaccatg acagttagca tcgaagccac ctgaggaggg aggcagtaac 592

cttactccta acagtatttg gtgaagatga tcagtctcag gatgttctga gattgcatca 652

ggatattaat gattataacc ggagattctc agggcaaccc agatctgtaa gtaatgctaa 712 

agcatgttca aagttagagg aagacacatt tcttctcttt tgtaaagtga ggtttaccaa 772 

caagtattct ttgactatga gaaatcttgg ccaggcacag tagtaacgcc tataatccta 832 

gcactttggg aggccaaggc aggtggatca cttgagccca ggagtttgag accagccttg 892 
gaaacatgat gaaaccccat ctctagaaaa aacaccaaaa aattggacaa gagtgttggc 952
acatgcctgt agtccctgct tcttgggagg ctgaaatggg aggatcacct gagcccagga 1012
ggttgaggct atagtgagcc atgatcgcac tattgcactc ccacctgggt ggcagtgaga 1072
cccttcctca aaaaacaaga aaagaaaaaa aaaaaaaaaa 1112

<210> 40
<211> 1112
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 117..362

<220>
<221> sig_peptide
<222> 117..245
<223> Von Heijne matrix
score 5.65876793443964
seq VVSFALIATLVYA/LF

<220>
<221> polyA_site
<222> 1097..1112

<220>
<221> polyA_signal
<222> 1089..1094

<400> 40

ataaggggac gtctagtggg ttgcccggga ggggtggcgg gagcggtcct ggaaataatc 60
tgtcctctgt cgccgggaac tggcgaggta gttccttcgc ggtggagaga cctgga atg 119
                                                              Met

gcc aaa tat caa ggt gaa gtt caa agt ttg aaa ctg gat gat gat tca 167
Ala Lys Tyr Gln Gly Glu Val Gln Ser Leu Lys Leu Asp Asp Asp Ser
        -40                 -35                 -30

gtt ata gaa gga gta agc gac caa gta ctt gtg gca gtt gtg gtc agt 215
Val Ile Glu Gly Val Ser Asp Gln Val Leu Val Ala Val Val Val Ser
    -25                 -20                 -15

ttc gct ttg att gct acc ctg gta tat gca ctt ttc aga aat gta cat 263
Phe Ala Leu Ile Ala Thr Leu Val Tyr Ala Leu Phe Arg Asn Val His
-10                 -5                  1               5

caa aac att cac cca gaa aac cag gag cta gta agg gta ctt cga gaa 311
Gln Asn Ile His Pro Glu Asn Gln Glu Leu Val Arg Val Leu Arg Glu
            10                  15                  20

cag ctt caa aca gaa cag gat gca cct gct gac tcg aca gca gtt cta 359
Gln Leu Gln Thr Glu Gln Asp Ala Pro Ala Asp Ser Thr Ala Val Leu
        25                  30                  35

cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 412
His

ccaactgtgg acatcttttt tgtggtgcct gcattattgc ttactggcga tatggttcat 472

ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatggccac attgcattgt 532

ccagaacagc ttagaccatg acagttagca tcgaagccac ctgaggaggg aggcagtaac 592

cttactccta acagtatttg gtgaagatga tcagtctcag gatgttctga gattgcatca 652

ggatattaat gattataacc ggagattctc agggcaaccc agatctgtaa gtaatgctaa 712

agcatgttca aagttagagg aagacacatt tcttctcttt tgtaaagtga ggtttaccaa 772

caagtattct ttgactatga gaaatcttgg ccaggcacag tagtaacgcc tataatccta 832

gcactttggg aggccaaggc aggtggatca cttgagccca ggagtttgag accagccttg 892

gaaacatgat gaaaccccat ctctagaaaa aacaccaaaa aattggacaa gagtgttggc 952 

acatgcctgt agtccctgct tcttgggagg ctgaaatggg aggatcacct gagcccagga 1012 

ggttgaggct atagtgagcc atgatcgcac tattgcactc ccacctgggt ggcagtgaga 1072 

cccttcctca aaaaacaaga aaagaaaaaa aaaaaaaaaa 1112 
<210> 41
<211> 508
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 17..313

<220>
<221> sig_peptide
<222> 17..100
<223> Von Heijne matrix
score 5.67441405983879
seq LGYLVLSEGAVLA/SX

<220>
<221> polyA_site
<222> 493..508

<220>
<221> polyA_signal
<222> 468..473

<400> 41

agcaccgaag actgcg atg act tct gca ctg acc cag ggg ctg gag cga atc 52
                  Met Thr Ser Ala Leu Thr Gln Gly Leu Glu Arg Ile
                              -25                 -20

cca gac cag ctc ggc tac ctg gta ctg agt gaa ggt gca gtg ctg gcg 100
Pro Asp Gln Leu Gly Tyr Leu Val Leu Ser Glu Gly Ala Val Leu Ala
    -15                 -10                 -5

tca tyt ggg gac ctg gag aat gat gag cag gca gcc agt gcc atc tyt 148
Ser Xaa Gly Asp Leu Glu Asn Asp Glu Gln Ala Ala Ser Ala Ile Xaa
1               5                   10                  15

gag ctg gtc agc aca gcc tgc ggt ttc cgg ctg cac cgc ggc atg aat 196
Glu Leu Val Ser Thr Ala Cys Gly Phe Arg Leu His Arg Gly Met Asn
            20                  25                  30

gtg ccc ttc aag cgc ctg tct gtg gtc ttt gga gaa cac aca ctg ctg 244
Val Pro Phe Lys Arg Leu Ser Val Val Phe Gly Glu His Thr Leu Leu
        35                  40                  45

gtg acg gtg tca gga cag agg gtg ttt gtg gtg aag agg cag aac cga 292
Val Thr Val Ser Gly Gln Arg Val Phe Val Val Lys Arg Gln Asn Arg
    50                  55                  60

ggt cgg gag ccc att gat gtc tgagcctgcc ggagggcgag ggtcggagaa 343
Gly Arg Glu Pro Ile Asp Val
65                  70

gcggattggg ccttctgctc gcccacctcc cacccctacc tggacgggcc caggcttggg 403

gactctgagc tgtgttaagg agaacaaggg caaggagacc tccctttgtg ctccctcact 463

ccctaataaa catgagtctg atgttctcca aaaaaaaaaa aaaaa 508

<210> 42
<211> 623
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 69..365

<220>
<221> sig_peptide
<222> 69..152
<223> Von Heijne matrix
score 5.54338579743238
seq LGYLVLSEGAVLA/SS

<220>
<221> polyA_site
<222> 608..623

<220>
<221> polyA_signal
<222> 582..587

<400> 42

acgtgaccgg ggcctgaagc cggaagctac ctatctggta gggagctccc ccagcaccga 60
agactgcg atg act tct gca ctg acc cag ggg ctg gag cga atc cca gac 110
         Met Thr Ser Ala Leu Thr Gln Gly Leu Glu Arg Ile Pro Asp
                     -25                 -20                 -15

cag ctc ggc tac ctg gta ctg agt gaa ggt     gtg ctg gcg tca tct 158
Gln Leu Gly Tyr Leu Val Leu Ser Glu Gly Ala Val Leu Ala Ser Ser
                -10                 -5                  1

ggg gac ctg gag aat gat gag cag gca gcc agt gcc atc tyt gag ctg 206
Gly Asp Leu Glu Asn Asp Glu Gln Ala Ala Ser Ala Ile Xaa Glu Leu
        5                   10                  15

gtc agc aca gcc tgc ggt ttc cgg ctg cac cgc ggc atg aat gtg ccc 254
Val Ser Thr Ala Cys Gly Phe Arg Leu His Arg Gly Met Asn Val Pro
    20                  25                  30

ttc aag cgc ctg tct gtg gtc ttt gga gaa cac aca ctg ctg gtg acg 302
Phe Lys Arg Leu Ser Val Val Phe Gly Glu His Thr Leu Leu Val Thr
35                  40                  45                  50

gtg tca gga cag agg gtg ttt gtg gtg aag agg cag aac cga ggt cgg 350
Val Ser Gly Gln Arg Val Phe Val Val Lys Arg Gln Asn Arg Gly Arg
                55                  60                  65

gag ccc att gat gtc tgagcctgcc ggagggcgag ggtcggagaa gcggattggg 405
Glu Pro Ile Asp Val
            70

tcctgggcct ctgtgatgag gcaggcacac ctgtcggtct tggcttgctg ctagaactag 465

ggccttytgs tcgcccacct cccaccccta cctggacggg cccaggcttg gggactytga 525

gctgtgttaa ggagaacaag ggcaaggaga cctccctttg tgctccctca ctccctaata 585

aacatgagtc tgatgttctc cgaaaaaaaa aaaaaaaa 623

<210> 43
<211> 1166
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 2..559

<220>
<221> sig_peptide
<222> 2..46
<223> Von Heijne matrix
score 5.34456047740956
seq VMAASMARGGVSA/RV

<220>
<221> polyA_site
<222> 1151..1166

<400> 43

g atg cat gtc atg gcc gcc tcc atg gcc cgg gga ggc gtg agt gcc agg 49
  Met His Val Met Ala Ala Ser Met Ala Arg Gly Gly Val Ser Ala Arg
  -15                 -10                 -5                  1

gtt cta ctg cag gct gcc agg ggc acc tgk tgg aac aga cct ggg ggc 97
Val Leu Leu Gln Ala Ala Arg Gly Thr Xaa Trp Asn Arg Pro Gly Gly
                                10                  15

act tcc ggg tcg ggg gag ggg gtg gcg ctg ggg aca     aga aag ttt 145
Thr Ser Gly Ser Gly Glu Gly Val Ala Leu Gly Thr Thr Arg Lys Phe
        20                  25                  30

caa gcg aca kgc tcg cgc ccg gct gga gag gag gac gcg ggc ggc ccg 193
Gln Ala Thr Xaa Ser Arg Pro Ala Gly Glu Glu Asp Ala Gly Gly Pro
    35                  40                  45

gag cgg     ggg     gtg gtg     gtg gtg ttc gta     cgc tca ggc 241
Glu Arg Pro Gly Asp Val Val Asn Val Val Phe Val Asp Arg Ser Gly
50                  55                  60                  65

cag cgg atc cca gtg agt ggc aga gtc ggg gac     gtt ctt cac ctg 289
Gln Arg Ile Pro Val Ser Gly Arg Val Gly Asp Asn Val Leu His Leu
                70                  75                  80

gcc cag cgc cac ggg gtg gac ctg gaa ggg gcc tgt gaa gcc tcc ctg 337
Ala Gln Arg His Gly Val Asp Leu Glu Gly Ala Cys Glu Ala Ser Leu
            85                  90                  95

gcc tgc tcc acc tgc cat gtg tat gtg agt gaa gac cac ctg gat ctc 385
Ala Cys Ser Thr Cys His Val Tyr Val Ser Glu Asp His Leu Asp Leu
        100                 105                 110

ctg cct cct ccc gag gag agg gaa gac gac atg cta gac atg gcc ccc 433
Leu Pro Pro Pro Glu Glu Arg Glu Asp Asp Met Leu Asp Met Ala Pro
    115                 120                 125

ctc ctc cag gag aac tcg cgg ctg ggc tgc cag att gtg ctg aca ccg 481
Leu Leu Gln Glu Asn Ser Arg Leu Gly Cys Gln Ile Val Leu Thr Pro
130                 135                 140                 145

gag ctg gaa rga gcg gaa ttc acc ctg ccc aag atc acc agg aac ttc 529
Glu Leu Glu Xaa Ala Glu Phe Thr Leu Pro Lys Ile Thr Arg Asn Phe
                150                 155                 160

tac gtg gat ggc cat gtc ccc aag ccc cac tgacatgaac acctggacca 579
Tyr Val Asp Gly His Val Pro Lys Pro His
            165                 170

ttccacattg ccatggcccc agggcccaga ttgagggaat agccaggtgc cagccctgcc 639
cagagtgcgg acaggcccgg gagagacgtg gaagcccctg tgaaggacaa cacccctgct 699
tgggagagag tcccatgtcc aggctctggt ggggacaggg cccctagtgg ggtggccttc 759
cccaggcccc tgagaatcag ggtttgagta ggagtggact catattggag ctgcaataaa 819
tcgataacac aggccccagc atgttgagtg tccttggggg acagatctgg ggtctacagg 879 
tggctcacac ctgtaatcct agcactttgg gaagccaaga tgggaggatc acttgaggcc 939 
aggagcttaa gaccatcctg ggcaacatgg cgagacctcg tctctataaa aacagtaaaa 999 
attagcgtgg agtggtggta agtgcctgta gtcccagcta ctcgggagac tgaggtggga 1059 
ggatcactta gcccagggga tttaggctgc agtggagcgg tgatcaggcc attccactcc 1119 
agcctgggtg acagagtgag gccctgtctc taaaaaaaaa aaaaaaa 1166 

<210> 44
<211> 649
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 18..386

<220>
<221> sig_peptide
<222> 18..161
<223> Von Heijne matrix
score 5.26064226788865
seq LLMLLLFLSELQY/YL

<220>
<221> polyA_site
<222> 634..649

<220>
<221> polyA_signal
<222> 611..616

<400> 44

ctttccggcc ggtcccc atg gag gcg ctg ggg aag ctg aag cag ttc gat 50
                   Met Glu Ala Leu Gly Lys Leu Lys Gln Phe Asp
                               -45                 -40

gcc tac ccc aag act ttg gag gac ttc cgg gtc aag acc tgc ggg ggc 98
Ala Tyr Pro Lys Thr Leu Glu Asp Phe Arg Val Lys Thr Cys Gly Gly
        -35                 -30                 -25

gcc acc gtg acc att gtc agt ggc ctt ctc atg ctg cta ctg ttc ctg 146
Ala Thr Val Thr Ile Val Ser Gly Leu Leu Met Leu Leu Leu Phe Leu
    -20                 -15                 -10

tcc gag ctg cag tat tac ctc acc acg gag gtg cat cct gag ctc tac 194
Ser Glu Leu Gln Tyr Tyr Leu Thr Thr Glu Val His Pro Glu Leu Tyr
-5                  1               5                   10

gtg gac aag tcg cgg gga gat aaa ctg aag atc aac atc gat gta ctt 242
Val Asp Lys Ser Arg Gly Asp Lys Leu Lys Ile Asn Ile Asp Val Leu
            15                  20                  25

ttt ccg cac atg cct tgt gcc tat ctg agt att gat gcc atg gat gtg 290
Phe Pro His Met Pro Cys Ala Tyr Leu Ser Ile Asp Ala Met Asp Val
        30                  35                  40

gcc gga gaa cag car ctg gat gtg gaa cac aac ctg ttc aag caa cga 338
Ala Gly Glu Gln Gln Leu Asp Val Glu His Asn Leu Phe Lys Gln Arg
    45                  50                  55

cta gat aaa gat ggc atc     gtg agc tca gag gct gag cgg cat gmt 386
Leu Asp Lys Asp Gly Ile Pro Val Ser Ser Glu Ala Glu Arg His Xaa
60                  65                  70                  75

taatgcagga caccagtgga accaacataa tttgctatag gactggatgt ggagtggagg 446

agagcaaagt catggttgtt gattccaacg ctttcagcct gagcaaatga atgacttgag 506

atggagatga ctggaggaga aggttttttt ggtgggaggg aggagtggct attggatatc 566

cmctgctgtg taacaaagta ctccaaaagt tagtagtttg atacaataaa aatgtgtttt 626

ttcagtcaaa aaaaaaaaaa aaa 649

<210> 45
<211> 836
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 87..317

<220>
<221> sig_peptide
<222> 87..173
<223> Von Heijne matrix
score 5.19818357425197
seq LLLKLQRLPQAEP/VE

<220>
<221> polyA_site
<222> 821..836

<220>
<221> polyA_signal
<222> 805..810

<400> 45

gagtgtgggg ccagccgtgg aggctccagg tgttctctct gccccagcag agcccggcag 60
gagccccaac aggaagccag cgcggc atg gct gcc acc gac ttc gtg cag gag 113
Met Ala Ala Thr Asp Phe Val Gln Glu





atg cgc gcc gtg ggc gag agg ctg ctg ctc aag ctg cag aga         161
Met Arg Ala Val Gly Glu Arg Leu Leu Leu Lys Leu Gln Arg Leu Pro
-20                 -15                 -10                 -5




cag gct gag ccc gtg gag atc gtg gcc ttc tca gtc atc atc ctt ttc 209
Gln Ala Glu Pro Val Glu Ile Val Ala Phe Ser Val Ile Ile Leu Phe
                1               5                   10

aca gct act gtt ctg ctg ttg ctg ctg ata gcc tgc agc tgc tgc tgc 257
Thr Ala Thr Val Leu Leu Leu Leu Leu Ile Ala Cys Ser Cys Cys Cys





act cac tgc tgc tgc cct gag cgg aga ggc agg aag gtc cag gtg cag 305
Thr His Cys Cys Cys Pro Glu Arg Arg Gly Arg Lys Val Gln Val Gln

30 35 40

ccg aca cca cca tgacggacgg gcgatggctg aggagaagct ggagaggaga 357
Pro Thr Pro Pro
45 

tggccaatgc catgacacag gccatcagcc tggccctgca gcccttaccc ctcaagacca 417 

ggctcccctg gccccagctc tggcccagcc caggtacctg gacactgaca acttgagccc 477 

taccaaggaa acaagggctg gtataggtgc aaacctctca tctgccagtg gacactgggt 537 

gctggggagt cagctgtttc aaagactggg tcaactgcct gggcttcttc gcctacctgc 597 

actttttaac aaaacaagga agtaggggtc cccatacctt gatggagaac agtccccacc 657

tgtgggcaat tggcccttgg ggctctgctg atacatgcca aagaggagca aggcaatcag 717

aggggctttg tgcaatagct tctgcatccg agctcccgcc agagcgtgag catgtcagta 777

ttctagtcca gtatttgcca gtttccaagt aaaagctttt gtgaaaaaaa aaaaaaaaa 836 



<210> 46 

<211> 482 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 17..337

<220>
<221> sig_peptide
<222> 17..97
<223> Von Heijne matrix
score 4.95181310301952
seq VLSTLSLVQFSPS/GR

<220>
<221> polyA_site
<222> 467..482

<220>
<221> polyA_signal
<222> 442..447



<400> 46 

actttagctg ctgcag atg ctg ttt gaa gaa gct ctg ccc ctc agc tgc tct 52
Met Leu Phe Glu Glu Ala Leu Pro Leu Ser Cys Ser
-25 -20
gat cct gtg ctt agc act ctt agc ctg gtg cag ttc agc ccc agt gga 100
Asp Pro Val Leu Ser Thr Leu Ser Leu Val Gln Phe Ser Pro Ser Gly
-15 -10 -5 1

agg acc cag gac ctg ctc tct cca ggg gtg gag aac ctg tcg gtg ctg 148
Arg Thr Gln Asp Leu Leu Ser Pro Gly Val Glu Asn Leu Ser Val Leu

5 10 15

gac gtg tcc cct ctg ggc ttg gcc tgc tgt ctg ctc act ctc acc atg 196
Asp Val Ser Pro Leu Gly Leu Ala Cys Cys Leu Leu Thr Leu Thr Met

tcc tgc cca ggg cct gac cct cct gag ggg ccc ggg acc cag cgt gtg 244

Ser Cys Pro Gly Pro Asp Pro Pro Glu Gly Pro Gly Thr Gln Arg Val

35 40 45

tgg caa ggg gct cta cgg atc cta cag ctc cca gga gcc cca gat ggg 292
Trp Gln Gly Ala Leu Arg Ile Leu Gln Leu Pro Gly Ala Pro Asp Gly
50 55 60 65

gtt tca cca tat cag cca gtc tgg tct cga act cct gac ctc aag 337
Val Ser Pro Tyr Gln Pro Val Trp Ser Arg Thr Pro Asp Leu Lys

70 75 80 

tgattcgcca gcctcgtcct cccaaagtgc tgggattaca gatgtgagcc accatgccca 397 
tggagaatgt attacttttg taatttaaaa aataatagta ataaaataaa gatctttaca 457 
atgtttcaga aaaaaaaaaa aaaaa 482 



<210> 47 

<211> 806 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 136..471

<220>
<221> sig_peptide
<222> 136..252
<223> Von Heijne matrix
score 4.89253405263876
seq IILFSAIVGFIYG/YV

<220>
<221> polyA_site
<222> 792..806

<220>
<221> polyA_signal
<222> 775..780



<400> 47 

ttcgctctcc cgggcttaga aggcccggct actgacgcgc agtgccagac cttacccctc 60 
acggtcctta agtctcggtc gccctcgcct cgcagcctgc cacccgcgct cagctgcccg 120 
cctcctcagc cagcc atg ctg gag cat ctg agc tcg ctg ccc acg cag atg 171

Met Leu Glu His Leu Ser Ser Leu Pro Thr Gln Met
-35 -30 



gat         ggc cag aag cta gct gaa cag atg ttt cag gga att att 219
Asp Tyr Lys Gly Gln Lys Leu Ala Glu Gln Met Phe Gln Gly Ile Ile
        -25                 -20                 -15










ctt ttt tct gca ata gtt gga ttt atc tac ggg tac gtg gct gaa cag 267
Leu Phe Ser Ala Ile Val Gly Phe Ile Tyr Gly Tyr Val Ala Glu Gln
    -10                 -5                  1               5




ttc ggg tgg act gtc tat ata gtt atg gcc gga ttt gct ttt tca tgt 315
Phe Gly Trp Thr Val Tyr Ile Val Met Ala Gly Phe Ala Phe Ser Cys
                10                  15                  20






ttg ctg aca ctt cct cca tgg ccc atc tat cgc cgg cat cct ctc aag 363
Leu Leu Thr Leu Pro Pro Trp Pro Ile Tyr Arg Arg His Pro Leu Lys
            25                  30                  35








tgg tta cct gtt caa gca cag acg aca aga aac cag ggg aaa gaa aaa 411
Trp Leu Pro Val Gln Ala Gln Thr Thr Arg Asn Gln Gly Lys Glu Lys
        40                  45                  50










tta aga ggc atg cta aaa ata att gag gtt ttc atg att cag cac ctg 459
Leu Arg Gly Met Leu Lys Ile Ile Glu Val Phe Met Ile Gln His Leu
    55                  60                  65

ctt ttg ttt ctg tgagatgagc taaattgctt tcatacccca gataagagct 511
Leu Leu Phe Leu
70

aaaaccacct aatgctctta tggcacagct gtgtatagat ttagttctct ttatacttca 571 

tttctagccc agttgggttt tgatttatat aagtagttta gaccttctct tcataatctt 631 

gctctgagat ggggaacaga acacacaagt atgaagtttc tttcaggtgt aaataatgaa 691 

aaataaatgc ctcataaatg atagtacaat gtaactatca aagttttata attcattatg 751 

agttaaccat tttaatgttt ccaattaaac ctcatagtgc aaaaaaaaaa aaaaa 806 

<210> 48 

<211> 582 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 148..360

<220>
<221> sig_peptide
<222> 148..216
<223> Von Heijne matrix
score 5.918755252201
seq SALMLLPCRPVLT/SV

<220>
<221> polyA_site
<222> 566..582

<220>
<221> polyA_signal
<222> 544..549

<400> 48 

aacgtcatct aggagcaccg agcagcttgg ctaaaagtaa gggtgtcgtg ctgatggccc 60 
tgtgcgcact gacccgckct sygctctctg aacctggcgc ccccgaccgt cgccgcccct 120 
gccccgagtc tgttccccgc cgcccag atg atg aac aat ggc ctc ctc caa cag 174

Met Met Asn Asn Gly Leu Leu Gln Gln
-20 -15 



ccc tct gcc ttg atg ttg ctc ccc tgc cgc cca gtt ctt act tct gtg 222
Pro Ser Ala Leu Met Leu Leu Pro Cys Arg Pro Val Leu Thr Ser Val
                -10                 -5                  1




gcc ctt aat gcc aac ttt gtg tcc tgg aag agt cgt acc aag tac acc 270
Ala Leu Asn Ala Asn Phe Val Ser Trp Lys Ser Arg Thr Lys Tyr Thr
        5                   10                  15




att aca cca gtg aag atg agg aag tct ggg ggc cga gac cac aca ggt 318
Ile Thr Pro Val Lys Met Arg Lys Ser Gly Gly Arg Asp His Thr Gly
    20                  25                  30












gct gga aac gtg cgt agc aac agt agg ccg agt atc caa cgt 360
Ala Gly Asn Val Arg Ser Asn Ser Arg Pro Ser Ile Gln Arg
35                  40                  45















tgatcataac aaacgggtca ttggcaaggc aggtcgcaac cgctggctgg gcaagaggcc 420 

taacagtggg cggtggcacc gcaagggggg ctgggctggc cgaaagattc ggccactacc 480 

ccccatgaag agttacgtga agctgccttc tgcttctgcc caaagctgat atccctgtac 540

tctaataaaa tgcccccccc ccctcaaaaa aaaaaaaaaa aa 582 



<210> 49 

<211> 583 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 149..361






WO 02/083898 

<220> 

<221> sig_peptide 
<222> 149..217
<223> Von Heijne matrix
score 5.918755252201
seq SALMLLPCRPVLT/SV

<220>
<221> polyA_site
<222> 567..583

<220>
<221> polyA_signal
<222> 545..550

<400> 49 

aacgtcatct aggagcaccg agcagcttgg ctaaaagtaa gggtgtcgtg ctgatggccc 60 
tgtgcgcact gacccgcgct ctgcstctct gaacctggcg cccccgaccg tcgccgcccc 120 
tgccccgagt ctgttccccg ccgcccag atg atg aac aat ggc ctc ctc caa 172
Met Met Asn Asn Gly Leu Leu Gln
-20 



cag ccc tct gcc ttg atg ttg ctc ccc tgc cgc cca gtt ctt act tct 220
Gln Pro Ser Ala Leu Met Leu Leu Pro Cys Arg Pro Val Leu Thr Ser
-15                 -10                 -5                  1




gtg gcc ctt aat gcc aac ttt gtg tcc tgg aag agt cgt acc aag tac 268
Val Ala Leu Asn Ala Asn Phe Val Ser Trp Lys Ser Arg Thr Lys Tyr
            5                   10                  15




acc att aca cca gtg aag atg agg aag tct ggg ggc cga gac cac aca 316
Thr Ile Thr Pro Val Lys Met Arg Lys Ser Gly Gly Arg Asp His Thr
        20                  25                  30




ggt gct gga aac gtg cgt agc aac agt agg ccg agt atc caa cgt 361
Gly Ala Gly Asn Val Arg Ser Asn Ser Arg Pro Ser Ile Gln Arg
    35                  40                  45













tgatcataac aaacgggtca ttggcaaggc aggtcgcaac cgctggctgg gcaagaggcc 421

taacagtggg cggtggcacc gcaagggggg ctgggctggc cgaaagattc ggccactacc 481

ccccatgaag agttacgtga agctgccttc tgcttctgcc caaagctgat atccctgtac 541

tctaataaaa tgcccccccc ccctcaaaaa aaaaaaaaaa aa 583 



<210> 50 

<211> 746 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 83..358

<220>
<221> sig_peptide
<222> 83..127
<223> Von Heijne matrix
score 4.68058603039206
seq VLAGSLLGPTSRS/AA

<400> 50 

ctttttttcc tttgttccag ggctggagcg gctctgggct ccggaatcgc ccgcagccgg 60 
tactgcggga cccactgcgg at atg gct gtc ttg gct gga tcc ctg ttg ggc 112

Met Ala Val Leu Ala Gly Ser Leu Leu Gly

-15 -10
ccc acg agt agg tcg gca gcg ttg ctg ggt ggc agg tgg ctc cag ccc 160
Pro Thr Ser Arg Ser Ala Ala Leu Leu Gly Gly Arg Trp Leu Gln Pro
-5 1 5 10

cgg gcc tgg ctg ggg ttc cca gac gcc tgg ggc ctc ccc acc ccg cag 208



Arg Ala Trp Leu Gly Phe Pro Asp Ala Trp Gly Leu Pro Thr Pro Gln

15 20 25 

cag gcc cgg ggc aag gct cgc ggg aat gag tat cag ccg agc aac atc 256
Gln Ala Arg Gly Lys Ala Arg Gly Asn Glu Tyr Gln Pro Ser Asn Ile

30 35 40

aaa cgc aag aac aag cac ggc tgg gtc cgg cgc ctg agc acg cca gcc 304
Lys Arg Lys Asn Lys His Gly Trp Val Arg Arg Leu Ser Thr Pro Ala

45 50 55

ggc gts cag gtc atc ctt cgc cga atg ctc aag ggc cgc aag tcg ctg 352
Gly Val Gln Val Ile Leu Arg Arg Met Leu Lys Gly Arg Lys Ser Leu
60 65 70 75

agc cat tgaggatcgc gacgcagtcg gcgggaccct catggaagca tcgccctcgc 408

Ser His

ctcggacctt gcctggcgct atttttgcag ggagctgggg agcaggaacg cctcggacct 468
gagtgctctc catattgtgg ggttgaagtc tggatgggag cttgccaagt ccctttttag 528
gctttttaat taggaagcat ttcgaacctg cgcaacagac caaagaacag tacaaagaac 588
atccgtgtac ccagtaccct gactaccgac tacctacaac ccgtccctgc cccatcctga 648
gttcttttga agctgatctc aggcatcgga ttatttcttc tgtaaatatt tcagaatgta 708
tctctccaag atgagagctc attaaaagac aattacaa 746



<210> 51 

<211> 731 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 73..348

<220>
<221> sig_peptide
<222> 73..117

<223> Von Heijne matrix 

score 4.68058603039206 
seq VLAGSLLGPTSRS/AA 



<400> 51 

gttgttccag ggctggagcg gctctgggct ccggaatcgc ccgcagccgg tactgcggga 60 
cccactgcag at atg gct gtc ttg gct gga tcc ctg ttg ggc ccc acg agt 111
Met Ala Val Leu Ala Gly Ser Leu Leu Gly Pro Thr Ser 
-15 -10 -5 



agg tcg gca gcg ttg ctg ggt ggc agg tgg ctc car ccc cgg gcc tgg 159
Arg Ser Ala Ala Leu Leu Gly Gly Arg Trp Leu Gln Pro Arg Ala Trp
        1               5                   10












ctg ggg ttc cca gac gcc tgg ggc ctc ccc acc ccg cag cag gcc cgg 207
Leu Gly Phe Pro Asp Ala Trp Gly Leu Pro Thr Pro Gln Gln Ala Arg
15                  20                  25                  30




ggc aag gct cgc ggg aat gag tat cag ccg agc aat atc aaa cgc aag 255
Gly Lys Ala Arg Gly Asn Glu Tyr Gln Pro Ser Asn Ile Lys Arg Lys
                35                  40                  45






aac aag     ggc tgg gtc cgg     ctg arc acg         ggc gtg     303
Asn Lys His Gly Trp Val Arg Arg Leu Xaa Thr Pro Ala Gly Val Gln
            50                  55                  60








gtc atc ctt     cga atg ctc aag ggc     aag         agc cat 348
Val Ile Leu Arg Arg Met Leu Lys Gly     Lys Ser     Ser His
        65                  70                  75



tgaggatcgc gacgcagtcg gcgggaccct catggaagca tcgccctcgc ctcggacctt 408
gcctggcgct atttttgcag ggagctgggg agcaggaacg cctcggacct gagtgctctc 468
catattgtgg ggttgaagtc tggatgggag cttgccaagt ccctttttag gctttttaat 528
taggaagcat ttcgaacctg cgcaacagac caaagaacag tacaaagaac atccgtgtac 588
ccagtaccct gactaccgac tacctacaac ccgtccctgc cccatcctga gttcttttga 648
agctgatctc aggcatcgga ttatttcttc tgtaaatatt tcagaatgta tctctccaag 708
atgagagctc attaaaagat aat 731

<210> 52

<211> 773 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 22..297

<220>
<221> sig_peptide
<222> 22..66
<223> Von Heijne matrix
score 4.68058603039206
seq VLAGSLLGPTSRS/AA

<220>
<221> polyA_site
<222> 758..773

<220>
<221> polyA_signal
<222> 740..745



<400> 52

actgcgggac ccactgcgga t atg gct gtc ttg gct gga tcc ctg ttg ggc 51
Met Ala Val Leu Ala Gly Ser Leu Leu Gly
-15                 -10


ccc acg agt agg tcg gca gcg ttg ctg ggt ggc agg tgg ctc cag ccc 99
Pro Thr Ser Arg Ser Ala Ala Leu Leu Gly Gly Arg Trp Leu Gln Pro
-5                  1               5                   10


cgg gcc tgg ctg ggg ttc cca gac gcc tgg ggc ctc ccc acc ccg cag 147
Arg Ala Trp Leu Gly Phe Pro Asp Ala Trp Gly Leu Pro Thr Pro Gln
            15                  20                  25


cag     cgg ggc     gct     ggg aat gar tat cag ccg agc aat atc 195
Gln Ala Arg Gly Lys Ala Arg Gly Asn Glu Tyr Gln Pro Ser Asn Ile
        30                  35                  40


aaa cgc aag aac aag cac ggc tgg gtc cgg cgc ctg arc acg ccg gcc 243
Lys Arg Lys Asn Lys His Gly Trp Val Arg Arg Leu Xaa Thr Pro Ala
    45                  50                  55


ggc gtg mrg gtc atc ctt cgc cga atg ctc aag ggc cgc aag tcg ctg 291
Gly Val Xaa Val Ile Leu Arg Arg Met Leu Lys Gly Arg Lys Ser Leu
60                  65                  70                  75



agc cat tgaggatcgc gacgcagtcg gcgggaccct catggaagca tcgccctcgc 347
Ser His

ctcggacctt gcctggcgct atttttgcag ggagctgggg agcaggaacg cctcggacct 407

gagtgctctc catattgtgg gtttgaagtc tggatgggag cttgccaagt ccctttttag 467

gctttttaat taggaagcat ttcgaacctg cgcaacagac caaagaacag tacaaagaac 527

atccgtgtac ccagtaccct gactaccgac tacctacaac ccgtccctgc cccatcctga 587

gttcttttga agctgatctc aggcatcgga ttatttcttc tgtaaatatt tcagaatgta 647

tctctccaag atgagagctc attaaaagat aattacaaag cttatcacat ccaaaagaat 707

tatcaataat tttgaaatat tattaaacgt gtaataaatg ttcaaagttc aaaaaaaaaa 767 

aaaaaa 773 

<210> 53 

<211> 740 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 80..355

<220>
<221> sig_peptide
<222> 80..124

<223> Von Heijne matrix 

score 6.4806443024348 

seq VLAGSLLXPTSRS/AA 

<400> 53 

ataatccttt gttccagggc tggagcggct ctgggctccg gaatcgcccg cagccggtac 60
tgcgggaccc actgcggat atg gct gtc ttg gct gga tcc ctg ttg grc ccc 112
Met Ala Val Leu Ala Gly Ser Leu Leu Xaa Pro
-15 -10 -5 



acg agt agg tcg gca gcg ttg ctg ggt     agg tgg ctc cag ccc cgg 160
Thr Ser Arg Ser Ala Ala Leu Leu Gly Xaa Arg Trp Leu Gln Pro Arg
                1               5                   10








gcc tgg ctg ggg ttc cca gac gcc tgg ggc ctc ccc acc ccg cag cag 208
Ala Trp Leu Gly Phe Pro Asp Ala Trp Gly Leu Pro Thr Pro Gln Gln
        15                  20                  25










gcc cgg ggc aag gct cgc ggg aat gag tat cag ccg agc aac atc aaa 256
Ala Arg Gly Lys Ala Arg Gly Asn Glu Tyr Gln Pro Ser Asn Ile Lys
    30                  35                  40












cgc aag     aag     ggc tgg gtc cgg cgc ctg arc acg ccg gcc ggc 304
Arg Lys Asn Lys His Gly Trp Val Arg Arg Leu Xaa Thr Pro Ala Gly
45                  50                  55                  60




kwr smg ktc atc ctt cgc cga atg ctc aag ggc cgc aag tcg ctg agc 352
Xaa Xaa Xaa Ile Leu Arg Arg Met Leu Lys Gly Arg Lys Ser Leu Ser





cat tgaggatcgc gacgcagtsg scggggaacc ctcatggaag catcgccctc 405 

His 

gcctcggacc ttgcctggcg ctatttttgc agggagctgg ggagcaggaa cgcctcggac 465

ctgagtgctc tccatattgt ggggttgaag tctggatggg agcttgccaa gtcccttttt 525

aggcttttta attaggaagc atttcgaacc tgcgcaacag accaaagaac agtacaaaga 585

acatccgtgt acccagtacc ctgactaccg actacctaca acccgtccct gccccatcct 645

gagttctttt gaagctgatc tcaggcatcg gattatttct tctgtaaata tttcagaatg 705

tatctctcca agatgagagc tcattaaaag acaat 740 

<210> 54 

<211> 638 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 137..397

<220>
<221> sig_peptide
<222> 137..253
<223> Von Heijne matrix
score 4.61343345943784
seq LQGARVLLTGANA/GV

<220>
<221> polyA_site
<222> 623..638

<220>
<221> polyA_signal
<222> 616..621

<400> 54 

ggggtgcggt cggggacccg gcaggaggcg gccgagaaga gaggaccgtg ggggcgttcg 60
cgtggctccc agcccgggac cccacccccg ctggacagtg ggggaaactg aggcctgagc 120
gggcccacac aggacc atg aag gtg ctt ctc ctc aca ggg ctg ggg gcc ctg 172
Met Lys Val Leu Leu Leu Thr Gly Leu Gly Ala Leu
-35 -30
gcc tat tat tgg gat gac aac ttc gac cca gcc agc ctc cag 220
Ala Tyr Tyr Trp Asp Asp Asn Phe Asp Pro Ala Ser Leu Gln
-25 -20 -15

cga gtg ctg ctg aca ggg gcc aac gct ggt gtt ggt gag gag 268
Arg Val Leu Leu Thr Gly Ala Asn Ala Gly Val Gly Glu Glu
-5 1 5

tat cac tac gcg cgt ctg ggc tcc cac ctg gtg ctc act gcc 316
Tyr His Tyr Ala Arg Leu Gly Ser His Leu Val Leu Thr Ala

10 15 20

gag gct ctc ctg cag aag gca cgg tgg ctc acg ctt gta gtc 364
Glu Ala Leu Leu Gln Lys Ala Arg Trp Leu Thr Leu Val Val

25 30 35

ttg gga ggc cga gga aag tgg atc acc tgaggtcagg agttcaagac 417
Leu Gly Gly Arg Gly Lys Trp Ile Thr
40 45
cagcctggcc aacatggcga aacctggtct ctactaagaa gacaaaaatt aggccaggca 477
tggtggctca tgcctgtaat cctagcactt tgggaggcca aggcggatgg atcacttgag 537
gtcaggagtt caaaaccagc ctggccaaca tggtgaaacc ctgtctctac taaaaataca 597 
aaaaataaaa ataaaataaa taaacaaaaa aaaaaaaaaa a 638 

<210> 55 

<211> 577 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 18..392

<220>
<221> sig_peptide
<222> 18..62
<223> Von Heijne matrix
score 4.60228634566274
seq LLTHNLLSSHVRG/VG

<220>
<221> polyA_site
<222> 562..577

<220>
<221> polyA_signal
<222> 545..550

<400> 55 

tttttttgtg cggcgac atg aaa ctg ctt acc cac aat ctg ctg agc tcg 50
Met Lys Leu Leu Thr His Asn Leu Leu Ser Ser
-15                 -10                 -5




cat gtg cgg ggg gtg ggg tcc cgt ggc ttc ccc ctg cgc ctc cag gcc 98
His Val Arg Gly Val Gly Ser Arg Gly Phe Pro Leu Arg Leu Gln Ala
                1               5                   10








acc gag gtc cgt atc tgc cct gtg gaa ttc aac     aac ttc gtg gcg 146
Thr Glu Val Arg Ile Cys Pro Val Glu Phe Asn Pro Asn Phe Val Ala
        15                  20                  25










cgt atr ata cct aaa gtg gag tgg tcg gcg ttc ctg gag gcg gcc gat 194
Arg Xaa Ile Pro Lys Val Glu Trp Ser Ala Phe Leu Glu Ala Ala Asp
    30                  35                  40












aac ttg cgt ctg atc cag gtg ccg aaa ggg ccg gtt gag gga tat gag 242
Asn Leu Arg Leu Ile Gln Val Pro Lys Gly Pro Val Glu Gly Tyr Glu
45                  50                  55                  60

gag aat gag gag ttt ctg agg acc atg cac cac ctg ctg ctg gag gtg 290 

Glu Asn Glu Glu Phe Leu Arg Thr Met His His Leu Leu Leu Glu Val 

65 70 75 

gaa gtg ata gag ggc acc ctg cag tgc ccg gaa tct gga cgt atg ttc 338 
Glu Val Ile Glu Gly Thr Leu Gln Cys Pro Glu Ser Gly Arg Met Phe

80 85 90 

ccc atc agc cgc ggg atc ccm amc atg ctg ctg agt gaa gag gaa act 386
Pro Ile Ser Arg Gly Ile Pro Xaa Met Leu Leu Ser Glu Glu Glu Thr

95 100 105 

gag agt tgattgtgsc aggcgccagt ttttcttgtt atgactgttt atttttgttg 442 
Glu Ser 
110 

atctataccG tgtttccgaa ttctgccgtg tgtatcccca acccttgacc caatgacacc 502 
aaacacagtg tttttgagct cggtattata tatttttttc tcattaaagg tttaaaacca 562 
aaaaaaaaaa aaaaa 577 

<210> 56 

<211> 521 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 82..360

<220>
<221> sig_peptide
<222> 82..147
<223> Von Heijne matrix
score 4.53239781724629
seq FFLLITIALGTKT/ES

<220>
<221> polyA_site
<222> 506..521

<220>
<221> polyA_signal
<222> 486..491

<400> 56 



attttctcaa caattcctca ccgcaggagc ctctgaagct 


CCCi 


iccaggc <: 


;agctctcct 


60 


cccacaacag cttcccacag c atg aag atc tcc gtg gct gcc att ccc ttc 111
Met Lys Ile Ser Val Ala Ala Ile Pro Phe
        -20                 -15




ttc ctc ctc atc acc atc gcc cta ggg acc aag act gaa tcc tcc tca 159
Phe Leu Leu Ile Thr Ile Ala Leu Gly Thr Lys Thr Glu Ser Ser Ser
        -10                 -5                  1




cgg gga cct tac cac ccc tca gag tgc tgc ttc acc tac act acc tac 207
Arg Gly Pro Tyr His Pro Ser Glu Cys Cys Phe Thr Tyr Thr Thr Tyr
5                   10                  15                  20




aag atc ccg cgt cag cgg att atg gat tac tat gag acc aac agc cag 255
Lys Ile Pro Arg Gln Arg Ile Met Asp Tyr Tyr Glu Thr Asn Ser Gln
                25                  30                  35






tgc tcc aag ccc gga att gtc ttc atc acc aaa agg ggc cat tcc gty 303
Cys Ser Lys Pro Gly Ile Val Phe Ile Thr Lys Arg Gly His Ser Val
            40                  45                  50




tgt acc aac     agt gac aag tgg gtc cag gac tat atc aag gac atg 351
Cys Thr Asn Pro Ser Asp Lys Trp Val Gln Asp Tyr Ile Lys Asp Met
        55                  60                  65






gag aac tgagtgaccc agaaggggtg gcgaaggcac agctcagaga 400
Glu
70

cataaagaga agatgccaag gccccctcct ccacccaccg ctaactctca gccccagtca 460

ccctcttgga gcttccctgc tttgaattaa agaccactca tgcccaaaaa aaaaaaaaaa 520 
a 521 

<210> 57 
<211> 588 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 93..308

<220>
<221> sig_peptide
<222> 93..218
<223> Von Heijne matrix
score 4.46006149913644
seq LRLEVAAAPGTPA/QP

<220>
<221> polyA_site
<222> 573..588

<220>
<221> polyA_signal
<222> 558..563

<400> 57 

aaaagaaatt cccgggcccg gccagcctgt gccgctgtta tgaggagaag caaagcccct 60 
tgcagaagca cttgcgggtc tgcagcctac gc atg aat agg ttg gca ggt gtg 113 
Met Asn Arg Leu Ala Gly Val 
-40 



ggc tgg cgg gtg gac tac acc ctg agc tcc agc ctg ctg caa tcc gtg 161
Gly Trp Arg Val Asp Tyr Thr Leu Ser Ser Ser Leu Leu Gln Ser Val
-35                 -30                 -25                 -20




gaa gag ccc atg gtg cac ctg cgg ctg gag gtg gca gct gcc cca ggg 209
Glu Glu Pro Met Val His Leu Arg Leu Glu Val Ala Ala Ala Pro Gly
                -15                 -10                 -5






acc cca gcc cag cct gtt gcc atg tcc ctc tca gca gac aag ttc cag 257
Thr Pro Ala Gln Pro Val Ala Met Ser Leu Ser Ala Asp Lys Phe Gln
            1               5                   10




gtc     ctg gca gaa ctg aag cag gcc cag acc ctg atg agc tcc ctg 305
Val Leu Leu Ala Glu Leu Lys Gln Ala Gln Thr Leu Met Ser Ser Leu
    15                  20                  25



ggc tgaggagaag ggtgttccag gcctgtgtgg agccgccctg cccgtatgga 358 

Gly 

30 

gtcacgccct ctgaactgct cttcgggagg cagccctggt tctaggatgc tgaggccctg 418 

gcccggactc tggcctccca gatccccagc tgcctcactt ctctcttgag aacttggctc 478

agggctcctg aggacctttc ccagcattac cttcccttcc cttgaaaggc aattgttggc 538 

tgttttcata agcaggaaaa ataaacagaa gtacaaaaaa aaaaaaaaaa 588 

<210> 58 

<211> 521 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 113..364

<220>
<221> sig_peptide
<222> 113..172
<223> Von Heijne matrix
score 4.37180298395146
seq SLLLSLPPHQGLT/FS

<220>
<221> polyA_site
<222> 500..521

<400> 58 

ttttttacat ggtgttccca cagctgggag gacacccaca tggtcggcgt gcaggatatt 60 
tcgctggacc ctagaaaagc caccacgacc tgtgggccat gatgctaccc ca atg gct 118

Met Ala 
-20 



gct gct gct gtt cct tct ctt ctt ctt tct ctt cct cct cac cag ggg 166
Ala Ala Ala Val Pro Ser Leu Leu Leu Ser Leu Pro Pro His Gln Gly
            -15                 -10                 -5




ctc act ttc tcc aac aaa ata     cct ttt gga gct     gga gtc ttg 214
Leu Thr Phe Ser Asn Lys Ile Gln Pro Phe Gly Ala Gln Gly Val Leu
        1               5                   10




cat ccg gaa cca gga ctg cga gac tgg ctg ctg cca acg tgc tcc aga 262
His Pro Glu Pro Gly Leu Arg Asp Trp Leu Leu Pro Thr Cys Ser Arg
15                  20                  25                  30




caa ttg cga gtc gca ctg ccg gag aag ggg tcc gag ggc agt ctg tgt 310
Gln Leu Arg Val Ala Leu Pro Glu Lys Gly Ser Glu Gly Ser Leu Cys
                35                  40                  45




caa acg cag ctg cca gct act cca tgc ttc ctg cct tcg aat acg gtg 358
Gln Thr Gln Leu Pro Ala Thr Pro Cys Phe Leu Pro Ser Asn Thr Val
            50                  55                  60

aga acg tgaagtcatg agctgctgct aaggcatgtg gcaaccttga agagaaggtc 414 
Arg Thr 

aagagctacc agccaccaaa agaatgccag cacttcctgt gtctttgctt tggattcatg 474 
agaaatatac gttcctattt gcttcaaaaa aaaaaaaaaa tgcgaaa 521 

<210> 59 

<211> 516 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 83..244

<220>
<221> sig_peptide
<222> 83..139
<223> Von Heijne matrix
score 4.24406450728052
seq LLILSKVLDDTFQ/SV

<220>
<221> polyA_site
<222> 501..516

<400> 59 

actgctatag cgatataact cagatgttct caggacagca agtttacatg tgaatagaga 60
gggagggacg tgttaggaag ac atg aaa ttc aat ttt gta ctt ttg ata ctt 112
Met Lys Phe Asn Phe Val Leu Leu Ile Leu
-15 -10
tcc aaa gtt cta gat gac act ttc caa agt gtc aag aaa tgg tta aat 160
Ser Lys Val Leu Asp Asp Thr Phe Gln Ser Val Lys Lys Trp Leu Asn

tat ttt cag ttt act tta aga aat ggt tta atg tgg cca ggt gcg gtg 208
Tyr Phe Gln Phe Thr Leu Arg Asn Gly Leu Met Trp Pro Gly Ala Val

10 15 20

gct cat gcc tgt aat ccc agc act ggc tca cgc ctg taatcccagc 254
Ala His Ala Cys Asn Pro Ser Thr Gly Ser Arg Leu

25 30 35

actttgggag tccgaggtgg gcagatcatg aggtcaggag atcgagacca tcctggctaa 314
catggtgaaa ccccgtctct actaaaaata caaaaaaatt agccaggcat ggtggcgggc 374
gcctgtagtc ccagctactc gggaggctga ggcaggagaa tggcgtgaac ctgggaggcg 434
gagcttacag tgagccgaga tcgcgccact gcactccagc ctgggcgaca gagcgagact 494
ctgttcaaaa aaaaaaaaaa aa 516 



<210> 60 

<211> 517 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 83..244

<220>
<221> sig_peptide
<222> 83..139
<223> Von Heijne matrix
score 4.24406450728052
seq LLILSKVLDDTFQ/SV

<220>
<221> polyA_site
<222> 502..517



<400> 60 

actgctatag cgatataact cagatgttct caggacagca agtttacatg tgaatagaga 60
gggagggacg tgttaggaag ac atg aaa ttc aat ttt gta ctt ttg ata ctt 112
Met Lys Phe Asn Phe Val Leu Leu Ile Leu
-15 -10
tcc aaa gtt cta gat gac act ttc caa agt gtc aag aaa tgg tta aat 160
Ser Lys Val Leu Asp Asp Thr Phe Gln Ser Val Lys Lys Trp Leu Asn

-5 1 5
tat ttt cag ttt act tta aga aat ggt tta atg tgg cca ggt gcg gtg 208
Tyr Phe Gln Phe Thr Leu Arg Asn Gly Leu Met Trp Pro Gly Ala Val

10 15 20

gct cat gcc tgt aat ccc agc act ggc tca cgc ctg taatcccagc 254
Ala His Ala Cys Asn Pro Ser Thr Gly Ser Arg Leu

25 30 35

actttgggag tccgaggtgg gcagatcatg aggtcaggag atcgagacca tcctggctaa 314
catggtgaaa ccccgtytct actaaaaata caaaaaaatt agccaggcat ggtggcgggc 374
gcctgtagtc ccagctactc gggaggctga ggcaggagaa tggcgtgaac ctgggaggcg 434
gagcttacag tgagccgaga tcgcgccact gcactccagc ctgggcgaca gagcgagact 494
ctgtttcaaa aaaaaaaaaa aaa 517 



<210> 61 

<211> 518 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 84..245

<220>
<221> sig_peptide
<222> 84..140
<223> Von Heijne matrix
score 4.24406450728052
seq LLILSKVLDDTFQ/SV

<220>
<221> polyA_site
<222> 503..518



<400> 61 

aactgctata gcgatataac tcagatgttc tcaggacagc aagtttacat gtgaatagag 60 
agggagggac gtgttaggaa gac atg aaa ttc aat ttt gta ctt ttg ata ctt 113 
Met Lys Phe Asn Phe Val Leu Leu Ile Leu
-15 -10
tcc aaa gtt cta gat gac act ttc caa agt gtc aag aaa tgg tta aat 161
Ser Lys Val Leu Asp Asp Thr Phe Gln Ser Val Lys Lys Trp Leu Asn

-5 1 5
tat ttt cag ttt act tta aga aat ggt tta atg tgg cca ggt gcg gtg 209
Tyr Phe Gln Phe Thr Leu Arg Asn Gly Leu Met Trp Pro Gly Ala Val

10 15 20

gct cat gcc tgt aat ccc agc act ggc tca cgc ctg taatcccagc 255
Ala His Ala Cys Asn Pro Ser Thr Gly Ser Arg Leu

25 30 35

actttgggag tccgaggtgg gcagatcatg aggtcaggag atcgagacca tcctggctaa 315
catggtgaaa ccccgtctct actaaaaata caaaaaaatt agccaggcat ggtggcgggc 375
gcctgtagtc ccagctactc gggaggctga ggcaggagaa tggcgtgaac ctgggaggcg 435
gagcttacag tgagccgaga tcgcgccact gcactccagc ctgggcgaca gagcgagact 495
ctgtttcaaa aaaaaaaaaa aaa 518 



<210> 62 

<211> 548 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 57..329

<220>
<221> sig_peptide
<222> 57..113
<223> Von Heijne matrix
score 4.2427958051416
seq FVLLLSEIVSISA/LS

<220>
<221> polyA_site
<222> 533..548

<220>
<221> polyA_signal
<222> 505..510



<400> 62 

agttgtcttt ggtagttttt ttgcactaac ttcaggaacc agctcatgat ctcagg atg 59

Met

tat gga aaa ata atc ttt gta tta cta ttg tca gaa att gtg agc ata 107

Tyr Gly Lys Ile Ile Phe Val Leu Leu Leu Ser Glu Ile Val Ser Ile

-15 -10 -5

tca gca tta agt acc act gag gtg gca atg cac act tca acc tct tct 155
Ser Ala Leu Ser Thr Thr Glu Val Ala Met His Thr Ser Thr Ser Ser
1 5 10

tca gtc aca aag agt tac atc tca tca cag aca aat gga gaa atg gga 203

Ser Val Thr Lys Ser Tyr Ile Ser Ser Gln Thr Asn Gly Glu Met Gly

15 20 25 30

caa ctt gtc cat cgt ttc act gta cca gct cct gta gtg ata ata ctc 251

Gln Leu Val His Arg Phe Thr Val Pro Ala Pro Val Val Ile Ile Leu

35 40 45

att att ttg tgt gtg atg gct ggt att att gga acg atc ctc tta att 299
Ile Ile Leu Cys Val Met Ala Gly Ile Ile Gly Thr Ile Leu Leu Ile

50 55 60

tct tac agt att cgc cga ctg ata aag gca tgaggatgtg gcctgcatgc 349
Ser Tyr Ser Ile Arg Arg Leu Ile Lys Ala

65 70
tgcctgatct tgcctagaac cggctgcacc tgctgttctc ttgtttatgc aaactggctg 409
cacctgctat tcctttgctt atgcccctac ccctggctat cctaattccc tgttctcctg 469
cctcactatt actgtattct ctacttctaa ataaaaataa aacaaaatac aaaccgttaa 529 
aataaaaaaa aaaaaaaaa 548 



<210> 63
<211> 722
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 331..672

<220>
<221> sig_peptide
<222> 331..426
<223> Von Heijne matrix
score 4.21873806574589
seq LLGSGLFVFSLTA/FN

<220>
<221> polyA_site
<222> 707..722

<220>
<221> polyA_signal
<222> 688..693

<400> 63

tttttcctcg tcctctcccg acagagctga cgtgtcctgg gttccaccgg gagcgggcat 60
ttccaccgga cgggagggtt cggggtgtcc ggggctgggg aatacgtagg ggttgccgcg 120
cggtgtgggg agttggggcg tgtggctgca gtcccgggag ttcttggagg gggtcggccc 180
accgagcttc cggaccggct gatctgcccg tagcttgccg gagggagggc ggagcctgac 240
tctccgtccc ttctcccatc ccctccagtg gtgggtacgg gcacctcgct ggcgctctcc 300
tccctcctgt ccctgctgct ctttgctggg atg cag atg tac agc cgt cag ctg 354
Met Gln Met Tyr Ser Arg Gln Leu
-30 -25
gcc tcc acc gag tgg ctc acc atc cag ggc ggc ctg ctt ggt tcg ggt 402
Ala Ser Thr Glu Trp Leu Thr Ile Gln Gly Gly Leu Leu Gly Ser Gly
-20 -15 -10








tte 


gtg 


tte 


teg 


etc 


act 


gcc 


tte 


aat 


aat 


ctg 


gag 




ctt 


gtc 


450 


Leu 


Phe 


Val 


Phe 
-5 


Ser 


Leu 


Thr 


Ala 


Phe 
1 


Asn 


Asn 


Leu 


Glu 
5 


Asn 


Leu 


Val 




ttt ggc aaa gga ttc caa gca aag atc ttc cct gag att ctc ctg tgc 498
Phe Gly Lys Gly Phe Gln Ala Lys Ile Phe Pro Glu Ile Leu Leu Cys
10 15 20












cte 


ctg 


ttg 


get 


etc 


ttt 


gca 


tct 


ggc 


etc 


ate 


cac 


cga 


gte 


tgt 




546 


Leu 


Leu 


Leu 


Ala 


Leu 


Phe 


Ala 


Ser 


Gly 


Leu 


He 


His 




Val 


Cys 


Val 




25 










30 










35 










40 






ace 


tgc 


tte 


ate 


tte 


tee 


atg 


gtt 


ggt 


ctg 


tac 


tac 


ate 


aac 


aag 


594 
Thr Thr Cys Phe Ile Phe Ser Met Val Gly Leu Tyr Tyr Ile Asn Lys
45 50 55
aty tcc tcc acc ctg tac cag gca gca gct cca gtc ctc aca cca gcc 642
Ile Ser Ser Thr Leu Tyr Gln Ala Ala Ala Pro Val Leu Thr Pro Ala
60 65 70
aag gtc aca ggc aag agc aag aag aga aac tgaccctgaa tgttcaataa 692
Lys Val Thr Gly Lys Ser Lys Lys Arg Asn
75 80
agttgattct ttgcaaaaaa aaaaaaaaaa 722



<210> 64
<211> 479
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 96..320

<220>
<221> sig_peptide
<222> 96..200
<223> Von Heijne matrix
score 4.14350049367799
seq MAVAFVLSLGVAA/LY

<220>
<221> polyA_site
<222> 464..479

<220>
<221> polyA_signal
<222> 437..442

<400> 64

attctgcgcc tgcgcgcgtc tacagcacgg ttcgtttttc ctttagtcag gaaggacgtt 60
ggtgttgagc atacgtatca aggacagtaa ytacc atg gct ccc gaa gtt ttg 113
Met Ala Pro Glu Val Leu
-35 -30
cca aaa cct cgg atg cgt ggc ctt ctg gcc agg cgt ctg cga aat cat 161
Pro Lys Pro Arg Met Arg Gly Leu Leu Ala Arg Arg Leu Arg Asn His
-25 -20 -15
atg gct gta gca ttc gtg cta tcc ctg ggg gtt gca gct ttg tat aag 209
Met Ala Val Ala Phe Val Leu Ser Leu Gly Val Ala Ala Leu Tyr Lys
-10 -5 1
ttt cgt gtg gct gat caa aga aag aac gca tac gca gat ttc tac aga 257
Phe Arg Val Ala Asp Gln Arg Lys Asn Ala Tyr Ala Asp Phe Tyr Arg
5 10 15
aac tac gat gtc atg aaa gat ttt gag gag atg agg aag gct ggt atc 305
Asn Tyr Asp Val Met Lys Asp Phe Glu Glu Met Arg Lys Ala Gly Ile
20 25 30 35
ttt cag agt gta aag taatcttgga atataaagaa tttcttcagg ttgaattacc 360
Phe Gln Ser Val Lys
40
tagaagtttg tcactgactt gtgttcctga actatgacac atgaatatgt gggctaagaa 420
atagttcctc ttgataaata aacaattaac aaatactttg gacaaaaaaa aaaaaaaaa 479



<210> 65
<211> 1627
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 90..413

<220>
<221> sig_peptide
<222> 90..209
<223> Von Heijne matrix
score 4.06476262296284
seq VTAPSMVAPVTFA/SI

<220>
<221> polyA_site
<222> 1612..1627

<220>
<221> polyA_signal
<222> 1577..1582

<400> 65

aatctctgcg aaagcttcca ggttcgctca tcgtaggtca ctggaaaaaa ttgtagcatt 60
gctaaagagt atttcagaga ai iattttgc atg caa aaa ttt agg aag atg agt 113
Met Gln Lys Phe Arg Lys Met Ser
-40 -35








gaa act cat cat tct gta atc tct gtg aaa gct tcc agt ccc tgg cta 161
Glu Thr His His Ser Val Ile Ser Val Lys Ala Ser Ser Pro Trp Leu
-30 -25 -20
tct tct tca gtg act gct cca tcc atg gta gcc cca gtc act ttt gca 209
Ser Ser Ser Val Thr Ala Pro Ser Met Val Ala Pro Val Thr Phe Ala
-15 -10 -5
tct att gta gaa gaa gaa cta caa caa gaa gca gct ctt att aga agt 257
Ser Ile Val Glu Glu Glu Leu Gln Gln Glu Ala Ala Leu Ile Arg Ser
1 5 10 15
cga gaa aaa ccg ttg gct ctg att cag att gag gag cat gcc ata caa 305
Arg Glu Lys Pro Leu Ala Leu Ile Gln Ile Glu Glu His Ala Ile Gln
20 25 30
gat tta ttg gtt ttc tat gag gca ttt ggc aac cct gaa gag ttt gtc 353
Asp Leu Leu Val Phe Tyr Glu Ala Phe Gly Asn Pro Glu Glu Phe Val
35 40 45
att gtt gaa agg aca ccg cag gga cca ctg gca gta cct atg tgg aat 401
Ile Val Glu Arg Thr Pro Gln Gly Pro Leu Ala Val Pro Met Trp Asn
50 55 60
aag cat gga tgc tagttcactg tggagttgag atgcatttta cataattatg 453
Lys His Gly Cys
65
agtttgttca tataaagaaa agctgtggaa aagagtctta gagattttgt aatatcattc 513
taaatagatt aagaaaagat ataatttctt tactgcagtt aaatcatata atgtttgtat 573
gattaaaaat aaatttctca gaattgtgat tttagtaact ttatataaaa tgtgtgagac 633
aaaaacttat taaggttaaa tagaattgtt tcttstgaat ratctaacaa aggaraatat 693
aagtgattga atcataagat ataagggggg taaagtatta aaaataactt ttttgtttga 753
taacttgaga atttagaaga ttttgccaag tatgtgttgt tgcttgactt cttaaatatg 813
gcattgatga atttaaagta ggagcatcag ttattacttc tgattcatta atggccagaa 873
ttttgtgttt ggtgtaatag ttgtgtcacc attcttgttg ctttttaaaa atcaggctaa 933
tcatgtggtc catgtctctt caaagcttga cctgcacaaa tgccatattt cyatttggac 993
cacatattct ccattttgca ttgagcagta gagtacagtg gaaagggaat aagaatactg 1053
attattctga acagtttagt cccaagagaa tagcgtttta aaaaagaaaa acaagatttg 1113
gagtcattgt gggttatttt tggtgggatg gaggatctta aaaatgccta attgtgagag 1173
aatcaattgc tgaaagtgtt aaaatttctg aaaataaatg cttaattaca tatacaggaa 1233
ttaaatagtt tggaagaggg ttggattatc attaccttta caatactgta taatcagaag 1293
ttctctgaac ctcaattgta tatctagaca taaaaattgt tttctgtata ggatgttgtt 1353
tggtttgttt ctgagtgttt aaattttgca aaaacaaatg ttaaatttgt gcttcagtac 1413
ctagataaat tggaaaggtt aatgttctag tttctggaag gtaagcctgg gagacacata 1473
agcaattcac tgctataatt tagttgatgt aaaatgacgg aaactggctc aatatgtcag 1533
gtttaactct gcccaaaagc agcagacatg taagcagatg tgcaataaaa aatgatcttg 1593
atccatttca aaaaaaagaa aaaaaaaaaa aaaa 1627

<210> 66
<211> 555
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 9..389

<220>
<221> sig_peptide
<222> 9..149
<223> Von Heijne matrix
score 3.92635967755835
seq LSSDLQQVGGASA/RI

<220>
<221> polyA_site
<222> 540..555

<220>
<221> polyA_signal
<222> 512..517

<400> 66

tgggagtg atg ttc acg cct ctg aca gtg aaa tac gcg tac tac gac act 50 
Met Phe Thr Pro Leu Thr Val Lys Tyr Ala Tyr Tyr Asp Thr
-45 -40 -35 



gaa 


cgc 




gga 


gtt 


gac 


ctg 


ate 


atg 


aag 


acc 


tgc 


ttt 


age 


ccc 


aac 


98 


Glu 


Arg 


He 


Gly 
-30 


Val 


Asp 


Leu 


He 


Met 
-25 


Lys 


Thr 


Cys 


Phe 


Ser 
-20 


Pro 


Asn 




aga 


gtg 


att 


gga 


etc 


tea 


agt 


gac 


ttg 


cag 


caa 


gta 


gga 


ggg 


gca 


tea 


146 


Arg 


Val 


He 
-15 


Gly 


Leu 


Ser 


Ser 


Asp 
-10 


Leu 


Gin 


Gin 


Val 


Gly 
-5 


Gly 


Ala 


Ser 






cgc 


ate 


cag 


gat 




ctg 


agt 




gtg 


ttg 




tat 


gca 


gag 




194 


Ala 


Arg 
1 


He 


Gin 


Asp 


Ala 
5 


Leu 


Ser 


Thr 


Val 


Leu 
10 


Gin 


Tyr 


Ala 


Glu 


Asp 
15 




gta 


ctg 


tot 


gga 


aag 


gtg 


tea 


get 


gae 


aat 


act 


gtg 


gge 


cgc 


tte 


ctg 


242 


Val 


Leu 


Ser 


Gly 


Lys 
20 


Val 


Ser 


Ala 


Asp 


Asn 
25 


Thr 


Val 


Gly 


Arg 


Phe 
30 


Leu 




atg 


age 


ctg 


gtt 


aac 


caa 


gta 


ccg 


aaa 


ata 


gtt 


cec 


gat 


gae 


ttt 


gag 


290 


Met 


Ser 


Leu 


Val 
35 


Asn 


Gin 


Val 


Pro 


Lys 
40 


He 


Val 


Pro 


Asp 


Asp 
45 


Phe 


Glu 




acc 


atg 


etc 


aac 


age 


aae 


ate 


aat 


gae 


ett 


ttg 


atg 


gtg 


ace 


tac 


ctg 


338 


Thr 


Met 


Leu 
50 


Asn 


Ser 


Asn 


He 


Asn 
55 


Asp 


Leu 


Leu 


Met 


Val 
60 


Thr 


Tyr 


Leu 




gee 




etc 


aca 


eag 


tea 


cag 


att 


gca 


etc 


aat 


gaa 


aaa 


ctt 


gta 


aac 


386 


Ala 


Asn 
65 


Leu 


Thr 


Gin 


Ser 


Gin 
70 


He 


Ala 


Leu 


Asn 


Glu 
75 


Lys 


Leu 


Val 


Asn 





ctg tgaatggacc ccaagcagta cacttgctgg tctaggtatt aaccccagga 439
Leu
80
ctcagaagtg aaggagaaat gggttttttg tggtcttgag tcacactgag atagtcagtt 499
gtgtgtgact ctaataaacg gagcctacct tttgtaaatt aaaaaaaaaa aaaaaa 555



<210> 67
<211> 422
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 24..278

<220>
<221> sig_peptide
<222> 24..119
<223> Von Heijne matrix
score 3.85301646018147
seq LAHIQALISGIEA/QL

<220>
<221> polyA_site
<222> 407..422

<220>
<221> polyA_signal
<222> 384..389

<400> 67

ttgagctgca gtcacagctg agc atg aaa gct gcc ttg gaa gac aca ctg gca 53
Met Lys Ala Ala Leu Glu Asp Thr Leu Ala
-30 -25
gaa acg gag gcg cgc ttt gga gcc cag ctg gcg cat atc cag gcg ctg 101
Glu Thr Glu Ala Arg Phe Gly Ala Gln Leu Ala His Ile Gln Ala Leu
-20 -15 -10
atc agc ggt att gaa gcc cag ctg ggc gat gtg cga gct gat agt gag 149
Ile Ser Gly Ile Glu Ala Gln Leu Gly Asp Val Arg Ala Asp Ser Glu
-5 1 5 10
cgg cag aat cag gag tac cag cgg ctc atg gac atc aag tcg cgg ctg 197
Arg Gln Asn Gln Glu Tyr Gln Arg Leu Met Asp Ile Lys Ser Arg Leu
15 20 25
gag cag gag att gcc acc tac cgc agc ctg ctc gag gga cag gaa gat 245
Glu Gln Glu Ile Ala Thr Tyr Arg Ser Leu Leu Glu Gly Gln Glu Asp
30 35 40
cac tac aac aat ttg tct gcc tcc aag gtc ctc tgaggcagca ggctctgggg 298
His Tyr Asn Asn Leu Ser Ala Ser Lys Val Leu
45 50
cttctgctgt cctttggagg gtgtcttctg ggtagaggga tgggaaggaa gggaccctta 358
cccccggctc ttctcctgac ctgccaataa aaatttatgg tccaaggcaa aaaaaaaaaa 418
aaaa 422

<210> 68
<211> 630
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 232..534

<220>
<221> sig_peptide
<222> 232..306
<223> Von Heijne matrix
score 3.78142478107177
seq AKTCLVLCSRVLS/VI

<220>
<221> polyA_site
<222> 615..630

<220>
<221> polyA_signal
<222> 595..600

<400> 68

tatcactgtt acgaaccaag gatttacaga tcactggcaa aaattctgag aactttcaca 60 
ccagtatact gtccaagccc attaagtggc atcacacctc tcttttatgt agctcagaca 120 
agacagtcta atatcttcaa aatactactg caatatggaa tcttagaaag agaaaaaaac 180 
cctatcaaca ttgtcttaac aatagtactc tacccttcga gagtaagagt a atg gtt 237 

Met Val 
-25 



gat 


cgt 


gaa 


ttg 


get 




ate 


eat 


gaa 




gee 




aca 


tgt 


ttg 


gta 


285 












Asp 








Asp 




L^S 


















-20 










-15 










-10 








eta 


tgt 




aga 


gtg 


ctt 


tct 




att 


tea 


gtc 


aag 


gaa 




aag 




333 


Leu 


Cys 


Ser 


Arg 


Val 


Leu 


Ser 


Val 


He 


Ser 


Val 


Lys 
5 


Glu 


He 


Lys 


Thr 




cag 


ctg 


-5 
agt 


tta 


gga 


aga 


eat 


1 

eca 


att 


att 


tea 


aat 


tgg 


ttt 


gat 


tae 


381 


Gin 


Leu 


Ser 


Leu 


Gly 




His 


Pro 


He 


He 


Ser 


Asn 


Trp 


Phe 


Asp 


Tyr 




10 










15^ 










20 










25 




att 


cct 


tea 


aea 


aga 


tae 


aaa 


gat 


eea 


tgt 


gaa 


eta 


tta 


eat 


ett 


tge 


429 


He 


Pro 


Ser 


Thr 


Arg 


Tyr 


Lys 


Asp 


Pro 


Cys 


Glu 


Leu 


Leu 


His 


Leu 


Cys 












30 










35 










40 






aga 


eta 


aee 


ate 


agg 


aat 


eaa 


eta 


tta 


aee 


aae 


aat 


atg 


ete 


eea 


gat 


477 


Arg 


Leu 


Thr 


He 


Arg 


Asn 


Gin 


Leu 


Leu 


Thr 


Asn 


Asn 


Met 


Leu 


Pro 


Asp 










45 










50 










55 








gga 


ata 


ttt 


tea 


ctt 


eta 


att 


eet 


get 


cgt 


eta 




aac 


tat 


etg 


aat 


525 


Gly 


He 


Phe 


Ser 


Leu 


Leu 


He 


Pro 


Ala 


Arg 


Leu 


Gin 




Tyr 


Leu 


Asn 





60 55 70 



tta gaa atc taacatacgt cagtgtccta agttccttaa caatgcttac 574
Leu Glu Ile
75
caatgtatgg cttagaagtt aataaaaatt cacttcatgc aaaaaaaaaa aaaaat 630

<210> 69
<211> 730
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 80..283

<220>
<221> sig_peptide
<222> 80..220
<223> Von Heijne matrix
score 3.72491712201476
seq PLLSLHSRGGSSS/ES

<220>
<221> polyA_site
<222> 715..730

<220>
<221> polyA_signal
<222> 697..702

<400> 69

attttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 60
ttcggacccc agcggccgg atg gtg aaa tcc tcc ctg cag cgg atc ctc aat 112
Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn
-45 -40
agc cac tgc ttc gcc aga gag aag gaa ggg gat aaa ccc agc gcc acc 160
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr
-35 -30 -25
atc cac gcc agc cgc acc atg ccg ctc cta agc ctg cac agc cgc ggc 208
Ile His Ala Ser Arg Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly
-20 -15 -10 -5
ggc agc agc agt gag agt tcc agg gtc tcc ctc cac tgc tgt agt aac 256
Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn
1 5 10
ccg ggt ccg ggg cct cgg tgg tgc tcc tgatgcccct cacccacccc 303
Pro Gly Pro Gly Pro Arg Trp Cys Ser
15 20
tgaagatccc aggtgggcga ggaggaggag tagggccgcc tcggggctgg gcatccggcc 363
cctggggcca ccccttgtca gccgggtggg taggaaccgt agactcgctc atctcgcctg 423
ggtttgtccg catgttgtaa tcgtgcaaat aaacgctcac tccgaattag cggtgtattt 483
cttgaagttt aatattgtgt ttgtgatact gaagtatttg ctttaattct aaataaaaat 543
ttatatttta cttttttatt gctggtttaa gatgattcag attatccttg tactttgagg 603
agaagtttct tatttggagt cttttggaaa cagtcttagt cttttaactt ggaaagatga 663
ggtattaatc ccctccattg ctctccaaaa gccaataaag tgattacacc caaaaaaaaa 723
aaaaaaa 730



<210> 70
<211> 730
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 80..283

<220>
<221> sig_peptide
<222> 80..220
<223> Von Heijne matrix
score 3.72491712201476
seq PLLSLHSRGGSSS/ES

<220>
<221> polyA_site
<222> 715..730

<220>
<221> polyA_signal
<222> 697..702

<400> 70

attttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 60
ttcggacccc agcggccgg atg gtg aaa tcc tcc ctg cag cgg atc ctc aat 112
Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn
-45 -40



age 


cae 


tgc 


tte 


gee 


aga 


gag 


aag 


gaa 


ggg gat 




eee age 


gee 


aee 


160 


Ser 


His 
-35 


Cys 


Phe 


Ala 




Glu 
-30 




Glu 


Gly Asp 


-25 


Pro Ser 


Ala 


Thr 




ate 


cac 


gcc 


age 


cgc 


acc 


atg 


ccg 


etc 


eta age 


etg 


cac age 


cgc 


gge 


208 


He 


His 


Ala 


Ser 


Arg 


Thr 


Met 


Pro 


Leu 


Leu Ser 




His Ser 


Arg 


Gly 




-20 










-15 








-10 








-5 




gge 


age 


age 


agt 


gag 


agt 


tec 


agg 


gte 


tec etc 


cac 


tgc tgt 


agt 


aac 


256 


Gly 


Ser 


Ser 


Ser 


Glu 


Ser 


Ser 


Arg 


Val 


Ser Leu 


His 


Cys Cys 


Ser 


Asn 










1 








5 






10 








eeg 


ggt 


eeg 


ggg 


cct 


egg 


tgg 


tgc 


tec 


tgatgeeeet eaeeeaeeee 




303 


Pro 


Gly 


Pro 
15 


Gly 


Pro 




Trp 


Cys 

20 


Ser 















tgaagatccc aggtgggcga ggaggaggag tagggccgcc tcggggctgg gcatccggcc 363 
cctggggcca ccccttgtca gccgggtggg taggaaccgt agactcgctc atctcgcctg 423 
ggtttgtccg catgttgtaa tcgtgcaaat aaacgctcac tccgaattag cggtgtattt 483 
gaagtatttg ctttaattct aaataaaaat 543
ttatatttta cttttttatt gctggtttaa gatgattcag attatccttg tactttgagg 603
agaagtttct tatttggagt cttttggaaa cagtcttagt cttttaactt ggaaagatga 663
ggtattaatc ccctccattg ctctccaaaa gccaataaag tgattacacc caaaaaaaaa 723
aaaaaaa 730



<210> 71
<211> 863
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 80..283

<220>
<221> sig_peptide
<222> 80..220
<223> Von Heijne matrix
score 3.72491712201476
seq PLLSLHSRGGSSS/ES

<220>
<221> polyA_site
<222> 844..863

<220>
<221> polyA_signal
<222> 827..832

<400> 71

attttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 60
ttcggacccc agcggccgg atg gtg aaa tcc tcc ctg cag cgg atc ctc aat 112
Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn
-45 -40
agc cac tgc ttc gcc aga gag aag gaa ggg gat aaa ccc agc gcc acc 160
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr
-35 -30 -25
atc cac gcc agc cgc acc atg ccg ctc cta agc ctg cac agc cgc ggc 208
Ile His Ala Ser Arg Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly
-20 -15 -10 -5
ggc agc agc agt gag agt tcc agg gtc tcc ctc cac tgc tgt agt aac 256
Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn
1 5 10
ccg ggt ccg ggg cct cgg tgg tgc tcc tgatgcccct cacccacccc 303
Pro Gly Pro Gly Pro Arg Trp Cys Ser
15 20
tgaagatccc aggtgggcga gggaatagtc agagggatca caatctttca gctaacttat 363
tctactccga tgatcggctg aatgtaacag aggaactaac gtccaacgac aagacgagga 423
ttctcaacgt ctcagcctct acatcgagat cccgggcggc gcgctgcccg aggggagcaa 483
ggacagcttt gcagttctcc tggagttcgc tgaggagcag ctgcgagccg accatgtctt 543
catttgcttc cacaagaacc gcgaggacag agccgccttg ctccgaacct tcagcttttt 603
gggctttgag attgtgagac cggggcatcc ccttgtcccc aagagacccg acgcttgctt 663
catggcctac acgttcgaga gagagtcttc gggagaggag gaggagtagg gccgcctcgg 723
ggctgggcat ccggcccctg gggccacccc ttgtcagccg ggtgggtagg aaccgtagac 783
tcgctcatct cgcctgggtt tgtccgcatg ttgtaatcgt gcaaataaac gctcactccg 843
aattaaaaaa aaaaaaaaaa 863



<210> 72
<211> 730
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 80..283

<220>
<221> sig_peptide
<222> 80..220
<223> Von Heijne matrix
score 3.72491712201476
seq PLLSLHSRGGSSS/ES

<220>
<221> polyA_site
<222> 715..730

<220>
<221> polyA_signal
<222> 697..702

<400> 72

attttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 60 
ttcggacccc agcggccgg atg gtg aaa tcc tcc ctg cag cgg atc ctc aat 112
Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn
-45 -40 



age 


cac 


tgc 


tte 


gee 


aga 


gag 


aag 


gaa 


ggg gat 


aaa 


ccc age 


gcc 


acc 


160 


Ser 


His 
-35 


Cys 


Phe 


Ala 


Arg 


Glu 

-30 


Lys 


Glu 


Gly Asp 


Lys 
-25 


Pro Ser 


Ala 


Thr 




ate 


cac 


gcc 


age 


cgc 




atg 


ecg 


etc 


eta age 


ctg 


cac age 


cgc 


ggc 


208 


He 


His 


Ala 


Ser 


Arg 


Thr 


Met 


Pro 


Leu 


Leu Ser 


Leu 


His Ser 


Arg 


Gly 




-20 










-15 








-10 








-5 




ggc 


age 


age 


agt 


gag 


agt 


tec 


agg 


gtc 


tec etc 


cay 


tgy tgt 


agt 


aas 


256 


Gly 


Ser 


Ser 


Ser 


Glu 
1 


Ser 


Ser 




Val 
5 


Ser Leu 


His 


Cys Cys 
10 


Ser 


Xaa 




ecg 


ggt 


ecg 


ggg 


cct 


egg 


tgg 


tgc 


tec 


tgatgcccct eacecaccce 




303 


Pro 


Gly 


Pro 


Gly Pro 


Arg 


Trp 


Cys 


Ser 


















15 










20 

















tgaagatccc aggtgggcga ggaggaggag tagggccgcc tcggggctgg gcatccggcc 363
cctggggcca ccccttgtca gccgggtggg taggaaccgt agactcgctc atctcgcctg 423
ggtttgtccg catgttgtaa tcgtgcaaat aaacgctcac tccgaattag cggtgtattt 483
cttgaagttt aatattgtgt ttgtgatact gaagtatttg ctttaattct aaataaaaat 543
ttatatttta cttttttatt gctggtttaa gatgattcag attatccttg tactttgagg 603
agaagtttct tatttggagt cttttggaaa cagtcttagt cttttaactt ggaaagatga 663
ggtattaatc ccctccattg ctctccaaaa gccaataaag tgattacacc caaaaaaaaa 723
aaaaaaa 730

<210> 73
<211> 879
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 9..395

<220>
<221> sig_peptide
<222> 9..56
<223> Von Heijne matrix
score 3.61791418904325
seq LLLLLRALRRGPG/PG

<220>
<221> polyA_site
<222> 864..879

<400> 73

aggccaac atg gcc gtg ctg ctg ctg ctg ctc cgt gcc ctc cgc cgg ggt 50
Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly 
-15 -10 -5 



cca 


gge 


ceg 


ggt 




egg 


ceg 


ctg 


tgg 


gge 


cea 


gge 


ceg 


gee 


tgg 


agt 


98 


Pro 


Gly 


Pro 
1 


Gly 


Pro 


Arg 


Pro 
5 


Leu 


Trp 


Gly 


Pro 


Gly 
10 


Pro 


Ala 


Trp 


Ser 




cca 


ggg 


ttc 


ccc 


gcc 


agg 


ccc 


ggg 


agg 


ggg 


egg 


ccg 


tac 


atg 


gcc 


age 


146 


Pro 


Gly 


Phe 


Pro 


Ala 


Arg 


Pro 


Gly 


Arg 


Gly 


Arg 


Pro 


Tyr 


Met 


Ala 


Ser 




15 










20 










25 










30 




agg 


cct 


ccg 


ggg 


gac 


etc 


gcc 


gag 


get 


gga 


gge 


cga 


get 


ctg 


cag 


age 


194 


Arg 


Pro 


Pro 


Gly 


Asp 
35 


Leu 


Ala 


Glu 


Ala 


Gly 
40 


Gly 


Arg 


Ala 


Leu 


Gin 
45 


Ser 




tta 


caa 


ttg 


aga 


ctg 


eta 


acc 


cct 


acc 


ttt 


gaa 


ggg 


ate 


aac 


gga 


ttg 


242 


Leu 


Gin 


Leu 


Arg 

50 


Leu 


Leu 


Thr 


Pro 


Thr 
55 


Phe 


Glu 


Gly 


lie 


Asn 

60 


Gly 


Leu 




ttg 


ttg 


aaa 


caa 


eat 


tta 


gtt 


cag 


aat 


cca 


gte 


aga 


etc 


tgg 


caa 


ctt 


290 


Leu 


Leu 


Lys 
65 


Gin 


His 


Leu 


Val 


Gin 
70 


Asn 


Pro 


Val 


Arg 


Leu 
75 


Trp 


Gin 


Leu 




tta 


ggt 


ggt 


act 


ttc 


twt 


ttt 


aac 


acc 


tea 


agg 


ttg 


aag 


cak 


aag 


aat 


338 


Leu 


Gly 

80 


Gly 


Thr 


Phe 


Xaa 


Phe 

85 


Asn 


Thr 


Ser 


Arg 


Leu 
90 


Lys 


Xaa 


Lys 


Asn 




aag 


gag 


aag 


gat 


aag 


teg 


aag 


ggg 


aag 


gcg 




gaa 


gag 


gac 


gaa 


ggt 


386 


Lys 


Glu 


Lys 


Asp 


Lys 


Ser 


Lys 


Gly 


Lys 


Ala 


Pro 


Glu 


Glu 


Asp 


Glu 


Gly 




95 










100 










105 










110 





ata ttc atc tgatgttctt cagtcagtag ctgcctctgg atgtctttac 435
Ile Phe Ile
atttctgttt tccttttagc aaggtgaaac cagtctggaa aatggggaga tgggccgggt 495
gcagtggctc acacttgtaa tcgaaacgct ttgggaggcc caggtggaag gatcacttga 555
ggcctatacc acatagcgag accctgtctc actgcaaatt aaaaggctgg gcgtggtggc 615
tcacacctgt aatcccagca ctttgggagg ctgaggcagg cggatcacct gcaccctggc 675
caacatggtg aaaccccgtc tttactaaaa atagaaaatt agccgggcgt gatggcacac 735
gcctgtaatc ccagctactc gggaggctga ggcaggagaa ttgcttgaac ctgggaggtg 795
gaggttgctg tgagtggaga tcatgccatt gcactccagc ctgagcaaca agagcaaaac 855
tccatctcaa aaaaaaaaaa aaaa 879

<210> 74
<211> 879
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 9..395

<220>
<221> sig_peptide
<222> 9..56
<223> Von Heijne matrix
score 3.61791418904325
seq LLLLLRALRRGPG/PG

<220>
<221> polyA_site
<222> 864..879

<400> 74

aggccaac atg gcc gtg ctg ctg ctg ctg ctc cgt gcc ctc cgc cgg ggt 50
Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly
-15 -10 -5
cca ggc ccg ggt cct cgg ccg ctg tgg ggc cca ggc ccg gcc tgg agt 98
Pro Gly Pro Gly Pro Arg Pro Leu Trp Gly Pro Gly Pro Ala Trp Ser
1 5 10




cca 


ggg 


ttc 


ccc 


gcc 


agg 


ccc 


ggg 


agg 


ggg 


egg 


ccg 




atg gcc age 


146 


Pro 


Gly 


Phe 


Pro 


Ala 


Arg 


Pro 


Gly 


Arg 


Gly 


Arg 


Pro 


Tyr 


Met Ala Ser 




15 










20 










25 






30 




agg 


cct 


ccg 


ggg 


gac 


etc 


gcc 


gag 


get 


gga 


ggc 


cga 


get 


ctg eag age 


194 


Arg 


Pro 


Pro 


Gly 


Asp 


Leu 


Ala 


Glu 


Ala 


Gly 


Gly 




Ala 


Leu Gin Ser 












35 










40 








45 




tta 


caa 


ttg 


aga 


ctg 


Ota 


acc 


cct 


acc 


ttt 


gaa 


ggg 


ate 


aac gga ttg 


242 


Leu 


Gin 


Leu 


Arg 


Leu 


Leu 


Thr 


Pro 


Thr 


Phe 


Glu 


Gly He Asn Gly Leu 










50 










55 










60 




ttg 


ttg 


aaa 


caa 


cat 


tta 


gtt 


cag 


aat 


cca 


gtc 


aga 


etc 


tgg caa ett 


290 


Leu 


Leu 


Lys 


Gin 


His 


Leu 


Val 


Gin 


Asn 


Pro 


Val 


Arg 


Leu 


Trp Gin Leu 








65 










70 










75 






tta 


ggt 


ggt 


act 


ttc 


tat 


ttt 


aac 


acc 


tea 


agg 


ttg 


aag 


eag aag aat 


338 


Leu 


Gly 


Gly 


Thr 


Phe 


Tyr 


Phe 


Asn 


Thr 


Ser 


Arg 


Leu 


Lys 


Gin Lys Asn 






80 










85 










90 








aag 


gag 


aag 


gat 


aag 


teg 


aag 


ggg 


aag 


gcg 


cct 


gaa 


gag 


gac gaa ggt 


386 


Lys 


Glu 


Lys 


Asp 


Lys 


Ser 


Lys 


Gly 


Lys 


Ala 


Pro 


Glu 


Glu 


Asp Glu Gly 




95 










100 










105 






110 





ata ttc atc tgatgttctt cagtcagtag ctgcctctgg atgtctttac 435
Ile Phe Ile
rtttctgttt wccttttagc aaggtgaaac cagtctggam aatggggaga tgggccgggt 495
gcagtggctc acacttgtaa tcgaaacgct ttgggaggcc caggtggaag gatcacttga 555
ggcctatacc acatagctag accctgtctc actgcaaatt aaaaggctgg gcgtggtggc 615
tcacacctgt aatcccagca ctttgggagg ctgaggcagg cggatcacct gcaccctggc 675
caacatggtg aaaccccgtc tttactaaaa atagaaaatt agccgggcgt gatggcacac 735
gcctgtaatc ccagctactc gggaggctga ggcaggagaa ttgcttgaac ctgggaggtg 795
gaggttgctg tgagtggaga tcatgccatt gcactccagc ctgagcaaca agagcaaaac 855
tccatcccaa aaaaaaaaaa aaaa 879

<210> 75
<211> 464
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 16..342

<220>
<221> sig_peptide
<222> 16..114
<223> Von Heijne matrix
score 3.66786754931606
seq LAVFQMLKSMCAG/QR

<220>
<221> polyA_site
<222> 446..464

<220>
<221> polyA_signal
<222> 412..417

<400> 75
gaatgcgcct gcgtt atg gcg gcc tcc ggc gcc cca agg atc ctg gtg gac 51
Met Ala Ala Ser Gly Ala Pro Arg Ile Leu Val Asp
-30 -25
ctg ctg aag ctg aac gtg gcc ccc ctc gcc gtc ttc cag atg ctc aag 99
Leu Leu Lys Leu Asn Val Ala Pro Leu Ala Val Phe Gln Met Leu Lys
-20 -15 -10
tcc atg tgt gcc ggg cag agg cta gcg agc gag ccc cag gac cct gcg 147
Ser Met Cys Ala Gly Gln Arg Leu Ala Ser Glu Pro Gln Asp Pro Ala
1 5 10






gtg 




ctg 






teg 




gtg 




gag 






ggg 


aga aac 


195 


Ala 


Val 


Ser 


Leu 


Pro 


Thr 


Ser 


Ser 


Val 


Pro 


Glu 


Thr 


A?g 


Gly Arg Asn 










15 










20 










25 






aaa 


ggc 


age 


get 


gcc 


etc 


ggg 


gga 


gca 


ttg 


gee 


etg 


geg 


gaa 


egm rre 


243 


Lys 


Gly 


Ser 
30 


Ala 


Ala 


Leu 


Gly 


Gly 
35 


Ala 


Leu 


Ala 


Leu 


Ala 
40 


Glu 


Arg Xaa 




age 


cgc 


gaa 


gga 


tec 


age 


eag 


agg 


atg 


eea 


ege 


eag 


eee 


age 


get aee 


291 


Ser 


Arg 
45 


Glu 


Gly 


Ser 


Ser 


Gin 
50 


Arg 


Met 


Pro 


Arg 


Gin 
55 


Pro 


Ser 


Ala Thr 




agg 


ctg 


ccc 


aag 


ggg 


ggc 


ggg 


eet 


ggg 


aag 


age 


eet 


aea 


egg 


ggc age 


339 


Arg 


Leu 


Pro 


Lys 


Gly 


Gly 


Gly 


Pro 


Gly 


Lys 


Ser 


Pro 


Thr Arg 


Gly Ser 




60 










65 










70 








75 





aee taggatgggg cagagaettg ttgeatettt gteeeeagea aaggetacat 392 
Thr 

gttaeetcet teaattgata ataaacettt etgagatgea gagggteeag gteaaaaaaa 452 

aaaaaaaaaa aa 464 

<210> 76
<211> 624
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 59..367

<220>
<221> sig_peptide
<222> 59..166
<223> Von Heijne matrix
score 3.66643500261364
seq RALSTFLFGSIRG/AA

<220>
<221> polyA_site
<222> 609..624

<220>
<221> polyA_signal
<222> 588..593

<400> 76

agtgcgcaga cgcaggggtc ggcgccgggt gagagcgtgc ggccggattc accacaac 58
atg gca aat ctt ttt ata agg aaa atg gtg aac cct ctg ctc tat ctc 106
Met Ala Asn Leu Phe Ile Arg Lys Met Val Asn Pro Leu Leu Tyr Leu
-35 -30 -25
cgt cgt cac acg gtg aag cct cga gcc ctc tcc aca ttt cta ttt gga 154
Arg Arg His Thr Val Lys Pro Arg Ala Leu Ser Thr Phe Leu Phe Gly
-20 -15 -10 -5
tcc att cga ggt gca gcc ccc gtg gct gtg gaa ccc ggg gca gca gtg 202
Ser Ile Arg Gly Ala Ala Pro Val Ala Val Glu Pro Gly Ala Ala Val
1 5 10
cgc tca ctt ctc tca ccc ggc ctc ctg ccc cat ctg ctg cct gcg ctg 250
Arg Ser Leu Leu Ser Pro Gly Leu Leu Pro His Leu Leu Pro Ala Leu
15 20 25










ggg 


ttc 


aaa 




aag 


act 


gtc 


ett 


aag 


aag 


ege 


tge 


aag 


gac 


tgt 


tac 


298 


Gly 


Phe 


Lys 


Asn 


Lys 


Thr 


Val 


Leu 


Lys 


Lys 


Arg 


Cys 


Lys 


Asp 


Cys 


Tyr 






30 










35 










40 












ctg 


gtg 


aag 


agg 


egg 


ggt 


egg 


tgg 


tac 


gtc 


tac 


tgt 


aaa 


acc 


cat 


ccg 


346 




Val 


Lys 


Arg 


Arg 


Gly 


Arg 


Trp 


Tyr 


Val 


Tyr 


Cys 


Lys 


Thr 


His 


Pro 




45 










50 










55 










60 
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agg cac aag cag aga cag atg tagacccttt ccctccagag tcacgcacat 397 
Arg His Lys Gin Arg Gin Met 
65 

actcgtcatc gcatcacttg ggagaatggt tgtatcttat ggaaggaatt atcacatcaa 457 
ggagtcaggg gaaagtgact ggaagcaaac gccctaaaag ttacccatca cgtttcagtg 517 
taaatgagta actatagaag acattgcgtt atcttatttc maaaacgttc caactaaaaa 577 
acattttcct attaaaatag accttccgaa caaaaaaaaa aaaaaaa 624 
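
The feature locations given for SEQ ID NO: 76 (CDS 59..367, sig_peptide 59..166) can be checked arithmetically. The Python sketch below is illustrative only and is not part of the listing. The helper names `parse_location` and `span_length` are hypothetical, coordinates are assumed to be 1-based and inclusive, and the parser tolerates the stray spaces that scanning can introduce into the `..` separator.

```python
import re


def parse_location(text):
    """Parse a location such as '59..367', tolerating OCR spacing like '59. .367'."""
    match = re.fullmatch(r"\s*(\d+)\s*\.\s*\.\s*(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError(f"end precedes start in {text!r}")
    return start, end


def span_length(start, end):
    """Length in nucleotides of a 1-based inclusive span (assumed convention)."""
    return end - start + 1


cds = parse_location("59. .367")
sig = parse_location("59. .166")

for name, span in (("CDS", cds), ("sig_peptide", sig)):
    length = span_length(*span)
    whole = "whole codons" if length % 3 == 0 else "not a multiple of 3"
    print(f"{name}: {span[0]}..{span[1]} = {length} nt ({length // 3} codons, {whole})")
```

Under these assumptions the CDS spans 309 nucleotides (103 codons) and the signal peptide 108 nucleotides (36 codons), which agrees with the residue numbering running from -36 to 67 in the table above.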



<210> 77 
<211> 1158 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 58..315

<220> 

<221> sig_peptide 

<222> 58..165

<223> Von Heijne matrix 

score 3.68684848854945 

seq SVSLSGFTVGTLS/ET 

<220> 

<221> polyA_site 
<222> 1143..1158

<400> 77 

ctgtgggaga ggccaggatg ctgcattagg cacaggataa cctgggaacc caggcac 57 
atg ggt cct gct ctc cga agt ctg caa gtc aag aag gga aca gag cac 105
Met Gly Pro Ala Leu Arg Ser Leu Gln Val Lys Lys Gly Thr Glu His
-35 -30 -25

gcc gac cct ctc cct ttc ccc tct gtc tct ctt agt ggc ttt aca gtg 153
Ala Asp Pro Leu Pro Phe Pro Ser Val Ser Leu Ser Gly Phe Thr Val
-20 -15 -10 -5

ggt acc ctg tca gaa acc agc act ggg ggc cct gcc acc ccc aca tgg 201
Gly Thr Leu Ser Glu Thr Ser Thr Gly Gly Pro Ala Thr Pro Thr Trp
1 5 10

aag gag tgt cct atc tgt aag gag cgc ttt cct gct gag agt gac aag 249
Lys Glu Cys Pro Ile Cys Lys Glu Arg Phe Pro Ala Glu Ser Asp Lys
15 20 25

gat gcc ctg gag gac cac atg gat gga cac ttc ttt ttc agc acc cag 297
Asp Ala Leu Glu Asp His Met Asp Gly His Phe Phe Phe Ser Thr Gln
30 35 40

gac ccc ttc acc ttt gag tgatcttact ccctcgtaca tgcacaaata 345
Asp Pro Phe Thr Phe Glu
45 50

cacactcatg cacacacaca ctcacacaca tgcatacact taggtttcat gcccattttc 405
tatcacactg ggctccatga tattctgttc cctaagaact gcttctgtgt gccctgtttt 465
catcccaaga tttctcactt catcctctcc tacctggctc ttttgtccca gggaggggtc 525
ctgttcggaa gcagtggctg aatttatccc ctgaaagtgg ttttggagga accgggatgg 585
aggaggcctt cccctgtggg aatagaatcg tccactccta gccctggttg cttctgatac 645
acagccactg cacacacaca ctcacactca cactcccttg tytgatgccc caaagccaat 705
tcctggggca ccctaccctc tcttatttgg agtttccgtt ggtttacctg agttttctct 765
ggggtctgca cagaggcagc agcatggaca tcatggcctc tcaggtccct tttggttctc 825
agtttcattg gttcctcttt ctgttccccc attgacttct gtgccccacc ctagcctttt 885
ccataacctt aggtattcag tttggagggg ttttttgtat ttttgaggat tcctgtattc 945
tgtatcctct cctcgcatct cctcacatgg aaagaaataa tgtatttgtg ccttctgtga 1005
ggaatggggg gaacaagtgg tcccaggtat ccccatttcc aaggcccccc tccctctcca 1065
ggtcccccca cagcaataaa agcttccccc tgatatccat ccctttgtag tttgaacaaa 1125
tatatttata tgatatgaaa aaaaaaaaaa aaa 1158 
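
The number at the end of each sequence line is a running residue total, so a scanned listing can be screened for dropped or duplicated characters. The following Python sketch is offered as an illustration only. The function name `check_running_total` is hypothetical, and it assumes that the last whitespace-separated token on a line is the running total and that every other token consists solely of residues.

```python
def check_running_total(line, previous_total):
    """Return (is_consistent, declared_total) for one sequence line.

    The final token is read as the declared running total; all other
    tokens are counted as residues and added to previous_total.
    """
    tokens = line.split()
    declared = int(tokens[-1])
    residues = sum(len(token) for token in tokens[:-1])
    return previous_total + residues == declared, declared


# Two lines of SEQ ID NO: 77, each paired with the total of the line before it.
lines = [
    ("aggaggcctt cccctgtggg aatagaatcg tccactccta gccctggttg cttctgatac 645", 585),
    ("tatatttata tgatatgaaa aaaaaaaaaa aaa 1158", 1125),
]

results = [check_running_total(text, previous) for text, previous in lines]
for ok, total in results:
    print(total, "consistent" if ok else "inconsistent")
```

A line whose total does not match would need to be compared against the original document; the check alone cannot say which character is wrong.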
<210> 78
<211> 641
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 187..342

<220>
<221> sig_peptide
<222> 187..276
<223> Von Heijne matrix
score 3.57432920610252
seq VKLLLKMVSSATT/ER

<220>
<221> polyA_site
<222> 626..641

<400> 78

attctaaatc aataaatata gcttacccac agacttttta tgggatgtaa agtatgtatt 60 
acaacaggta aatccatttc tctgttttgg gaaagcaggt taatgttttg tttgttttgt 120 
tgctgcatat aatgttgcac tatatatata ttctaatgca gttggtctcc taattctaca 180 
ggataa atg tct agc tgt ggc att gtt gga tca tca gtt tcg ttt cag 228
Met Ser Ser Cys Gly Ile Val Gly Ser Ser Val Ser Phe Gln
-30 -25 -20

tta gat gct gtg aaa ttg ctc ttg aaa atg gtg tcc tct gcc acc aca 276
Leu Asp Ala Val Lys Leu Leu Leu Lys Met Val Ser Ser Ala Thr Thr
-15 -10 -5

gaa cgg tgt tgt aat gga agt gcc aat ttt cat aaa aac ttg tgt gca 324
Glu Arg Cys Cys Asn Gly Ser Ala Asn Phe His Lys Asn Leu Cys Ala
1 5 10 15

aca ggt att aag aat ttt tgaggtcggg cgcagtagct cacgcctgta 372
Thr Gly Ile Lys Asn Phe
20

atcccagcac tttgggaggc ctaggcaggt ggatcacctg aggtcaggag ttccagacca 432
gcctgaccaa catggtgaaa tcccatctcc actaaataca aaaaattagc tgggcgtggt 492
ggcatacacc tgtaatccca gctacttggg aggctaagac aggagaattg cttgaaccca 552
ggaggcggag gttgcagtga gccgagattg caccactaca ctccagcctg ggtgatgagc 612 
gaaactccat ctcaaaaaaa aaaaaaaaa 641 
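
The relationship between the CDS of SEQ ID NO: 78 (positions 187..342), its translation, and the signal peptide boundary (187..276) can be sketched as follows. This Python example is illustrative only and is not part of the listing. It assumes the standard genetic code, uses the codons as read from the table above with scanning errors repaired, and the names `CODON_TABLE`, `translate`, and `CDS_ROWS` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a lowercase DNA string; codons with ambiguity codes become 'X'."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# Codons of SEQ ID NO: 78, positions 187..342 (stop codon excluded).
CDS_ROWS = [
    "atg tct agc tgt ggc att gtt gga tca tca gtt tcg ttt cag",
    "tta gat gct gtg aaa ttg ctc ttg aaa atg gtg tcc tct gcc acc aca",
    "gaa cgg tgt tgt aat gga agt gcc aat ttt cat aaa aac ttg tgt gca",
    "aca ggt att aag aat ttt",
]
cds = "".join(" ".join(CDS_ROWS).split())

protein = translate(cds)
signal_length = (276 - 187 + 1) // 3   # sig_peptide 187..276 gives 30 residues
signal, mature = protein[:signal_length], protein[signal_length:]

print(signal)   # ends with the listed cleavage context VKLLLKMVSSATT
print(mature)   # begins with ER, numbered from position 1 in the listing
```

Under these assumptions the signal peptide is 30 residues long and the mature polypeptide 22 residues, matching the numbering -30 to 22 shown in the table.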

<210> 79
<211> 660
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 2..658

<220>
<221> sig_peptide
<222> 2..121
<223> Von Heijne matrix
score 6.4143157541603
seq ITMSATVVDAVNA/AP

<400> 79

t cct awt cga aaa att acg aga ttg gct tgg att ctg tcg gat gga ctt 49

Pro Xaa Arg Lys Ile Thr Arg Leu Ala Trp Ile Leu Ser Asp Gly Leu
-40 -35 -30 -25

ggg gct agc tgc ggc ggg gct gga gga ggc cag ata acc atg tca gcc 97
Gly


Ala 


Ser 


Cys 


Gly 


Gly Ala 


Gly 


Gly 










-20 










aca 


gtt 


gta 


gat 


gca 


gtt 


aat 


get 


gca 


Thr 


Val 


Val 


Asp 


Ala 


Val 


Asn 


Ala 


Ala 
1 


atg 


agt 


ttg 


-5 
gaa 


saa 


eca 


aag 


aag 


atg 


Met 


Ser 


Leu 


Glu 


Xaa 


Pro 


Lys 


Lys 


Met 




10 










15 






aag 


aag 


gag 


eta 


gaa 


gaa 


eag 


ega 


aaa 


Lys 


Lys 


Glu 


Leu 


Glu 


Glu 


Gin 


Arg 


Lys 


25 










30 








gtt 


gat 


gaa 


gaa 


gga 


aaa 


gac 


ate 


aac 


Val 


Asp 


Glu 


Glu 


Gly Lys Asp 


He 


Asn 


tct 


tea 


gtg 


eea 


tgg 


tat 


att 


gat 


ect 


Ser 


Ser 


Val 


Pro 


Trp Tyr 


He 


Asp 


Pro 








60 










65 


cac 


cag 


aga 


oca 




oca 


gaa 


aaa 


caa 


His 


Gin 


Arg 


Pro 


Gin 


Pro 


Glu 


Lys 


Gin 






75 










ao 




gaa 


tgg 


tac 


aag 


agg 


ggt 


gta 


aaa 


gag 


Glu 


Trp 


Tyr 


Lys 


Arg 


Gly Val 


Lys 


Glu 




90 










95 






cgc 


aaa 


gga 


gca 


tgt 


gaa 


aat 


tgt 


ggg 


Arg 


Lys 


Gly Ala 


Cys 


Glu 


Asn 


Cys 


Gly 


105 










110 








gac 


tgc 


ttt 


gag 


aga 


cot 


agg 


ega 


gtt 


Asp 


Cys 


Phe 


Glu 


Arg 


Pro Arg 


Arg 


Val 










125 










aat 


ata 


get 


eea 


gat 


gaa 


eat 


gte 


eag 


Asn 


He 


Ala 


Pro 


Asp 


Glu 


His 


Val 


Gin 








140 












gat 


ggg 


aag 


agg 


gat 


egg 


tgg 




ggc 


Asp 


Gly 


Lys Arg 


Asp Arg 


Trp 


Asn 


Gly 






155 










160 




aaa 


att 


gtt 


gaa 


gag 


tat 


gcc 


aaa 


gtt 


Lys 


He 


Val 


Glu 


Glu Tyr Ala 


Lys 


Val 




170 










175 







<210> 80
<211> 647
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 1..645

<220>
<221> sig_peptide
<222> 1..108
<223> Von Heijne matrix
score 5.52542399647426
seq ITMSATVXDAVNA/AP

<400> 80

att acg aga ttg gct tgg att ctg tck
Ile Thr Arg Leu Ala Trp Ile Leu Ser

-35 -30
ggc ggg gct gga gga ggc cag ata acc
Gly Gly Ala Gly Gly Gly Gln Ile Thr
-20 -15
gca gtt aat gct gca ccc cta tcg ggg






Gly 


Gin 


He 


Thr 


Met 


Ser 


Ala 




-15 










-10 






ccc 


eta 


teg 


ggg 


tec 




gaa 


145 


Pro 


Leu 


Ser 


Gly 
5 


Ser 


Lys 


Glu 




acc 


aga 


gag 


gac 


tgg 


aga 


aag 


193 


Thr 




Glu 


Asp Trp Arg 


Lys 








20 












ttg 


ggc 


aat 


get 


ect 


gca 


gaa 


241 


Leu 


Gly Asn 


Ala 


Pro 


Ala 


Glu 






35 










40 




eec 


eat 


att 


ect 


eag 


tat 


att 


289 


Pro 


His 


He 


Pro 


Gin 


Tyr 


He 




50 










55 






tea 


aaa 


aga 


ect 


act 


tta 


aaa 


337 


Ser 


Lys 


Arg 


Pro 


Thr 


Leu Lys 












70 








aag 


cag 


ttc 


age 


tea 


tct 


gga 


385 


Lys 


Gin 


Phe 


Ser 


Ser 


Ser 


Gly 










85 










aat 


tec 


ata 


att 


act 


aag 


tac 


433 


Asn 


Ser 


He 


He 


Thr 


Lys 


Tyr 








100 












gcc 


atg 




cac 


aaa 


aag 


aaa 


481 


Ala 


Met 


Thr 


His 






Lys 






115 










120 




gga 


gcc 


aaa 






ggt 


act 


529 


Gly 


Ala 


Lys 


Phe 


Thr 


Gly Thr 




130 
















ect 






atg 


ttt 


gac 


tat 


577 


Pro 


Gin 


Leu 


Met 


Phe 


Asp 


Tyr 












150 


















cac 


atg 


625 


Tyr 


Asn 


Pro 


Glu 


Glu 


His 


Met 










165 










gat 


ttg 


gc 










660 


Asp 


Leu 
















gga 


ett 


ggg 


get 


age 


tgc 




Asp 


Gly 


Leu 


Gly Ala 


Ser 


Cys 








-25 












atg 


tea 


gcc 


aca 


gtt 


gtm 




96 


Met 


Ser 


Ala 


Thr 


Val 


Val 


Asp 






-10 










-5 




tec 


aaa 


gaa 


atg 


agt 


ttg 


gaa 


144 
Ala Val


Asn Ala 


Ala 


Pro 


Leu 


Ser 


Gly 


Ser 


Lys 


Glu 


Met Ser 


Leu 


Glu 








1 








5 








10 








gaa cca 


aag aag 


atg 


acc 


aga 


gag 


gac 


tgg 


aga 


aag 


aag aag 


gag 


eta 


192 


Glu Pro 


Lys Lys 


Met 


Thr 


Arg 


Glu 


Asp 


Trp 


Arg 


Lys 


Lys Lys 


Glu 


Leu 






15 








20 










25 








gaa gaa 


cag cga 


aaa 


ttg 


ggc 


aat 


get 


cet 


gca 


gaa 


gtt gat 


gaa 


gaa 


240 


Glu Glu 


Gin Arg 


Lys 


Leu 


Gly 


Asn 


Ala 


Pro 


Ala 


Glu 


Val Asp 


Glu 


Glu 




30 








35 










40 










gga aaa 


gac ate 


aac 


ecc 


cat 


att 


cct 


cag 


tat 


att 


tet tea 


gtg 


cca 


288 


Gly Lys 


Asp He 


Asn 


Pro 


His 


He 


Pro 


Gin 


Tyr 


He 


Ser Ser 


Val 


Pro 




45 






50 










55 








60 




tgg tat 


att gat 


cct 


tea 


aaa 


aga 


cet 


act 


tta 


aaa 


cac cag 


aga 


cca 


336 


Trp Tyr 


He Asp 


Pro 


Ser 


Lys 


Arg 


Pro 


Thr 


Leu 


Lys 


His Gin 


Arg 


Pro 








65 










70 








75 






caa cca 


gaa aaa 


caa 


aag 


cag 


tte 


age 


tea 


tet 


gga 


gaa tgg 


tac 


aag 


384 


Gin Pro 


Glu Lys 


Gin 


Lys 


Gin 


Phe 


Ser 


Ser 


Ser Gly Glu Trp 


Tyr 


Lys 






80 










85 








90 








agg ggt 


gta aaa 


gag 


aat 


tec 


ata 


att 


act 




tac 


cgc aaa 


gga 


gca 


432 


Arg Gly 


Val Lys 


Glu 


Asn 


Ser 


He 


He 


Thr 


Lys 


Tyr 


Arg Lys 


Gly Ala 






95 








100 










105 








tgt gaa 


aat tgt 


ggg 


gcc 


atg 


aca 


cac 


aaa 


aag 


aaa 


gac tgc 


ttt 


gag 


480 


Cys Glu 


Asn Cys 


Gly 


Ala 


Met 


Thr 


His 


Lys 


Lys 


Lys Asp Cys 


Phe 


Glu 




110 








115 










120 










aga act 


agg cga 


gtt 


gga 


gcc 


aaa 


ttt 


aca 


ggt 


act 


aat ata 


get 


cca 


528 


Arg Pro 


Arg Arg 


Val 


Gly 


Ala 


Lys 


Phe 


Thr 


Gly 


Thr 


Asn He 


Ala 


Pro 




125 






130 










135 








140 




gat gaa 


cat gtc 


cag 


cct 


caa 


ctg 


atg 


ttt 


gac 


tat 


gat ggg 


aag 


agg 


576 


Asp Glu 


His Val 


Gin 


Pro 


Gin 


Leu 


Met 


Phe 


Asp Tyr Asp Gly 


Lys 


Arg 








145 










150 








155 






gat egg 


tgg aat 


ggc 


tac 


aat 


cca 


gaa 


gaa 


cac 


atg 


aaa att 


gtt 


gaa 


624 


Asp Arg 


Trp Asn 


Gly 


Tyr 


Asn 


Pro 


Glu 


Glu 


His 


Met 


Lys He 


Val 


Glu 






160 










165 








170 








gag tat 


gcc aaa 


gtt 


gat 


ttg 


gc 
















647 


Glu Tyr 


Ala Lys 


Val 




Leu 






















175 


























<210> 81
<211> 643
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 1..642

<220>
<221> sig_peptide
<222> 1..108
<223> Von Heijne matrix
score 6.4143157541603
seq ITMSATVVDAVNA/AP

<400> 81


























att acg 


aga ttg 


get 


tgg 


att 


ctg 


teg 


gat 


gga 


ctt 


ggg get 


age 


tgc 


48 


He Thr 


Arg Leu 


Ala 


Trp 


He 


Leu 


Ser 


Asp 


Gly 


Leu 


Gly Ala 


Ser 


Cys 




-35 








-30 










-25 










ggc ggg 


get gga 


gga 


ggc 


cag 






atg 


tea 


gcc 


aca gtt 


gta 


gat 


96 


Gly Gly Ala Gly Gly 


Gly 


Gin 


He 


Thr 


Met 


Ser 


Ala 


Thr Val 


Val 


Asp 




-20 






-15 










-10 








-5 




gca gtt 


aat get 


gca 




eta 


teg 


ggg 


tec 


aaa 


gaa 


atg agt 


ttg 


gaa 


144 


Ala Val 


Asn Ala 


Ala 


Pro 


Leu 


Ser 


Gly 


Ser 


Lys 


Glu 


Met Ser 


Leu 


Glu 








1 








5 








10 








gaa cca 


aag aag 


atg 




aga 


gag 


gac 


tgg 


aga 


aag 


aag aag 


gag 


eta 


192 
Glu


Pro 


Lys 


Lys 


Met 


Thr 


Arg 


Glu 


Asp 


Trp 


Arg 


Lys 


Lys 


Lys Glu Leu 








15 










20 










25 






gaa 


gaa 


cag 


cga 


aaa 


ttg 


ggc 


aat 


get 


cct 


gca 


gaa 


gtt 


gat gaa gaa 


240 


Glu 


Glu 


Gin 


Arg 


Lys 


Leu 


Gly 


Asn 


Ala 


Pro 


Ala 


Glu 


Val 


Asp Glu Glu 






30 










35 










40 








gga 


aaa 


gac 


ate 


aac 


ccc 


cat 


att 


cct 


cag 


tat 


att 


tct 


tea gtg cca 


288 




















Gin 


Tyr 


lie 


Ser 


Ser Val Pro 




45 










50 










55 






60 




tgg 


tat 


att 


gat 


cct 


tea 


aaa 


aga 


cct 


act 


tta 


aaa 


cac 


cag aga cca 


336 


Trp 


Tyr 


lie 


Asp 


Pro 


Ser 


Lys 




Pro 


Thr 


Leu 


Lys 


His 


Gin Arg Pro 












65 










70 








75 




caa 


cca 


gaa 


aaa 


caa 


aag 


cag 


ttc 


age 


tea 


tet 


gga 


gaa 


tgg tac aag 


384 


Gin 


Pro 


Glu 


Lys 


Gin 


Lys 


Gin 


Phe 


Ser 


Ser Ser Gly Glu Trp Tyr Lys 





80 85 90 

agg ggt gta aaa gag aat tcc ata att act aag tac cgc aaa gga gca
Arg Gly Val Lys Glu Asn Ser Ile Ile Thr Lys Tyr Arg Lys Gly Ala
95 100 105

tgt gaa aat tgt ggg gcc atg aca cac aaa aag raa gac tgc ttt gag
Cys Glu Asn Cys Gly Ala Met Thr His Lys Lys Xaa Asp Cys Phe Glu
110 115 120

aga cct agg cga gtt gga gcc aaa ttt aca ggt act aat ata gct cca
Arg Pro Arg Arg Val Gly Ala Lys Phe Thr Gly Thr Asn Ile Ala Pro
125 130 135 140

gat gaa cat gtc cag cct caa ctg atg ttt gac tat gat ggg aag agg
Asp Glu His Val Gln Pro Gln Leu Met Phe Asp Tyr Asp Gly Lys Arg
145 150 155

gat cgg tgg aat ggc tac aat cca gaa gaa cac atg aaa att gtt gaa
Asp Arg Trp Asn Gly Tyr Asn Pro Glu Glu His Met Lys Ile Val Glu
160 165 170

gag tat gcc aaa gtt gat t
Glu Tyr Ala Lys Val Asp 
175 

<210> 82
<211> 1200
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 29..469

<220>
<221> sig_peptide
<222> 29..73
<223> Von Heijne matrix
score 4.5958301326157
seq VLVSXCSARLLQQ/EE

<220>
<221> polyA_site
<222> 1185..1200

<400> 82

acttggcgag tgagacgctg atgggagg atg gac gta ctg gtg tct rag tgc 
Met Asp Val Leu Val Ser Xaa Cys 
-15 -10 
tcc gcg cgg ctg ctg cag cag gaa gaa gag att aaa tct ctg act gct
Ser Ala Arg Leu Leu Gln Gln Glu Glu Glu Ile Lys Ser Leu Thr Ala
-5 1 5

gaa att gac cgg ttg aaa aac tgt ggc tgt tta gga gct tct cca aat
Glu Ile Asp Arg Leu Lys Asn Cys Gly Cys Leu Gly Ala Ser Pro Asn
ttg


gag 


cag 


^eu 




gaa 


gaa 


aat 


tta 


aaa 


tta 


aag 


tat 


cga 


ctg 


aat 


196 


Leu 


Glu 


Gin 




Gin 
30 


Glu 


Glu 


Asn 


Leu 


Lys 
35 




Lys 


Tyr 


Arg 


Leu 
40 


Asn 




att 


ctt 


cga 


aag 


agt 


ctt 


cag 


gca 


gaa 


agg 


aac 


aaa 


cca 


act 


aaa 


aat 


244 


He 


Leu 


Arg 


Lys 
45 


Ser 


Leu 


Gin 


Ala 


Glu 
50 


Arg 


Asn 




Pro 


Thr 

55 




Asn 




atg 


att 


aac 


att 


att 


age 


cge 


eta 


eaa 


gag 


gte 


ttt 


ggt 


cat 


gca 


att 


292 


Met 


He 


Asn 
60 


He 


He 


Ser 


Arg 


Leu 
65 


Gin 


Glu 


Val 


Phe 


Gly 
70 


His 


Ala 


He 




aag 


get 


gca 


tat 


cca 


gat 


ttg 


gaa 


aat 


cet 


eet 


ctg 


eta 


gtg 


aca 


cca 


340 


Lys 


Ala 

75 


Ala 


Tyr 


Pro 


Asp 


Leu 

80 


Glu 


Asn 


Pro 


Pro 


Leu 

85 


Leu 


Val 


Thr 


Pro 




agt 




cag 


gcc 


aag 


ttt 


ggg 


gac 


tat 


cag 


tgt 




agt 


get 


atg 


ggt 


388 


Ser 


Gin 


Gin 


Ala 


Lys 


Phe 


Gly 


Asp 


Tyr 


Gin 


Cys 


Asn 


Ser 


Ala 


Met 


Gly 




90 










95 










100 










105 




att 


tct 


cag 


gtg 


atg 


tat 


tgt 


cat 


gac 


tct 


tgg 


ctg 


ttt 


gat 


ttt 


ttt 


436 


He 


Ser 


Gin 


Val 


Met 


Tyr 


Cys 


His 


Asp 


Ser 


Trp 


Leu 


Phe Asp Phe 


Phe 












110 










115 










120 






aag 


tat 


tat 


tat 


cat 


cat 


tgc 


cat 


tta 


cag 


aaa 


taatactatt acaagttgta 


489 


Lys 


Tyr 


Tyr 


Tyr 
125 


His 


His 


Cys 


His 


Leu 
130 


Gin 


Lys 















tccttagtga 


aaaggacatt 


tgccacagtt 


tgaaaaactt 


gagaaaggag 


ttgggggggt 


549 


atatgtttta 


acttttttag 


gcaeaatttt 


taaggtttgg 


ttaaatttta 


tatgtattct 


609 


caatatttaa 


gggcaatcat 


tggtaetett 


ttgtttaggt 


atttccctcc 


tgctgtgtcc 


669 


aggaktgctg 


tgtggtggtr 


atgagtgctg 


ggaggtgaaa 


aattaaaata 


agccatttac 


729 


cagtcagcat 


cccaattaaa 


tatttgatgt 


aactgtgatc 


tttgagccag 


gcttatatat 


789 


tcattttcaa 


gcagaggagt 


tccccatttt 


aaatagagge 


attgtctgat 


gtgtttatgg 


849 




ctggcttggg 




ttcctttctt 


tgctgaattg 


gaaggggtta 


909 


ctctgaagag 




acagtgtggt 


ttatttctca 


agtgtgaata 


ttgcacacct 


969 


tcatggcttg 


aaaattagaa 




gctggaacca 


gacttcataa 


ctgtagactt 


1029 


tgttatgtca 


tcattaaaaa 


gttcgaggtg 


ttgtaatcec 


agctactagg 


gaggctgagg 


1089 


caggagaatt 


gcttgaaccc 


aggggcggag 


gttacagtga 


gccgagatca 


caccactgea 


1149 


ctccagcctg 


ggcaacagag 


cgagactcca 


tctctaaaaa 


aaaaaaaaaa 


a 


1200 



<210> 83
<211> 351
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 1..351

<220>
<221> sig_peptide
<222> 1..129
<223> Von Heijne matrix
score 4.10872570449087
seq LGQVAAAAAAANA/AS

<220>
<221> polyA_site
<222> 336..351

<220>
<221> polyA_signal
<222> 306..311

<400> 83

ctt tct cca gct caa gcg tct ggt att tgg tgt agg gac agc gct ttt 48
Leu Ser Pro Ala Gln Ala Ser Gly Ile Trp Cys Arg Asp Ser Ala Phe
-40 -35 -30

tcc ggg tgg agc gag cgt gga tcc agt tcg cgg cgg ggt tgt ttg ggt 96
Ser Gly Trp Ser Glu Arg Gly Ser Ser Ser Arg Arg Gly Cys Leu Gly
-25 -20 -15

caa gtt gct gct gct gct gcg gct gcg aat gct gct tct cct cct cct 144
Gln Val Ala Ala Ala Ala Ala Ala Ala Asn Ala Ala Ser Pro Pro Pro
-10 -5 1 5

tca gcg aac agc ctg gga tcg ggt ggt cgc tgc aag ctg agg gct ccg 192
Ser Ala Asn Ser Leu Gly Ser Gly Gly Arg Cys Lys Leu Arg Ala Pro
10 15 20

acg agc cgc ccg tcc cag tcg cgg ccc cga tcc ctg ccc cag gcc cgg 240
Thr Ser Arg Pro Ser Gln Ser Arg Pro Arg Ser Leu Pro Gln Ala Arg
25 30 35

ttc ccc ccg tcg ccg ctg ccg cct tct cct gca gga acg agt tgc aca 288
Phe Pro Pro Ser Pro Leu Pro Pro Ser Pro Ala Gly Thr Ser Cys Thr
40 45 50

tgt tac tgc ttc cta cta ata aat gct gac ctg atc aaa tgg agc cca 336
Cys Tyr Cys Phe Leu Leu Ile Asn Ala Asp Leu Ile Lys Trp Ser Pro

55 60 65 

aaa aaa aaa aaa aaa 351 
Lys Lys Lys Lys Lys 
70 

<210> 84
<211> 438
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 15..278

<220>
<221> sig_peptide
<222> 15..122
<223> Von Heijne matrix
score 3.79790641648006
seq ILMRDFSPSGIFG/AF

<220>
<221> polyA_site
<222> 423..438

<220>
<221> polyA_signal
<222> 396..401

<400> 84



accaggactc caaa


atg 


geg 


tea 


gtt 


gta 


eea 


gtg 


aag 


gae aag aaa 


ctt 


50 










Met 


Ala 


Ser 


Val 


Val 


Pro 


Val 


Lys 


Asp Lys Lys 


Leu 














-35 










-30 






-25 




etg 


gag 


gtc 


aaa 


ctg 


ggg 


gag 


ctg 


cca 


age 


tgg 


ate 


ttg atg egg 


gae 


98 


Leu 


Glu 


Val 


Lys 


Leu 


Gly 


Glu 


Leu 


Pro 


Ser 


Trp 


He 


Leu Met Arg 


Asp 












-20 










-15 






-10 






ttc 


agt 


cct 


agt 


ggc 


att 


ttc 


gga 


gcg 


ttt 


caa 


aga 


ggt tae tae 


egg 


146 


Phe 


Ser 


Pro 


Ser 
-5 


Gly 


He 


Phe 


Gly 


Ala 
1 


Phe 


Gin 


Arg 


Gly Tyr Tyr 
5 


Arg 




tac 




aac 


aag 


tae 


ate 


aat 


gtg 


aag 


aag 


ggg 


age 


ate teg ggg 


att 


194 


Tyr 


Tyr 


Asn 


Lys 


Tyr 


He 


Asn 


Val 


Lys 


Lys Gly Ser 


He Ser Gly 


He 






10 










15 










20 










atg 


gtg 


ctg 


gca 


tgc 


tac 


gtg 


etc 


ttt 


age 


tac 


tec ttt tec 


tac 


242 


Thr 


Met 


Val 




Ala 


Cys 


Tyr 


Val 




Phe 


Ser 


Tyr 


Ser Phe Ser 


Tyr 




25 










30 










35 






40 




aag 


cat 


etc 




cac 


gag 


egg 


etc 
tgaagaggac
Tyr
45 50
acactctgca cccccccacc ccacgacctt ggcccgagcc cctccgtgag gaacacaatc 348
tcaatcgttg ctgaatcctt tcatatccta ataggaatta acctccaaat aaaacatgac 408
tggtacgtgt gctgaaaaaa aaaaaaaaaa 438

<210> 85
<211> 419
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 74..259

<220>
<221> sig_peptide
<222> 74..199
<223> Von Heijne matrix
score 3.56683458636822
seq CXXSCCSCCPVGC/AK

<220>
<221> polyA_site
<222> 404..419

<220>
<221> polyA_signal
<222> 388..393

<400> 85

cactccgcct tccacgtgca cccactgcct cttcccttct cgcttgggaa ctctagtctc 60 
gcctcgggtt gca atg gac ccc aac tgc tcc tgt gcc gct gca ggt gtc 109
Met Asp Pro Asn Cys Ser Cys Ala Ala Ala Gly Val
-40 -35

tcc tgc acc tgc gcc agc tcc tgc aag tgc aaa gag tgc aaa tgc acc 157
Ser Cys Thr Cys Ala Ser Ser Cys Lys Cys Lys Glu Cys Lys Cys Thr
-30 -25 -20 -15

tcc tgc aar rag agc tgc tgc tcc tgc tgc cct gtg ggc tgt gcc aag 205
Ser Cys Lys Xaa Ser Cys Cys Ser Cys Cys Pro Val Gly Cys Ala Lys
-10 -5 1

tgt gcc cag ggc tgc atc tgc aaa ggg gca tcg gag aag tgc agc tgc 253
Cys Ala Gln Gly Cys Ile Cys Lys Gly Ala Ser Glu Lys Cys Ser Cys
5 10 15

tgc gcc tgatgtcggg acagccctgc tcccaagtac aaatagagtg acccgtaaaa 309
Cys Ala
20

tccaggattt tttgtttttt gctacaatct tgaccccttt gctacattcc tttttttctg 369
tgaaatatgt gaataataat taaacactta gaccaaaaaa aaaaaaaaaa 419

<210> 86
<211> 396
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 1..396

<220>
<221> polyA_site
<222> 381..396

<400> 86
ccc aaa gtc ccc tcc ccg gag cgg agc gca cct agg gtc cct ctt ccg
Pro Lys Val Pro Ser Pro Glu Arg Ser Ala Pro Arg Val Pro Leu Pro
20 25 30

tcc ccc cag ccc agc tac ccr ttc aga cca gca gcc tcg ggg ggc acc
Ser Pro Gln Pro Ser Tyr Pro Phe Arg Pro Ala Ala Ser Gly Gly Thr
35 40 45

ccc ccg cca gcc tgc ctc cct ccc gct cag ccc tgc cag ggt tcc cca
Pro Pro Pro Ala Cys Leu Pro Pro Ala Gln Pro Cys Gln Gly Ser Pro
50 55 60

gcc atg aat ctc ttc cga ttc ctg gga gac ctc tcc cac ctc ctc gcc
Ala Met Asn Leu Phe Arg Phe Leu Gly Asp Leu Ser His Leu Leu Ala
65 70 75 80

atc atc ttg cta ctg ctc aaa atc tgg aag tcc cgc tcg tgc gcc gcc
Ile Ile Leu Leu Leu Leu Lys Ile Trp Lys Ser Arg Ser Cys Ala Ala
85 90 95

cac ccc cag ctt cct ctc tcc ttc tgt ctg tct gtc tgt ctg tct gtc
His Pro Gln Leu Pro Leu Ser Phe Cys Leu Ser Val Cys Leu Ser Val
100 105 110

tct ctc tct ctc tct gkc tct ctc tct ctc tct ttc tct gtc tca aaa
Ser Leu Ser Leu Ser Xaa Ser Leu Ser Leu Ser Phe Ser Val Ser Lys

115 120 125 

aaa aaa aaa aaa 
Lys Lys Lys Lys 
130 



<210> 87
<211> 493
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 3..491

<400> 87

aa ccc cac cgc cta act ctc cca act ctg cag tcc gtg acc ttc agg 47
Pro His Arg Leu Thr Leu Pro Thr Leu Gln Ser Val Thr Phe Arg
1 5 10 15
ttt
Cys
Ser


Arg 


Ser 
Glu
Lys
25
Met


Val 


Ser 


Ser 
30 


Phe 
gtt


tct
caa


gta 


tta 


eta 


ggc 


ttt 


get 


gga 


egg 


aat 
Arg


Val 


Ser 


Glu 
35 


Leu 


Gin 


Val 




Leu 
40 


Gly 


Phe 


Ala 


Gly 


Arg 
45 


Asn 






agt 


gga 






eat 


gae 


etc 


ctg 


atg 


agg 


gcg 


etg 


cat 


tta 


ttg 


aag 


191 


Ser 


Gly 


Arg 
50 


Lys 


His 


Asp 


Leu 


55 


Met 


Arg 


Ala 




His 
50 


Leu 




Lys 




age 


ggc 


tgc 


age 


cct 


gcg 


gtt 


cag 


att 


aaa 


ate 


ega 


gaa 


ttg 


tat 


aga 


239 


Ser 


Gly 
65 


Cys 


Ser 


Pro 


Ala 


Val 
70 


Gin 


He 


Lys 


He 


Arg 
75 


Glu 


Leu 


Tyr 


Arg 
cga


tat
act


ett 




gga 


ctt 


tet 


gat 


tta 


tee 


aea 
80


Arg 


Tyr 


Pro 


Arg 


Thr 
85 


Leu 


Glu 
Leu
90
Leu
Thr


He 
95 
tcg
gat
tca


cct 
gaa
Val
Val
Ala
Gly
Val
115










120 










125 








cct 




tea 


cca 
gtg
ctt
gat


act 


431 
130

Pro Val Gly Ser Val Leu Leu Gln Asp Thr
135 140

aag ccc aca ttt gag atg cag cag cca tct ccc cca att cct cct gtc 479
Lys Pro Thr Phe Glu Met Gln Gln Pro Ser Pro Pro Ile Pro Pro Val
145 150 155

cat cct gat gtg ca 493
His Pro Asp Val
160



<210> 88
<211> 500
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 3..500

<400> 88

aa ccc cac cgc cta act ctc cca act ctg cag tcc gtg acc ttc agg 47

Pro His Arg Leu Thr Leu Pro Thr Leu Gln Ser Val Thr Phe Arg
1 5 10 15
agg
att
Ser


Gly 
65 


Cys 


Ser 


Pro 


Ala 


Val 
70 


Gin 


He 


Lys 


He 


Arg 
75 


Glu 


Leu 


Tyr 


Arg 
tat
ctt


tct
atc
Arg


Arg 


Tyr 


Pro 


Arg 


Thr 


Leu 


Glu 


Gly 


Leu 


Ser 


Asp 


Leu 


Ser 


Thr 


He 




80 










85 










90 










95 




aaa 


tea 


teg 


gtt 


ttc 


agt 


ttg 


gat 


ggt 


ggc 


tea 


tea 


cct 


gta 


gaa 


cct 


335 


Lys 


Ser 


Ser 


Val 


Phe 
100 


Ser 


Leu 


Asp 


Gly 


Gly 
105 


Ser 


Ser 


Pro 


Val 


Glu 
110 


Pro 
ttg


gee gtg 


get 


gga 


ate 


cac 


teg 


ttg 


cct 


tee 


act 


tea 


gtt 


aca 


383 


Asp 


Leu 


Ala 


Val 
115 


Ala 


Gly 


He 


His 


Ser 

120 


Leu 


Pro 


Ser 


Thr 


Ser 

125 


Val 


Thr 




cct 
ctt
Pro


His 


Ser 
130 


Pro 


Ser 


Ser 


Pro 


Val 
135 


Gly 


Ser 


Val 


Leu 


Leu 
140 


Gin 


Asp 


Thr 




aag 


ccc 


aca 


ttt 


gag 


atg 


cag 


cag 


cca 


tct 


ccc 


cca 


att 
cct


gtc 


479 


Lys 


Pro 
145 
Phe


Glu
Gln
<210> 89
<211> 544
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 141..494

<220>
<221> polyA_site
<222> 529..544

<220>
<221> polyA_signal
<222> 508..513

<400> 89

agtctcgcga taactgcgca ggcgcggacc aaagcgatct cttctgagga tccggcaaga 60 
tggcagaagt atagcagaag aagaagcgga ccttccgcaa gttcacctac cgcggcgtgg 120 
acctcgacca gctgctggac atg tcc tac gag cag ctg atg cag ctg tac agt 173
Met Ser Tyr Glu Gln Leu Met Gln Leu Tyr Ser
1 5 10 



gcg 


cgc 


cag 


egg 


egg 


egg 


ctg 
ggc


etg egg 


egg 


aag 


cag 


cac 
Leu
35
Glu
45
50
55


Met 
Ile
atg


gtg 
Ile


Gly 




gcc 


ace 


cac 


tec 


tec 
gcgcacatga ctccaaaaaa aaaaaaaaaa 544



<210> 90
<211> 304
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 2..304

<400> 90

a cgt acg gca ggg cgg ggt cgg ccc gca gtg gcc agc tgg gag cta cga 49

Arg Thr Ala Gly Arg Gly Arg Pro Ala Val Ala Ser Trp Glu Leu Arg
1 5 10 15
cat
20
25


Gin 


Gly 
30
cgg


gaa 
gac


teg 


gtt 


145 
Arg
35 
Met
40


Gin 
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45 
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85 

aaa awt gcr aaa agc
Lys Xaa Ala Lys Ser 
100 

<210> 91 
<211> 302 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 3..302

<220>
<221> polyA_site
<222> 281..302

<220>
<221> polyA_signal
<222> 230..235

<400> 91 

ac gta cgg cag ggc ggg gtc ggc ccg cag tgg cca gct ggg agc tac 47
Val Arg Gln Gly Gly Val Gly Pro Gln Trp Pro Ala Gly Ser Tyr
1 5 10 15
ccc cat
35 40
Gln Gly Arg Glu Arg Asp Ser
45
ctc 191
55 239
75
Glu Leu
80 85
Arg Leu Lys Lys
100























<210> 92 

<211> 596 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 154..594

<400> 92 

atagtgagcg gcagacagag gggtggccag tttgttgttt tacctaaaag gtgttttcca 60
catggagtgg gactgagagg cacaggtggc tgaggggacc cgcctgggac gtgaggcgca 120
ggtcctcggc cccttcatct gtcagctgca gca atg gaa tac gtg ctg gaa gtg 174
Met Glu Tyr Val Leu Glu Val
1 5

aag aac tct ccg cgg cac ctc ctg aag caa ttc aca gtg tgt gac gtt 222
Lys Asn Ser Pro Arg His Leu Leu Lys Gln Phe Thr Val Cys Asp Val
10 15 20
<210> 93

<211> 596 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 154..594

<400> 93 

atagtgagcg gcagacagag gggtggccag tttgttgttt tacctaaaag gtgttttcca 60 
catggagtgg gactgagagg cacaggtggc tgaggggacc cgcctgggac gtgaggcgca 120 
ggtcctcggc cccttcatct gtcagctgca gca atg gaa tac gtg ctg gaa gtg 174
                                     Met Glu Tyr Val Leu Glu Val
                                     1               5

        Ser Pro Arg His Leu Leu Lys Gln Phe Thr Val Cys Asp Val
        10                  15                  20

cct ctg tat gac att tgt gac tac aac gtc tcc aga gac cga tgc cag 270
Pro Leu Tyr Asp Ile Cys Asp Tyr Asn Val Ser Arg Asp Arg Cys Gln
    25                  30                  35

gag ctc ggg tgc tgc ttc     gaa ggc gtc tgc tac aag aaa gcg gtt 318
Glu Leu Gly Cys Cys Phe Tyr Glu Gly Val Cys Tyr Lys Lys Ala Val
40                  45                  50                  55

ccc att tac atc cac gtg ttc tct gcc ttg att gtg atc     gct ggg 366
Pro Ile Tyr Ile His Val Phe Ser Ala Leu Ile Val Ile Ile Ala Gly
                60                  65                  70

gcc ttc gtc atc acc atc atc tac aga gtc att cag gag agc agg     414
Ala Phe Val Ile Thr Ile Ile Tyr Arg Val Ile Gln Glu Ser Arg Lys
            75                  80                  85

gaa aag gcc atc cct gtg tat gtc gcg ctg cca cag aag tcc agc gaa 462
Glu Lys Ala Ile Pro Val Tyr Val Ala Leu Pro Gln Lys Ser Ser Glu
        90                  95                  100

aag gcg gag ttg gcc tca tcc agc agc aag tta ggg ctg aag cct gcg 510
Lys Ala Glu Leu Ala Ser Ser Ser Ser Lys Leu Gly Leu Lys Pro Ala
    105                 110                 115

agt cct ggg cct cca agt gct ggg ccc tcg atg aag agt gac gag gat 558
Ser Pro Gly Pro Pro Ser Ala Gly Pro Ser Met Lys Ser Asp Glu Asp
120                 125                 130                 135

aag gat gat gta aca ggg aca ata aca gaa gcc gaa ga 596
Lys Asp Asp Val Thr Gly Thr Ile Thr Glu Ala Glu
                140                 145



<210> 94 

<211> 1413 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 24..470

<220>
<221> polyA_site
<222> 1398..1413



<400> 94 

gggttcccgt tccccgcgga gcc atg cgg tac aac gag aag gag ctg cag gct 53
                          Met Arg Tyr Asn Glu Lys Glu Leu Gln Ala
                          1               5                   10

ctg tcc cgg cag ccg gcc gag atg gcg gcc gag ctg ggc atg agg ggc 101
Leu Ser Arg Gln Pro Ala Glu Met Ala Ala Glu Leu Gly Met Arg Gly
                15                  20                  25

ccc aag aag ggc agc gtg ctg aag cgg cgg ctg gtg aag ctg gtg gtg 149
Pro Lys Lys Gly Ser Val Leu Lys Arg Arg Leu Val Lys Leu Val Val
            30                  35                  40

aat ttc ctc ttc tac ttt cgg aca gac gag gcc gag ccc gtc gga gcc 197
Asn Phe Leu Phe Tyr Phe Arg Thr Asp Glu Ala Glu Pro Val Gly Ala
        45                  50                  55

ctg ctg ctg gag cgc tgc     gtc gtc cgg     gag ccc ggc acc ttc 245
Leu Leu Leu Glu Arg Cys Arg Val Val Arg Glu Glu Pro Gly Thr Phe
    60                  65                  70

tcc atc agc ttc att gag gac cct gag agg aag tat cac ttt gag tgc 293
Ser Ile Ser Phe Ile Glu Asp Pro Glu Arg Lys Tyr His Phe Glu Cys
75                  80                  85                  90

agc agc gag gag cag tgt cag gag tgg atg gag gct ctg cgt cgg gcc 341
Ser Ser Glu Glu Gln Cys Gln Glu Trp Met Glu Ala Leu Arg Arg Ala
                95                  100                 105

agc tac gag ttc atg cgg aga agc ctc atc ttc tac agg aac gaa atc 389
Ser Tyr Glu Phe Met Arg Arg Ser Leu Ile Phe Tyr Arg Asn Glu Ile
            110                 115                 120

cgg aag gtg acg ggc aag gac ccc ctg gaa cag ttc ggc ata tcc gag 437
Arg Lys Val Thr Gly Lys Asp Pro Leu Glu Gln Phe Gly Ile Ser Glu
        125                 130                 135

gag gcc agg ttc cag ctg agt ggc ttg cag gcg tgagcgcagg gcacggtggt 490
Glu Ala Arg Phe Gln Leu Ser Gly Leu Gln Ala
    140                 145

cagcgtgcag cgggacggga ctggccctgc ccagccatga atcgcttggc catgcctgga 550

tctgttttgt tttggttttt ggtttttggg tcagggtttc actgtgttgc ccaggctaga 610

gtgcagtggt gccacagctc actgtgacct tgaccttctg gactcaagtg atcctcctgc 670 

ctcagcttcc caagtagcgg ggatcacagg catgagtcgc cacacccggc catcacacct 730 

ggattttcag tgggaggttt ttggtttgga gacatccaaa gcctgaagcc aggtgggtgt 790 

gggcaggggc tgcattttat gaaactgccc agcaagctgc gctccctggg gccccaggat 850

ccacctaact ggcttcagct tttgtcatat gggtaaaaat taaagatgtc attgaactac 910

tgtcttgttt atgagaccat tcagtggtga actgtttctg gctgataggt tatgagatat 970

gtaaagcttt ctagtactct taaaataact aaatggagta ttatatatca attcatatca 1030 

ttgactttat tattttagta gtatgcctat agaaaatatt atggactcag agtgtcataa 1090 

aatcactctt aagaatccat gcagcaggcc aggcacagtg gctcacacct gtaatgcctg 1150 

cactttggaa ggccgagaca ggcggatcac ttgaggtcag gagtttgaaa ccagccaggc 1210 

caacacagtg aaaccctgtc tctactaaaa atacaaaagg ttagccgggc atggtggcag 1270

gcgcctgtaa tcccagctac tcaggaggct gaggcaggag aattgcttga acgcaggagg 1330 
caaaggttgc agtgagctga gatcacgcca ctgcactcca gcctgggcaa cagacctcga 1390
ctccatcaaa aaaaaaaaaa aaa 1413 



<210> 95 

<211> 519 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 211..519

<220>
<221> polyA_site
<222> 504..519



<400> 95 

agattcttaa ctacgtgtct tacagcaaag ttgcccatgg gaacaaaggc tgtagggtgt 60
cattgatctt gtaatacaac acatatttga tgggtgaatc ttattagcac tcctttgaaa 120
atgttccagt atggagtcag ttgctgataa agaggaagtt cacagctagg aatccgaagt 180
taagatttct caattgccgc ctgggcactc atg cct gta gtc cca gca ctt ggg 234
Met Pro Val Val Pro Ala Leu Gly
1 5

agg cca agg tgg gca gat cac ctg agg tcg gga gtt cgg gac cag cct 282
Arg Pro Arg Trp Ala Asp His Leu Arg Ser Gly Val Arg Asp Gln Pro
10 15 20

ggc caa cct ggt gaa gcc ccc cca tct cta cta aaa ata caa aaa ttg 330
Gly Gln Pro Gly Glu Ala Pro Pro Ser Leu Leu Lys Ile Gln Lys Leu
25 30 35 40

gcc ggg tat ggt ggt ggc tgc ctg tgg tcc cag cta ctc ggg agg ctg 378
Ala Gly Tyr Gly Gly Gly Cys Leu Trp Ser Gln Leu Leu Gly Arg Leu
45 50 55

agg cgg gag aat cac ttg agc ccg gga ggc gga ggt tgc agt gag ccg 426
Arg Arg Glu Asn His Leu Ser Pro Gly Gly Gly Gly Cys Ser Glu Pro
60 65 70

aga ttg tgc cac tgc act cca gcc tgg gtg aca gag caa gac tcc atc 474
Arg Leu Cys His Cys Thr Pro Ala Trp Val Thr Glu Gln Asp Ser Ile
75 80 85

tca aaa ata gaa aag att tct caa ttg cca aaa aaa aaa aaa aaa 519
Ser Lys Ile Glu Lys Ile Ser Gln Leu Pro Lys Lys Lys Lys Lys
90 95 100



<210> 96 

<211> 347 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 73..306

<220>
<221> polyA_site
<222> 332..347

<220>
<221> polyA_signal
<222> 305..310



<400> 96 

actccgcctt ccacgtgcac ccactgcctc ttcccttctc gcttgggaac tctagtctcg 60
cctcgggttg ca atg gac ccc aac tgc tcc tgt gcc gct ggt gtc tcc tgc 111
              Met Asp Pro Asn Cys Ser Cys Ala Ala Gly Val Ser Cys
              1               5                   10

acc tgc gcc agc tcc tgc aag tgc aaa gag tgc aaa tgc acc tcc tgc 159
Thr Cys Ala Ser Ser Cys Lys Cys Lys Glu Cys Lys Cys Thr Ser Cys
    15                  20                  25

aag aag agc tgc tgc tcc tgg tgc act ggc ccc acc ctc gtg gac acc 207
Lys Lys Ser Cys Cys Ser Trp Cys Thr Gly Pro Thr Leu Val Asp Thr
30                  35                  40                  45

tgc cct gcc ctg cca cct gtc tgt ctg tcc caa aga agt tct ggt atg 255
Cys Pro Ala Leu Pro Pro Val Cys Leu Ser Gln Arg Ser Ser Gly Met
                50                  55                  60

aac ttg agg     cat gtc cag tgg gag gtg aga cca cct ctc aat att 303
Asn Leu Arg Thr His Val Gln Trp Glu Val Arg Pro Pro Leu Asn Ile
            65                  70                  75





L aaaaaaaaaa a 



<210> 97 

<211> 477 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 120..395

<220>
<221> polyA_site
<222> 462..477

<400> 97 

gcttctttct agtgtggacc agaggtgggg gcaatgagta aacatcttca ctgtaagggt 60
tttgggactg cccctcaggg cttgaggtct cggcatcgtg tctcagtcag tccttggcg 119
atg tcc cct cag acg cac tca cag act tgc ata cgt aat tta gta aca 167
Met Ser Pro Gln Thr His Ser Gln Thr Cys Ile Arg Asn Leu Val Thr
1               5                   10                  15

tgc atc aac tat cct aga aca tct aca ggg tgc aaa gga acc acc act 215
Cys Ile Asn Tyr Pro Arg Thr Ser Thr Gly Cys Lys Gly Thr Thr Thr
            20                  25                  30

cag     atc     gag cca gtg gag tta gaa gtg     ggg             263
Gln Arg Ile Met Glu Pro Val Glu Leu Glu Val Glu Gly Thr Glu Gln
        35                  40                  45

            aaa acc tgt ggt tca cta gga agg ggg aat gaa aac     311
Asp Asn Ala Lys Thr Cys Gly Ser Leu Gly Arg Gly Asn Glu Asn Thr
    50                  55                  60

atg ctc     ggt gga ttc agc atg aac aca act gtg ggg caa gga att 359
Met Leu Arg Gly Gly Phe Ser Met Asn Thr Thr Val Gly Gln Gly Ile
65                  70                  75                  80

tcc aag caa aca cac cac act agt acc act tct tcc taaacagaaa 405
Ser Lys Gln Thr His His Thr Ser Thr Thr Ser Ser
                85                  90

agggaaagag catccttcga sttccccctc cctccatcag caccgggttt tcctccaaaa 465

aaaaaaaaaa aa 477 

<210> 98 

<211> 363 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 117..269

<220>
<221> polyA_site
<222> 348..363

<220>
<221> polyA_signal
<222> 329..334

<400> 98 

cctttctcgt tccccggcca tcttatcggc tgctgttggt tgggggccgt cccgctccta 60 
aggcaggaag atggtggccg caaagaagac ggaaatctga aatagagtac tatgct atg 119 

Met 
1 

ttg gct aaa act ggt gtc cat cac tac agt ggc aat aat att gaa ctg 167
Leu Ala Lys Thr Gly Val His His Tyr Ser Gly Asn Asn Ile Glu Leu
5 10 15

ggc aca gca tgc gga aaa tac tac aga gtg tgc aca ctg gct atc att 215
Gly Thr Ala Cys Gly Lys Tyr Tyr Arg Val Cys Thr Leu Ala Ile Ile
20 25 30

gat cca ggt gac tct gac atc att aga agc atg cca gaa cag act ggt 263
Asp Pro Gly Asp Ser Asp Ile Ile Arg Ser Met Pro Glu Gln Thr Gly
35 40 45

gaa aag taaacctttt cacctacaaa atttcacctg caaaccttaa acctgcaaaa 319
Glu Lys
50

ttttccttta ataaaatttg cttgttctaa aaaaaaaaaa aaaa 363 

<210> 99 

<211> 606 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 115..411

<220>
<221> polyA_site
<222> 591..605

<220>
<221> polyA_signal
<222> 573..578

<400> 99 

aagaaagggg tgaggcctaa gggacaatca ggatgttttt cagagagaag tgtggatgct 60 
ggacaggaag aaccacagat accagatacg ggtactgttg taactctgtt ctcc atg 117 

Met 



aaa     aag gaa gaa aca aca ctt tca gag atg gag cct gtt gag cca 165
Lys Lys Lys Glu Glu Thr Thr Leu Ser Glu Met Glu Pro Val Glu Pro
            5                   10                  15

cag tac caa cta gtc aat gct gaa tcg act tct ccc ttt cta cat tgc 213
Gln Tyr Gln Leu Val Asn Ala Glu Ser Thr Ser Pro Phe Leu His Cys
        20                  25                  30

ctg     gaa gtc att ggg gaa tac tct gta cac gaa ttt tca ctg ttg 261
Leu Arg Glu Val Ile Gly Glu Tyr Ser Val His Glu Phe Ser Leu Leu
    35                  40                  45

ggg aaa aca gag agt caa ggg att gga ttg tgg att gca ttg gtg gtt 309
Gly Lys Thr Glu Ser Gln Gly Ile Gly Leu Trp Ile Ala Leu Val Val
50                  55                  60                  65

ttc ctc agt ttc ctc atc ttc tcc aca agt ttc tac ata tcg     gca 357
Phe Leu Ser Phe Leu Ile Phe Ser Thr Ser Phe Tyr Ile Ser Asn Ala
                70                  75                  80

gag cag ccc ttc ttc aaa gaa cct cct acg gaa gct gct aag gaa ctc 405
Glu Gln Pro Phe Phe Lys Glu Pro Pro Thr Glu Ala Ala Lys Glu Leu
            85                  90                  95

agt ctg tagctctgcg tggagccatg tgtaaacact gaactgagac ctgccacctc 461
Ser Leu

ctactaccta agggcccatt ttcatctgat atcatccccc agaaacaaac tcatgatgac 521
ttccatgttt tttttagatt agatacatgg agaattttcc tttcccttag aattaaaatc 581
ctgcattcta aaaaaaaaaa aaaat 606



<210> 100 
<211> 648 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 125..508

<220>
<221> polyA_site
<222> 633..648

<220>
<221> polyA_signal
<222> 610..615



<400> 100 

agctgcctgg gtaaggccca agatggctgt cttcgcctta gtactcgtgt gaagttggca 60 

gggacggttc ctgtcatctt cttgggctta tttggtgtgc tgttgaaggg gggagactag 120
agaa atg gca ggg aac ctc tta tcc ggg gca ggt agg cgc ctg tgg gac 169
     Met Ala Gly Asn Leu Leu Ser Gly Ala Gly Arg Arg Leu Trp Asp
     1               5                   10                  15

tgg gtg cct ctg gcg tgc aga agc ttc tct ctt ggt gtg cct     ttg 217
Trp Val Pro Leu Ala Cys Arg Ser Phe Ser Leu Gly Val Pro Arg Leu
                20                  25                  30

atc ggt ata agg ctc act ctc ccg ccc ccc     gtg gtt gat cgt tgg 265
Ile Gly Ile Arg Leu Thr Leu Pro Pro Pro Lys Val Val Asp Arg Trp
            35                  40                  45

aac gag     agg gcc atg ttc gga gtg tat gac     atc ggg atc ctg 313
Asn Glu Lys Arg Ala Met Phe Gly Val Tyr Asp     Ile Gly Ile Leu
        50                  55                  60

gga aac ttt gaa aag cac ccc aaa gaa ctg atc agg ggg ccc ata tgg 361
Gly Asn Phe Glu Lys His Pro Lys Glu Leu Ile Arg Gly Pro Ile Trp
    65                  70                  75

ctt     ggt tgg aaa ggg aat gaa ttg     cgt tgt atc cga aag agg 409
Leu Arg Gly Trp Lys Gly Asn Glu Leu Gln Arg Cys Ile Arg Lys Arg
80                  85                  90                  95

aaa atg gtt gga agt     atg ttc gct     gac ctg cac aac ctt aat 457
Lys Met Val Gly Ser Arg Met Phe Ala Asp Asp Leu His Asn Leu Asn
                100                 105                 110

aaa cgc atc cgc tat ctc tac     cac ttt aac cga cat ggg aag ttt 505
Lys Arg Ile Arg Tyr Leu Tyr Lys His Phe Asn Arg His Gly Lys Phe
            115                 120                 125

cga tagaagagaa agctgagaac ttcggaaaag gctcatctgt caccctggag 558
Arg

aagggaaact gtacttttcc ctgtgaggaa acggctttgt attttctctg taataaaatg 618
gggcttcttt ggacaaaaaa aaaaaaaaaa 648



<210> 101 
<211> 491 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 50..205

<220>
<221> polyA_site
<222> 469..491

<400> 101 

actctttcct tggctgctca ggtcataggt gctgtgatct agagacaaa atg ttc ctt 58 

Met Phe Leu 
1 

aca ctg gca gat act tgc aaa cta agg gga atg agc ttc ctt ctg aat 106
Thr Leu Ala Asp Thr Cys Lys Leu Arg Gly Met Ser Phe Leu Leu Asn
5 10 15

gtt tat gaa gga gag gcc act gtg tca tct gtc tta gag cta ttg gaa 154
Val Tyr Glu Gly Glu Ala Thr Val Ser Ser Val Leu Glu Leu Leu Glu
20 25 30 35

tcc tgg atc att gtg gga aat gaa aga tac ttc gat gga atc agc agc 202
Ser Trp Ile Ile Val Gly Asn Glu Arg Tyr Phe Asp Gly Ile Ser Ser
40 45 50

cat tgatccaatg ccaactccaa gactggaacg tcgcaatgat agttccaagg 255
His

cggaaatttg acgtaattct tttcgacaca gttttacagg tctggatgcc cattttaatt 315
cttctgaaag catgcctcct ccttctggct tcaggactcc atctccagcc tcttgatcta 375
aaaataatcc ccaaaccaaa aaattagata ctatttcctc aaaattaggt attttaatca 435
aaacatctta acataaatac attattatca cccaaaaaaa aaaaaaaatg cgaaaa 491

<210> 102 
<211> 565 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 90..341

<220>
<221> polyA_site
<222> 534..565

<220>
<221> polyA_signal
<222> 515..520

<400> 102 



gttgagggag tcgggccgcg actgtggtcg tttttatacc ttcccgcgcg gacgccggcg 60
ctgccaacgg aagggcgggc tctgctgcc atg aag gtg aag att aag tgc tgg 113
                                Met Lys Val Lys Ile Lys Cys Trp
                                1               5

aac ggc gtg gcc act tgg ctc tgg gtg gcc         gag aac tgt ggc 161
Asn Gly Val Ala Thr Trp Leu Trp Val Ala Asn Asp Glu Asn Cys Gly
    10                  15                  20

atc tgc agg atg gca ttt aac gga tgc tgc cct gac tgc     gtg ccc 209
Ile Cys Arg Met Ala Phe Asn Gly Cys Cys Pro Asp Cys Lys Val Pro
25                  30                  35                  40

ggc gac gac tgc ccg ctg gtg tgg ggc cag tgc tcc cac tgc ttc cac 257
Gly Asp Asp Cys Pro Leu Val Trp Gly Gln Cys Ser His Cys Phe His
                45                  50                  55

atg cat tgc atc ctc aag tgg ctg cac gca cag cag gtg     cag cac 305
Met His Cys Ile Leu Lys Trp Leu His Ala Gln Gln Val Gln Gln His
            60                  65                  70

tgc ccc     tgc cgc cag gaa tgg aag ttc aag gag tgaggcccga 351
Cys Pro Met Cys Arg Gln Glu Trp Lys Phe Lys Glu
        75                  80
cctggctctc gctggagggg catcctgaga ctccttcctc atgctggcgc cgatggctgc 411 
tggggacagc gcccctgagc tgcaacaagg tggaaacaag ggctggagct gcgtttgttt 471 
tgccatcact atgttgacac ttttatccaa taagtgaaaa ctcattaaac tactcaaatc 531 
ttaaaaaaaa aaaaaaagaa aaaaaaaaaa aaaa 565 

<210> 103 
<211> 661 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 169..420

<220>
<221> polyA_site
<222> 614..661

<220>
<221> polyA_signal
<222> 594..599

<400> 103 

gagactcggc gggcgctgtt gagggagtcg gcccgcgact gtggtcgttt ttataccttc 60 
ccgcgcggac gccggcctgc caacggaagg gcggagacgg agtttcgtca tgttggccag 120 
gcccatttga gatctttgaa gatatcctca acgtgaggct ctgctgcc atg aag gtg 177 

Met Lys Val 



aag att aag tgc tgg aac ggc gtg gcc act tgg ctc tgg gtg gcc aac 225
Lys Ile Lys Cys Trp Asn Gly Val Ala Thr Trp Leu Trp Val Ala Asn
    5                   10                  15

gat gag aac tgt ggc atc tgc agg atg gca ttt aac gga tgc tgc cct 273
Asp Glu Asn Cys Gly Ile Cys Arg Met Ala Phe Asn Gly Cys Cys Pro
20                  25                  30                  35

gac tgc aag gtg ccc ggc gac gac tgc ccg ctg gtg tgg ggc cag tgc 321
Asp Cys Lys Val Pro Gly Asp Asp Cys Pro Leu Val Trp Gly Gln Cys
                40                  45                  50

tcc cac tgc ttc cac atg cat tgc atc ctc     tgg ctg cac gca     369
Ser His Cys Phe His Met His Cys Ile Leu Lys Trp Leu His Ala Gln
            55                  60                  65

cag gtg     cag cac tgc ccc atg tgc cgc         tgg aag ttc aag 417
Gln Val Gln Gln His Cys Pro Met Cys Arg Gln Glu Trp Lys Phe Lys
        70                  75                  80

gag tgaggcccga cctggctctc gctggagggg catcctgaga ctccttcctc 470 
Glu 

atgctggcgc cgatggctgc tggggacagc gcccctgagc tgcaacaagg tggaaacaag 530 

ggctggagct gcgtttgttt tgccatcact atgttgacac ttttatccaa taagtgaaaa 590 

ctcattaaac tactcaaatc ttgaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaagaaaaa 650 

aaaaaaaaaa a 661 

<210> 104 
<211> 609 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 144..395

<220>
<221> polyA_site

<220>
<221> polyA_signal
<222> 569..574

<400> 104 

gtttttatac cttcccgcgc ggacgccggc gctgccaacg gaagggcggg tagggcggtg 60 
cgtgattaga gacggagttt cgtcatgttg gccaggccca tttgagatct ttgaagatat 120 
cctcaacgtg aggctctgct gcc atg aag gtg aag att aag tgc tgg aac ggc 173
                          Met Lys Val Lys Ile Lys Cys Trp Asn Gly
                          1               5                   10

gtg gcc     tgg     tgg gtg gcc     gat gag aac tgt ggc atc tgc 221
Val Ala Thr Trp Leu Trp Val Ala Asn Asp Glu Asn Cys Gly Ile Cys
                15                  20                  25

agg atg gca ttt aac gga tgc tgc cct gac tgc aag gtg ccc ggc gac 269
Arg Met Ala Phe Asn Gly Cys Cys Pro Asp Cys Lys Val Pro Gly Asp
            30                  35                  40

gac tgc ccg ctg gtg tgg ggc cag tgc tcc cac tgc ttc cac atg cat 317
Asp Cys Pro Leu Val Trp Gly Gln Cys Ser His Cys Phe His Met His
        45                  50                  55

tgc atc ctc aag tgg ctg cac gca cag cag gtg cag cag cac tgc ccc 365
Cys Ile Leu Lys Trp Leu His Ala Gln Gln Val Gln Gln His Cys Pro
    60                  65                  70

atg tgc cgc cag gaa tgg aag ttc     gag tgaggcccga cctggctctc 415
Met Cys Arg Gln Glu Trp Lys Phe Lys Glu
75                  80





















gctggagggg catcctgaga ctccttcctc atgctggcgc cgatggctgc tggggacagc 475 

gcccctgagc tgcaacaagg tggaaacaag ggctggagct gcgtttgttt tgccatcact 535 

atgttgacac ttttatccaa taagtgaaaa ctcattaaac tactcaaatc ttgaaaagaa 595 

aaaaaaaaaa aaaa 609 

<210> 105 
<211> 635 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 177..428

<220>
<221> polyA_site
<222> 620..635

<220>
<221> polyA_signal
<222> 602..607

<400> 105 

aagactcggc gggcgctgtt gagggagtcg ggccgcgact gtggtcgttt ttataccttc 60 
ccgcgcggac gccggcgctg ccaacggaag ggcgggtagg gcgagacgga gtttcgtcat 120 
gttggccagg cccatttgag atctttgata tatcctcaac gtgaggctct gctgcc atg 179 

Met 
1 

aag gtg aag att aag tgc tgg aac ggc gtg gcc act tgg ctc tgg gtg 227
Lys Val Lys Ile Lys Cys Trp Asn Gly Val Ala Thr Trp Leu Trp Val
5 10 15

gcc aac gat gag aac tgt ggc atc tgc agg atg gca ttt aac gga tgc 275
Ala Asn Asp Glu Asn Cys Gly Ile Cys Arg Met Ala Phe Asn Gly Cys
20 25 30

tgc cct gac tgc aag gtg ccc ggc gac gac tgc ccg ctg gtg tgg ggc 323
Cys Pro Asp Cys Lys Val Pro Gly Asp Asp Cys Pro Leu Val Trp Gly
35 40 45

cag tgc tcc cac tgc ttc cac atg cat tgc atc ctc aag tgg ctg cac 371
Gln Cys Ser His Cys Phe His Met His Cys Ile Leu Lys Trp Leu His
50 55 60 65

gca cag cag gtg cag cag cac tgc ccc atg tgc cgc cag gaa tgg aag 419
Ala Gln Gln Val Gln Gln His Cys Pro Met Cys Arg Gln Glu Trp Lys
70 75 80

ttc aag gag tgaggcccga cctggctctc gctggagggg catcctgaga 468

Phe Lys Glu 

ctccttcctc atgctggcgc cgatggctgc tggggacagc gcccctgagc tgcaacaagg 528 
tggaaacaag ggctggagct gcgtttgttt tgccatcact atgttgacac ttttatccaa 588 
taagtgaaaa ctcattaaac tactcaaatc caaaaaaaaa aaaaaaa 635 

<210> 106 
<211> 573 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 114..365

<220>
<221> polyA_site
<222> 558..573

<220>
<221> polyA_signal
<222> 539..544

<400> 106 

ggcgtgcgag actcggcggg cgctgttgag ggagtcgggc cgcgactgtg gtcgttttta 60 
taccttcccg cgcggacgcc ggcgctgcca acggaagggc gggctctgct gcc atg 116
                                                           Met
                                                           1

    gtg aag att aag tgc tgg aac ggc gtg gcc act tgg ctc tgg gtg 164
Lys Val Lys Ile Lys Cys Trp Asn Gly Val Ala Thr Trp Leu Trp Val
            5                   10                  15

gcc aac gat gag aac tgt ggc atc tgc agg atg gca ttt aac gga tgc 212
Ala Asn Asp Glu Asn Cys Gly Ile Cys Arg Met Ala Phe Asn Gly Cys
        20                  25                  30

tgc cct gac tgc aag gtg ccc ggc gac gac tgc ccg ctg gtg tgg ggc 260
Cys Pro Asp Cys Lys Val Pro Gly Asp Asp Cys Pro Leu Val Trp Gly
    35                  40                  45

cag tgc tcc cac tgc ttc cac atg cat tgc atc ctc aag tgg ctg cac 308
Gln Cys Ser His Cys Phe His Met His Cys Ile Leu Lys Trp Leu His
50                  55                  60                  65

gca cag cag gtg cag cag cac tgc ccc atg tgc cgc cag gaa tgg aag 356
Ala Gln Gln Val Gln Gln His Cys Pro Met Cys Arg Gln Glu Trp Lys
                70                  75                  80



ttc aag gag tgaggcccga cctggctctc gctggagggg catcctgaga 405 

Phe Lys Glu 

ctccttcctc atgctggcgc cgatggctgc tggggacagc gcccctgagc tgcaacaagg 465 

tggaaacaag ggctggagct gcgtttgttt tgccatcact atgttgacac ttttatccaa 525 

taagtgaaaa ctcattaaac tactcaaatc ctaaaaaaaa aaaaaaaa 573

<210> 107 
<211> 689 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS

<220>
<221> polyA_site
<222> 674..689

<220>
<221> polyA_signal
<222> 644..649

<400> 107 

tttcgccgcc aaagcatcca gcagccccct gctccggccc agc atg gcg acc ccg 55
                                                Met Ala Thr Pro
                                                1

acc cag acc ccc aca aag gct cct gag gaa cct gac cca ttt tac tat 103
Thr Gln Thr Pro Thr Lys Ala Pro Glu Glu Pro Asp Pro Phe Tyr Tyr
5                   10                  15                  20

gac tac aac acg gtg cag act gtg ggc atg act ctg gca acc atc ttg 151
Asp Tyr Asn Thr Val Gln Thr Val Gly Met Thr Leu Ala Thr Ile Leu
                25                  30                  35

ttc     ctg ggt atc ctc atc gtc atc agc aag aag gtg aag tgc agg 199
Phe Leu Leu Gly Ile Leu Ile Val Ile Ser Lys Lys Val Lys Cys Arg
            40                  45                  50

aag gcg gac tcc agg tct gag             tgc     tcc tgt aag tct 247
Lys Ala Asp Ser Arg Ser Glu Ser Pro Thr Cys     Ser Cys Lys Ser
        55                  60                  65

gag ctt ccc tct tca gcc cct ggt ggc ggc ggc gtg taacaccttc 293
Glu Leu Pro Ser Ser Ala Pro Gly Gly Gly Gly Val
    70                  75                  80













ccgaggaaac tccgctgccg accctgcctg agcgcgggag cctgaggacc gggtggaggc 353 

ggtggggacc cagccgcgcg ccgggagcgc tccccggaat gagccgcccc acccacccca 413 

aggctggagc cgctgcaccc tgccgtccct ctccaggcct tggcaatgac gatcccccaa 473 

agagcccgtc tgcaccccag acccagggcc tcaggcctcc agctcctggg atccgggagt 533 

ccatcccggc ccagcacccc cagcatcccc gtgtatggcc cccctgcacc tccttgtctc 593 

atccccgaag atccgtcccc ctggcccctc agtgtccatg tcttgagctt aataaatgtg 553 

catttggttt tttcctctgc aaaaaaaaaa aaaaaa 689 



<210> 108 
<211> 561 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 99..479

<220>
<221> polyA_site
<222> 546..561

<220>
<221> polyA_signal
<222> 526..531



<400> 108 

tttcgaaggg aagggagccc cctataaaac agcctacagt ggacagtctg gtcggcagag 60 
ccgcaggtca gtcgtgaaga gggagctcta ttgccacc atg agt ttc tcc ggc aag 116
                                          Met Ser Phe Ser Gly Lys
                                          1               5

tac caa ctg cag agc cag gaa aac ttt gaa gcc ttc atg aag gca atc 164
Tyr Gln Leu Gln Ser Gln Glu Asn Phe Glu Ala Phe Met Lys Ala Ile
            10                  15                  20

ggt ctg ccg gaa gag ctc atc cag aag ggg aag gat atc aag ggg gtg 212
Gly Leu Pro Glu Glu Leu Ile Gln Lys Gly Lys Asp Ile Lys Gly Val
        25                  30                  35

tcg         gtg cag     ggg     cac ttc aag ttc acc atc acc gct 260
Ser Glu Ile Val Gln Asn Gly Lys His Phe Lys Phe Thr Ile Thr Ala
    40                  45                  50

ggg tcc     gtg atc     aac gaa ttc acg gtg ggg gag gaa tgt gag 308
Gly Ser Lys Val Ile Gln Asn Glu Phe Thr Val Gly Glu Glu Cys Glu
55                  60                  65                  70

ctg gag aca atg aca ggg gag aaa gtc aag aca gtg gtt cag ttg gaa 356
Leu Glu Thr Met Thr Gly Glu Lys Val Lys Thr Val Val Gln Leu Glu
                75                  80                  85

ggt gac aat aaa ctg gtg aca act ttc aaa aac atc aag tct gtg acc 404
Gly Asp Asn Lys Leu Val Thr Thr Phe Lys Asn Ile Lys Ser Val Thr
            90                  95                  100

gaa ctc aac ggc gac ata atc acc aat acc atg aca ttg ggt gac att 452
Glu Leu Asn Gly Asp Ile Ile Thr Asn Thr Met Thr Leu Gly Asp Ile
        105                 110                 115

gtc ttc aag aga atc agc aag aga att taaacaagtc tgcatttcat 499
Val Phe Lys Arg Ile Ser Lys Arg Ile
    120                 125
attattttag tgtgtaaaat taatgtaata aagtgaactt tgttttaaaa aaaaaaaaaa 559 
aa 561 



<210> 109 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 1. .251 

<220> 

<221> polyA_site 
<222> 724. .738 

<220> 

<221> polyA_signal 
<222> 712. .717 

<400> 109 





    gca gta gcc gtg ggg aag ttt tcg caa tgg ccg ccg gaa cgg tcg 48
Asp Ala Val Ala Val Gly     Phe Ser Gln Trp Pro Pro Glu Arg Ser
1               5                   10                  15

ccg gcc gat gct ctc tgc gac gga aag aaa gcc gcc gga ccc act tcc 96
Pro Ala Asp Ala Leu Cys Asp Gly Lys Lys Ala Ala Gly Pro Thr Ser
            20                  25                  30

ggt gcg ttt tac aat gaa ctc cgg gca ctc gag tcg ctc ccc gga aac 144
Gly Ala Phe Tyr Asn Glu Leu Arg Ala Leu Glu Ser Leu Pro Gly Asn
        35                  40                  45

tat agt tct gct tcc acc gtt gtc cta ctt tgg ata gca tca aga caa 192
Tyr Ser Ser Ala Ser Thr Val Val Leu Leu Trp Ile Ala Ser Arg Gln
    50                  55                  60

    tct     tct tcg agg aac     cag tgt agc aga aaa     aaa gga 240
Lys Ser Gln Ser Ser Arg Asn Ser Gln Cys Ser Arg Lys Asp Lys Gly
65                  70                  75                  80

ggt aaa cag gag gag cat gaa taattcatct tgatgtaatc atgcaagact 291
Gly Lys Gln Glu Glu His Glu
                85

cctaagaaga tgttaggaga ccctgtcatc attcttttgg cctctagtgt cctttatctt 351
catgtttgct ggcttccttt tactactatg gaaatacaac atatgcaaaa gtagtgtttt 411
cgctgtttaa aaacatcgtc agcagcatca tcatcattaa ctgcaccagg aagccatgac 471
tcctttctgg acaaggtgca cttgcctgtc atccttccat ctctttcttc ctccttgaag 531
ttcacaatac ccagtccagg agcattgcag agaatgctag gtgtgtctgc agacttgact 591

ttaaaagaaa acaagttcct tcaagtgagt tcatcatgat gggaatgata atttgtttga 651 

gagaatgtgt gttctttggc aaaggaactc ctgttatacc atgacagtaa taatagcagt 711 
aataaaatca ccaaaaaaaa aaaaaaa 738 

<210> 110 
<211> 584 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 88. .375 

<220> 

<221> polyA_site 
<222> 567. .584 

<220> 

<221> polyA_signal 
<222> 546. .551 

<400> 110 

gagtcagtct cgccagctgc cggtctttcg ggggctccgt aactttctat ccgtccgcgt 60 
cagcgcyttg ccaccctcat ctccaat atg cct ggt ccg acc ccc agt ggc act 114 

Met Pro Gly Pro Thr Pro Ser Gly Thr 

1 5 



aac gtg gga tcc tca ggg cgc tct     agc aaa gca gtg gcc gcc cgg 162
Asn Val Gly Ser Ser Gly Arg Ser Pro Ser Lys Ala Val Ala Ala Arg
10                  15                  20                  25

gcg gcg gga tcc act gtc cgg cag agg     aat gcc agc tgt ggg aca 210
Ala Ala Gly Ser Thr Val Arg Gln Arg Lys Asn Ala Ser Cys Gly Thr
                30                  35                  40

agg agt gca ggc cgc aca acc tcg gca ggc acc ggg ggg atg tgg cga 258
Arg Ser Ala Gly Arg Thr Thr Ser Ala Gly Thr Gly Gly Met Trp Arg
            45                  50                  55

ttc tac     gaa gat tca cct ggg ctc aaa gtt ggc     gtt     gta 306
Phe Tyr Thr Glu Asp Ser Pro Gly Leu Lys Val Gly Pro Val Pro Val
        60                  65                  70

ttg gtt atg agt ctt ctg ttc atc gct tct gta ttt atg ttg cac att 354
Leu Val Met Ser Leu Leu Phe Ile Ala Ser Val Phe Met Leu His Ile
    75                  80                  85

tgg ggc aag tac act cgt tcg tagattcagt tacatccatc tgtcatctga 405
Trp Gly Lys Tyr Thr Arg Ser
90                  95

agaaggagga aaaaacccaa catttcttgg accaaaagta tagtgactat ctgttcatga 465

gagaaatttt ctgtaagctt gctgttttac aggggattta tcaataattg attttgagga 525 

atcagttttt ttctatggct aataaacttt ttaattcact tataaaaaaa aaaaaaaaa 584 

<210> 111 
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 

<220> 

<221> polyA_site 
<222> 289. .588 



<220> 
<221> polyA_signal
<222> 267. .272 

<400> 111 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253
tgctgttgac            tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg                       ccctaaaaaa gctgcggagc cactgtggat 433
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588



<210> 112 
<211> 589 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16..183

<220> 

<221> polyA_site 
<222> 289. .589 

<220> 

<221> polyA_signal
<222> 267. .272 

<400> 112 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gccttttgta tcctctcctt ctctgcaagc 253
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc ccactgtgga 433
tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 493
agtggaagag ttgaccttca tgtactgctt tccaaaaaat tgatcctagt tgttattgaa 553
taaattgttg tctgttttaa gaaaaaaaaa aaaaaa 589



<210> 113 
<211> 588
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 

<220> 

<221> polyA_site 
<222> 289. .588 

<220> 

<221> polyA_signal 
<222> 267. .272 

<400> 113 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggasc cactgtggat 433
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588 

<210> 114 
<211> 590 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 

<220> 

<221> polyA_site 
<222> 289. .590 

<220> 

<221> polyA_signal 
<222> 267. .272 

<400> 114 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 

tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 

agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 

tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc ccactgtgga 433 

tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 493

agtggaagag ttgaccttca tgtactgctt tccaaaaaat tgatcctagt tgttattgaa 553 

taaattgttg tctgttttaa gaaaaaaaaa aaaaaat 590 



<210> 115 
<211> 595 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 

<220> 

<221> polyA_site 
<222> 572. .595 

<220> 

<221> polyA_signal 
<222> 552. .557 

<400> 115 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc ccactgtgga 433
tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 493
agtggaagag ttgaccttca tgtactgctt tccaaaaaat tgatcctagt tgttattgaa 553
taaattgttg tctgttttaa gaaaaaaaaa aaaaaatgcg 595



<210> 116 
<211> 588 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. .183 

<220> 

<221> polyA_site 
<222> 289. .588 
<220>

<221> polyA_signal 
<222> 267. .272 

<400> 116 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggasc cactgtggat 433
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588



<210> 117 
<211> 588 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 15. .183 

<220> 

<221> polyA_site 
<222> 289. .588 

<220> 

<221> polyA_signal
<222> 267. .272 

<400> 117 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc cactgtggat 433
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588

<210> 118
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 

<220> 

<221> polyA_site 
<222> 289. .588 

<220> 

<221> polyA_signal 
<222> 267. .272 

<400> 118 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc cactgtggat 433
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588

<210> 119 
<211> 595 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 

<220> 

<221> polyA_site 
<222> 571. .595 

<220> 

<221> polyA_signal 
<222> 551. .556 

<400> 119 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

                                gaa aaa atg cca tagctattac 193
                                Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc cactgtggat 433 
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaatgcga aa 595 



<210> 120 
<211> 601 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 



<220> 

<221> polyA_site 
<222> 572. .601 



<220> 

<221> polyA_signal
<222> 552. .557 



<400> 120 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc ccactgtgga 433
tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 493
agtggaagag ttgaccttca tgtactgctt tccaaaaaat tgatcctagt tgttattgaa 553
taaattgttg tctgttttaa gaaaaaaaaa aaaaaatgcg aaaagctt 601 



<210> 121 
<211> 589 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 
<222> 16. .183 



<220> 

<221> polyA_site 
<222> 289. .589 
<220>

<221> polyA_signal 
<222> 267. .272 

<400> 121 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc ccactgtgga 433
tctttttgga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 493
agtggaagag ttgaccttca tgtactgctt tccaaaaaat tgatcctagt tgttattgaa 553
taaattgttg tctgttttaa gaaaaaaaaa aaaaaa 589



<210> 122 
<211> 595 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 

<220> 

<221> polyA_site 
<222> 571. .595 

<220> 

<221> polyA_signal 
<222> 551. .556 

<400> 122 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggasc cactgtggat 433
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553
aaattgttgt ctgttttaag aaaaaaaaaa aaaaatgcga aa 595

<210> 123
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 

<220> 

<221> polyA_site 
<222> 289. .588 

<220> 

<221> polyA_signal 
<222> 267. .272 

<400> 123 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc cactgtggat 433
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588 

<210> 124 
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 

<220> 

<221> polyA_site 
<222> 289. .588 

<220> 

<221> polyA_signal 
<222> 267. .272 

<400> 124 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc cactgtggat 433 
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588 



<210> 125 
<211> 589 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16..183

<220> 

<221> polyA_site
<222> 289. .589 

<220> 

<221> polyA_signal 
<222> 267. .272 

<400> 125 

tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc ccactgtgga 433
tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 493
agtggaagag ttgaccttca tgtactgctt tccaaaaaat tgatcctagt tgttattgaa 553
taaattgttg tctgttttaa gaaaaaaaaa aaaaaa 589

<210> 126 
<211> 663 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 86. .451 

<220> 

<221> polyA_site 
<222> 622..663

<220>

<221> polyA_signal
<222> 597. .602 

<400> 126 

acacatccaa gcttaagacg gtgaggtcag cttcacattc tcaggaactc tccttctttg 60 



ggccacggaa ttaacccgag caggc atg gag gcc tct gct ctc acc tca tca 112
                            Met Glu Ala Ser Ala Leu Thr Ser Ser
                            1               5

gca gtg acc agt gtg gcc aaa gtg gtc agg gtg gcc tct ggc tct gcc 160
Ala Val Thr Ser Val Ala Lys Val Val Arg Val Ala Ser Gly Ser Ala
10                  15                  20                  25

gta gtt ttg ccc ctg gcc agg att gct aca gtt gtg att gga gga gtt 208
Val Val Leu Pro Leu Ala Arg Ile Ala Thr Val Val Ile Gly Gly Val
                30

gtg gcc atg gcg gct gtg ccc atg gtg ctc agt gcc atg ggc ttc act 256
Val Ala Met Ala Ala Val Pro Met Val Leu Ser Ala Met Gly Phe Thr
            45                  50                  55

gcg gcg gga atc         tcc     ata     gcc aag atg atg tcc gcg 304
Ala Ala Gly Ile Ala Ser Ser Ser Ile Ala Ala Lys Met Met Ser Ala
        60                  65                  70

gcg gcc att gcc aat ggg ggt gga gtt gcc tcg ggc agc ctt gtg gct 352
Ala Ala Ile Ala Asn Gly Gly Gly Val Ala Ser Gly Ser Leu Val Ala
    75                  80                  85

act ctg cag tca ctg gga gca act gga ctc tcc gga ttg acc aag ttc 400
Thr Leu Gln Ser Leu Gly Ala Thr Gly Leu Ser Gly Leu Thr Lys Phe
90                  95                  100                 105

atc ctg ggc tcc att ggg tct gnc att gcg gct gtc att gcg agg ttc 448
Ile Leu Gly Ser Ile Gly Ser Xaa Ile Ala Ala Val Ile Ala Arg Phe
                110                 115                 120

tac tagctccctg cccctcgccc tgcagagaag agaaccatgc caggggagaa 501
Tyr

ggcacccagc catcctgacc cagcgaggag ccaactatcc caaatatacc tggggtgaaa 561

tataccaaat tctgcatctc cagaggaaaa taagaaataa agatgaattg ttgcaactct 621 

aaaaaaaaaa aaaaaaaaaa aaaaacaaaa aaaaaaaaaa aa 663 

<210> 127 
<211> 633 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 88. .453 

<220> 

<221> polyA_site 
<222> 618. .633 

<220> 

<221> polyA_signal 
<222> 599. .604 

<400> 127 

aaacacatcc aagcttaaga cggtgaggtc agcttcacat tctcaggaac tctccttctt 60
tgggccacgg aattaacccg agcaggc atg gag gcc tct gct ctc acc tca tca 114
                              Met Glu Ala Ser Ala Leu Thr Ser Ser
                              1               5

gca gtg acc agt gtg gcc aaa gtg gtc agg gtg gcc tct ggc tct gcc 162
Ala Val Thr Ser Val Ala Lys Val Val Arg Val Ala Ser Gly Ser Ala
10                  15                  20                  25

gta gtt ttg ccc ctg gcm agg att gct aca gtt gtg att gga gga gtt 210
Val Val Leu Pro Leu Ala Arg Ile Ala Thr Val Val Ile Gly Gly Val
                30                  35                  40

                                                atg ggc ttc act 258
                                                Met Gly Phe Thr
            45                  50                  55

gcg gcg gga atc gcc tcg tcc tcc ata gca gcc aag atg atg tcc gcg 306
Ala Ala Gly Ile Ala Ser Ser Ser Ile Ala Ala Lys Met Met Ser Ala
        60                  65                  70

gcg gcc att gcc aat sgg ggt gga gtt gcc tcg ggc agc ctt gtg gct 354
Ala Ala Ile Ala Asn Xaa Gly Gly Val Ala Ser Gly Ser Leu Val Ala
    75                  80                  85

act ctg cag tca ctg gga gca act gga ctc tcc gga ttg acc aag ttc 402
Thr Leu Gln Ser Leu Gly Ala Thr Gly Leu Ser Gly Leu Thr Lys Phe
90                  95                  100                 105

atc ctg ggc tcc att ggg tct gcc att gcg gct gtc att gcg agg ttc 450
Ile Leu Gly Ser Ile Gly Ser Ala Ile Ala Ala Val Ile Ala Arg Phe
                110                 115                 120

tac tagctccctg cccctcgccc tgcagagaag agaaccatgc caggggagaa 503
Tyr

ggcacccagc catcctgacc cagcgaggag ccaactatcc caaatatacc tggggtgaaa 563 
tataccaaat tctgcatctc cagaggaaaa taagaaataa agatgaattg ttacaaaaaa 623 
aaaaaaaaaa 633 



<210> 128 
<211> 588 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 15. .236 



<220> 

<221> polyA_site 
<222> 573. .588 



<220> 

<221> polyA_signal 
<222> 551. .556 



gaaaattttc cact     gct tcc agc act gtc ccg gtg agc gct gct ggc 50
                Met Ala Ser Ser Thr Val Pro Val Ser Ala Ala Gly
                1               5                   10

tcg gct         act ccc gaa ata ccg gac aac gtg gga gat tgg ctt 98
Ser Ala Asn Glu Thr Pro Glu Ile Pro Asp Asn Val Gly Asp Trp
        15                  20                  25

cgg ggc gtc tac cgc ttt gcc act gat agg aat gac ttc cgg agg aac 146
Arg Gly Val Tyr Arg Phe Ala Thr Asp Arg Asn Asp Phe Arg Arg
    30                  35                  40

ttg ata cta aat ttg gga ctc ttt gct gcg gga gtt tgg ctg gcc agg 194
Leu Ile Leu Asn Leu Gly Leu Phe Ala Ala Gly Val Trp Leu Ala Arg
45                  50                  55                  60

aac ttg agt gac att gac ctc atg gca cct cag cca ggg gtg 236
Asn Leu Ser Asp Ile Asp Leu Met Ala Pro Gln Pro Gly Val
                65                  70

tagccaagta gacaaatgga atcctgtgct gaacccgaat cttccaaaaa acagcctaca 296
atctgtgacc accacaagat gtgccctgat ggcagctgaa gtttgattca gatgggcact 356
tttcttcccc ttccctgcct agtttccttt tgttccttga gtccacgcag aattccattc 416
tctggtcagc agacaggctt aagctaaagt attgcctcta ttctgtaaag ttctgtacat 476
agttcccaag cttctgcagg gggtgatttt tgctcttgtc ctgagaaata acagtgctgt 536
tttaaaaaac ataaaataaa taccgcacac aaagacaaaa aaaaaaaaaa aa 588

<210> 129
<211> 608 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 39. .260 

<220> 

<221> polyA_site 
<222> 589. .608 

<220> 

<221> polyA_signal 
<222> 577. .582 

<400> 129 

acggttcctc caccgaggcc catgcgaagc tttccact

ccg gtg agc gct gct ggc tcg gct aat gaa act ccc gaa ata ccg gac
Pro Val Ser Ala Ala Gly Ser Ala Asn Glu Thr Pro Glu Ile Pro Asp
            10                  15                  20

aac gtg gga gat tgg ctt cgg ggc gtc tac cgc ttt gcc act gat agg
Asn Val Gly Asp Trp Leu Arg Gly Val Tyr Arg Phe Ala Thr Asp Arg
        25                  30                  35

aat gac ttc cgg agg aac ttg ata cta aat ttg gga ctc ttt gct gcg
Asn Asp Phe Arg Arg Asn Leu Ile Leu Asn Leu Gly Leu Phe Ala Ala
    40                  45                  50

gga gtt tgg ctg gcc agg aac ttg agt gac att gac ctc atg gca cct
Gly Val Trp Leu Ala Arg Asn Leu Ser Asp Ile Asp Leu Met Ala Pro
55                  60                  65                  70

cag cca ggg gtg tagccaagta gacaaatgga atcctgtgct gaacccgaat
Gln Pro Gly Val

cttccaaaaa acagcctaca atctgtgacc accacaagat gtgccctgat ggcagctgaa
gtttgattca gatgggcact tttcttcccc ttccctgcct agtttccttt tgttccttga
gtccacgcag aattccattc tctggtcagc agacaggctt aagctaaagt attgcctcta
ttctgtaaag ttctgtacat agttyccaag cttctgcagg gggtgattkt tgctcttgtc
ctgagaaata acagtgctgt tttaaaaaac atttgaaata aataccgcac acaaaaaaaa
aaaaaaaa 

<210> 130 
<211> 1200 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 29. .469 

<220> 

<221> polyA_site 
<222> 1185. .1200 

<400> 130 

acttggcgag tgagacgctg atgggagg atg gac gta ctg gtg tct gag tgc
                               Met Asp Val Leu Val Ser Glu Cys
                               1               5

tcc gcg cgg ctg ctg cag cag gaa gaa gag att aaa tct ctg act gct
Ser Ala Arg Leu Leu Gln Gln Glu Glu Glu Ile Lys Ser Leu Thr Ala








att 


gac 


egg 


ttg 


aaa 


aac 


tgt 


ggc 


tgt 


tta 


gga 


get tet 


cea 


aat 


148 


Gl^ 


















Cys 








Pro 


Asn 




25 










30 










35 








40 




ttg gag cag tta caa gaa gaa aat tta aaa tta aag tat cga ctg     196
Leu Glu Gln Leu Gln Glu Glu Asn Leu Lys Leu Lys Tyr Arg Leu Asn
                45                  50                  55

att ctt cga aag agt ctt cag gca gaa agg aac aaa cca act     aat 244
Ile Leu Arg Lys Ser Leu Gln Ala Glu Arg     Lys Pro Thr Lys Asn





atg att aac att att agc cgc cta caa gag gtc ttt ggt cat gca att 292
Met Ile Asn Ile Ile Ser Arg Leu Gln Glu Val Phe Gly His Ala Ile
        75                  80                  85

aag gct gca tat cca gat ttg gaa aat cct cct ctg cta gtg aca cca 340
Lys Ala Ala Tyr Pro Asp Leu Glu Asn Pro Pro Leu Leu Val Thr Pro
    90                  95                  100

agt cag cag gcc aag ttt ggg gac tat cag tgt aat agt gct atg ggt 388
Ser Gln Gln Ala Lys Phe Gly Asp Tyr Gln Cys Asn Ser Ala Met Gly
105                 110                 115                 120

att tct cag gtg atg tat tgt cat gac tct tgg ctg ttt gat ttt ttt 436
Ile Ser Gln Val Met Tyr Cys His Asp Ser Trp Leu Phe Asp Phe Phe
                125                 130                 135

aag tat tat tat cat cat tgc cat tta cag aaa taatactatt acaagttgta 489
Lys Tyr Tyr Tyr His His Cys His Leu Gln Lys
            140                 145

tccttagtga aaaggacatt tgccacagtt tgaaaaactt gagaaaggag ttgggggggt 549
atatgtttta acttttttag gcacaatttt taaggtttgg ttaaatttta tatgtattct 609
caatatttaa gggcaatcat tggtactctt ttgtttaggt atttccctcc tgctgtgtcc 669
aggattgctg tgtggtggtg atgagtgctg ggaggtgaaa aattaaaata agccatttac 729
cagtcagcat cccaattaaa tatttgatgt aactgtgatc tttgagccag gcttatatat 789
tcattttcaa gcagaggagt tccccatttt aaatagaggc attgtctgat gtgtttatgg 849
ttaactgcat ctggcttggg tctttctgtt ttcctttctt tgctgaattg gaaggggtta 909
ctctgaagag tccaggtctt acagtgtggt ttatttctca agtgtgaata ttgcacacct 969
tcatggcttg aaaattagaa atgtaaatat gctggaacca gacttcataa ctgtagactt 1029
tgttatgtca tcattaaaaa gttcgaggtg ttgtaatccc agctactagg gaggctgagg 1089
caggagaatt gcttgaaccc aggggcggag gttacagtga gccgagatca caccactgca 1149
ctccagcctg ggcaacagag cgagactcca tctctaaaaa aaaaaaaaaa a 1200



<210> 131 
<211> 646 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 50. .340 



<220> 

<221> polyA_site 
<222> 631. .646 



<220> 

<221> polyA_signal 
<222> 614. .619 



<400> 131 

gaggagaaga ggtgaccccg ctaaagataa taaatcatca ttgatcaag atg tct tca
Met Ser Ser
1

gca cct gag cct cca aca ttc aaa aag gaa cca ccc aaa gaa aaa gag
Ala Pro Glu Pro Pro Thr Phe Lys Lys Glu Pro Pro Lys Glu Lys Glu
5 10 15

ttt caa agc cca ggg ctc aga ggg gtg cgc aca aca acc tta ttt cgt
Phe Gln Ser Pro Gly Leu Arg Gly Val Arg Thr Thr Thr Leu Phe Arg
20 25 30 35

gct gtg aat cca gag ctc ttc att aaa cct aac aaa cct gta atg gct 202
Ala Val Asn Pro Glu Leu Phe Ile Lys Pro Asn Lys Pro Val Met Ala
40 45 50

ttc gga ttg gta act ctt tca ctt tgc gtg gca tat att ggt tat cta 250
Phe Gly Leu Val Thr Leu Ser Leu Cys Val Ala Tyr Ile Gly Tyr Leu
55 60 65

cat gca ata caa gag aat aaa aag gac ctc tat gaa gct att gat agt 298
His Ala Ile Gln Glu Asn Lys Lys Asp Leu Tyr Glu Ala Ile Asp Ser
70 75 80

gag ggg cac agt tat atg cgg cgg aaa aca tcc aaa tgg gat 340
Glu Gly His Ser Tyr Met Arg Arg Lys Thr Ser Lys Trp Asp
85 90 95

tagtagtgct ggttagtgca gatggacctt tattaaaggt tctgaaatct tcaaatgaaa 400
gaccttgtga gtgtacagta tcatgtttct tgttctagaa catgctaatg aagagagaag 460
atagcagttg caaccagaca actgtcgtaa attttgtcct ttcacagctg cagccattat 520
ctcattcttt ttccacagag tgagcgtcat aatatttttc tttccttacc tcttataaaa 580
gtgccatgag aatggaagtt gtttttgtta attattaaat tctgtataat aaaaaaaaaa 640
aaaaat 646



<210> 132 
<211> 575 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 177..470

<220>
<221> polyA_site
<222> 561..575

<220>
<221> polyA_signal
<222> 543..548



<400> 132 

atcgaccacg gaaatttgac acctccgggc ttggaagcag ctctctcctc cttccccgct 60
gcttataaac ctcagccctg aggctccagc tcactctacc ccatctcctt gccgggtcag 120
ccctgacaaa ggtcagctag ccccttgagg acatcagctt tggcctcagg gtccta atg 179
Met

gca gca gaa cca ctg aca gag cta gag gag tcc att gag aac gtg gtc 227
Ala Ala Glu Pro Leu Thr Glu Leu Glu Glu Ser Ile Glu Asn Val Val
5 10 15

acc acc ttc ttc acc ttt gca agg cag gag ggc cgg aak gat agc ctc 275
Thr Thr Phe Phe Thr Phe Ala Arg Gln Glu Gly Arg Xaa Asp Ser Leu
20 25 30

agc gtc aac gag ttc aaa gag ctg gtt acc cag cag ttg ccc cat ctg 323
Ser Val Asn Glu Phe Lys Glu Leu Val Thr Gln Gln Leu Pro His Leu
35 40 45

ctc aag gat gtg ggc tct ctt gat gag aag atg aag agc ttg gtg 371
Leu Lys Asp Val Gly Ser Leu Asp Glu Lys Met Lys Ser Leu Asp Val
50 55 60 65

aat cag gac tcg gag ctc aag ttc aat gag tac tgg aga ttg att ggg 419
Asn Gln Asp Ser Glu Leu Lys Phe Asn Glu Tyr Trp Arg Leu Ile Gly
70 75 80

gag ctg gcc aag gaa atc agg aag aag gac ctk atc agg aag 467
Glu Leu Ala Lys Glu Ile Arg Lys Lys Lys Asp Leu Lys Ile Arg Lys
85 90 95

aag taaagccgcc tggctgagat ggggtgggca gggcagagct gatcagggcc 520
Lys

gagcagaacc gcactcttcc caaataaagc ttcctccttg aaaaaaaaaa aaaaa 575

<210> 133 
<211> 458 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 75. .419 

<400> 133 

ctttcctttc tcgttccccg gccatcttag cggctgctgt tggttggggg ccgtcccgct 60 



cctaaggcag gaag atg gtg gcc gca aag aag acg aag tcg ctg gag 110
Met Val Ala Ala Lys Lys Thr Lys Lys Ser Leu Glu
1 5 10

tcg atc aaa tct agg ctc gtt atg aaa agt ggg aag tac gtc
Ser Ile Lys Ser Arg Leu Gln Leu Val Met Lys Ser Gly Lys Tyr Val
15 20 25

ctg ggg tac aag cag act ctg aag atg atc aga caa ggc gcg aaa 206
Leu Gly Tyr Lys Gln Thr Leu Lys Met Ile Arg Gln Gly Lys Ala Lys
30 35 40

ttg gtc att ctc gct aac aac tgc cca gct ttg agg aaa tct gaa ata 254
Leu Val Ile Leu Ala Asn Asn Cys Pro Ala Leu Arg Lys Ser Glu Ile
45 50 55 60

gag tac tat gct atg ttg gct aaa act ggt gtc cat cac tac agt ggc 302
Glu Tyr Tyr Ala Met Leu Ala Lys Thr Gly Val His His Tyr Ser Gly
65 70 75

aat aat att gaa ctg ggc aca gca tgc gga aaa tac tac gtg tgc 350
Asn Asn Ile Glu Leu Gly Thr Ala Cys Gly Lys Tyr Tyr Arg Val Cys
80 85 90

aca ctg gct atc att gat cca rgt gac tct kac atc att aga agc atg 398
Thr Leu Ala Ile Ile Asp Pro Xaa Asp Ser Xaa Ile Ile Arg Ser Met
95 100 105

cca gaa cag act ggt gaa aag taaacctttt cacctacaaa atttcacctg 449
Pro Glu Gln Thr Gly Glu Lys
110 115

caaacctta 458

<210> 134 
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 144. .449 

<220> 

<221> polyA_site 
<222> 573. .588 

<220> 

<221> polyA_signal 
<222> 552. .557 

<400> 134 

gaggggcgag ggggctagcg gcgacggctg gggcgagcgc gcctgcgcgc tgggtgattt 60
tttcacgtgt cgccagggcc ggactgcgag tctctttgcg gcgctacact agagcagagt 120
acgagtctga ggcggaggga gta atg gca gga caa gcg ttt aga aag ttt ctt 173
Met Ala Gly Gln Ala Phe Arg Lys Phe Leu
1 5 10

cca ctc ttt gac cga gta ttg gtt gaa agg agt cct gct gaa act gta 221
Pro Leu Phe Asp Arg Val Leu Val Glu Arg Ser Pro Ala Glu Thr Val
15 20 25

acc gga ggc att atg ctt gaa aaa tct caa gga aaa gta ttg 269
Thr Lys Gly Gly Ile Met Leu Pro Glu Lys Ser Gln Gly Lys Val Leu
30 35 40

caa gca aca gta gtc gct gtt gga tcg ggt tct aaa gga aag ggt gga 317
Gln Ala Thr Val Val Ala Val Gly Ser Gly Ser Lys Gly Lys Gly Gly
45 50 55

gag att caa cca gtt agc gtg aaa gtt gga gat aaa gtt ctt ctc cca 365
Glu Ile Gln Pro Val Ser Val Lys Val Gly Asp Lys Val Leu Leu Pro
60 65 70

gaa tat gga ggc acc aaa gta gtt yta gat aac aag gat tat ttc cta 413
Glu Tyr Gly Gly Thr Lys Val Val Leu Asp Asn Lys Asp Tyr Phe Leu
75 80 85 90

ttt aga gat ggt gac att ctt gga aag tac gta gac tgaaataagt 459
Phe Arg Asp Gly Asp Ile Leu Gly Lys Tyr Val Asp
95 100

cactattgaa atggcatcaa catgatgctg cccattccac tgaagttctg aaatctttcg 519 
tcatgtaaat aatttccata tttctctttt ataataaact aatgataact aacaaaaaaa 579 
aaaaaaaaa 588 



<210> 135 
<211> 538 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 93. .398 

<220> 

<221> polyA_site 
<222> 523. .538 

<220> 

<221> polyA_signal 
<222> 501. .506 



<400> 135 

gggtgatttt ttcacgtgtc gccagggccg gactgcgagt ctctttgcgg cgctacacta 60
gagcagagta cgagtctgag gcggagggag ta atg gca gga caa gcg ttt aga 113
Met Ala Gly Gln Ala Phe Arg
1 5

aag ttt ctt cca ctc ttt gac cga gta ttg gtt gaa agg agt gct gct 161
Lys Phe Leu Pro Leu Phe Asp Arg Val Leu Val Glu Arg Ser Ala Ala
10 15 20

gaa act gta acc aaa gga ggc att atg ctt cca gaa aaa tct caa gga 209
Glu Thr Val Thr Lys Gly Gly Ile Met Leu Pro Glu Lys Ser Gln Gly
25 30 35

aaa gta ttg caa gca aca gta gtc gct gtt gga tcg ggt tct aaa gga 257
Lys Val Leu Gln Ala Thr Val Val Ala Val Gly Ser Gly Ser Lys Gly
40 45 50 55

aag ggt gga gag att caa cca gtt agc gtg aaa gtt gga gat aaa gtt 305
Lys Gly Gly Glu Ile Gln Pro Val Ser Val Lys Val Gly Asp Lys Val
60 65 70

ctt ctc cca gaa tat gga ggc acc aaa gta gtt cta gat gac aag gat 353
Leu Leu Pro Glu Tyr Gly Gly Thr Lys Val Val Leu Asp Asp Lys Asp
75 80 85

tat ttc cta ttt aga gat ggt gac att ctt gga aag tac gta gac 398
Tyr Phe Leu Phe Arg Asp Gly Asp Ile Leu Gly Lys Tyr Val Asp
90 95 100

tgaaataagt cactattgaa atggcatcaa catgatgctg cccattccac tgaagttctg 458
aaatctttcg tcatgtaaat aatttccata tttctctttt ataataaact aatgataact 518
aacgaaaaaa aaaaaaaaaa 538

<210> 136 
<211> 912 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 97. .444 

<220> 

<221> polyA_site 
<222> 897. .912 

<400> 136 

atctttccgt cccgggcagc cagcgccagt cggagccagc gcgagccgcc gccgccatca 60 
ctgccgctgc caagtcctcc acccgctgcc cccgcc atg tct gct acc gct gcc 114
Met Ser Ala Thr Ala Ala
1 5

acg gcc ccc cct gct gcc ccg gct ggg gag ggt ggt ccc cct gca ccc 162
Thr Ala Pro Pro Ala Ala Pro Ala Gly Glu Gly Gly Pro Pro Ala Pro
10 15 20

cct cca aac ctc acc agt aac agg aga ctg cag cag acc cag gcc cag 210
Pro Pro Asn Leu Thr Ser Asn Arg Arg Leu Gln Gln Thr Gln Ala Gln
25 30 35

gtg gat gag gtg gtg gac atc atg agg gtg aac gtg gac aag gtc ctg 258
Val Asp Glu Val Val Asp Ile Met Arg Val Asn Val Asp Lys Val Leu
40 45 50

gag cga gac cag aag ctg tcg gag ctg gac gac cgt gca gat gca ctc 306
Glu Arg Asp Gln Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu
55 60 65 70

cag gcg ggg ccc tcc cag ttt gaa aca agc gca gcc aag ctc aag cgc 354
Gln Ala Gly Pro Ser Gln Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg
75 80 85

aaa tac tgg tgg aaa aac ctc aag atg atg atc atc ttg gga gtg att 402
Lys Tyr Trp Trp Lys Asn Leu Lys Met Met Ile Ile Leu Gly Val Ile
90 95 100

tgc gcc atc atc ctc atc atc atc ata gtt tac ttc agc act 444
Cys Ala Ile Ile Leu Ile Ile Ile Ile Val Tyr Phe Ser Thr
105 110 115

taaatccccg aggagtctgc cctgcctaga gaagggcctc tcccccaacc ctcagccgtt 504
cctccacctc tcagccatat ctttcagccc cccctcccct ggatccgtgt gtgtgtgtgt 564
ccgtgtgtgt gtccccctgt aaatagccag ctgttattta tacatatata atattatata 624
tatttggtct gtttgtagtt ttattactag atgatttttc cggttgtcct taacacccct 684
tcctgaggtt cccttcaccc ctctctcttg ccttccttcc ctttcccttt cttcctgact 744
agccccaagt cccttcattt gcatctgcta tgcaatagtc cctctccttt ccttcttctt 804
ccctcagatt tagctgatcc ttcctcccac cctggccttc ctttcctctt tcctcctcac 864
tctccccgtc atgctccctc tgccccgccc tcaaaaaaaa aaaaaaaa 912

<210> 137 
<211> 562 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 45..335

<220>
<221> polyA_site
<222> 545..562

<220>
<221> polyA_signal
<222> 526..531

<400> 137 

tgttattttg tagctttaat gatttttgtt ttctaatagg gcaa atg tct cta agc 56
Met Ser Leu Ser

ttg gtg ttt aga gct gct tca tat ttt aaa cta gtt cca ttc 104
Leu Val Phe Arg Ala Ala Ser Tyr Phe Lys Leu Val Pro Phe His Ser
5 10 15 20

tct agt tca aac cag ttt tta cag cct cct ggg tgg gtc gtc ttg acc 152
Ser Ser Ser Asn Gln Phe Leu Gln Pro Pro Gly Trp Val Val Leu Thr
25 30 35

caa act ctt gtg ttg tta cat ttt gag agg ttt tca tac cag aat gta 200
Gln Thr Leu Val Leu Leu His Phe Glu Arg Phe Ser Tyr Gln Asn Val
40 45 50

cca aaa agt gca ggt aaa ggt aat tta cag cca gaa aca aat ata 248
Pro Lys Ser Ala Gln Gly Lys Gly Asn Leu Gln Pro Glu Thr Asn Ile
55 60 65

cat ttg ttt cat ttc ttg act ttc cct aag cag ata agc aga aac ttg 296
His Leu Phe His Phe Leu Thr Phe Pro Lys Gln Ile Ser Arg Asn Leu
70 75 80

ttt aac tca tta ctt tgt ttg atg tgt ctt aca tat ttt 345
Phe Asn Ser Leu Leu Cys Leu Met Cys Leu Thr Tyr Phe
85 90 95

ttatagaatg ttattcctct gggggaatct tttacaggtg gaggaatggg gatagcagta 405
ttgcctcaga ttcaaactgg catcaccata aaccctgtag gccarggtgg aatgaagtca 465
gctccttttt atagttgaaa tacaattttt tattcaccat tgtctgcaca aatccttgaa 525
aataaacctt tttttcccta caaaaaaaaa aaaaaaa 562

<210> 138 
<211> 566 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 32. .361 

<220> 

<221> polyA_site 
<222> 551. .566 

<220> 

<221> polyA_signal 
<222> 532. .537 

<400> 138 

aacttgctct gcccgctcaa acagttgcag g atg tcg atg aca gac ttg ctg 52
Met Ser Met Thr Asp Leu Leu
1 5

aac gag gac atc aag gcg gtg gga gcc ttt agc gct acc gac 100
Asn Ala Glu Asp Ile Lys Lys Ala Val Gly Ala Phe Ser Ala Thr Asp
10 15 20

tcc ttc gac cac aag ttc ttc caa gtc ggc ctg aag 148
Ser Phe Asp His Lys Lys Phe Phe Gln Met Val Gly Leu Lys Lys Lys
25 30 35

agt gcg gat gat gtg aag aag gtg ttt cac atg ctg gac aag gac aaa 196
Ser Ala Asp Asp Val Lys Lys Val Phe His Met Leu Asp Lys Asp Lys
40 45 50 55

agt ggc ttc atc gag gag gag ctg gga ttc atc cta aaa ggc ttc 244
Ser Gly Phe Ile Glu Glu Asp Glu Leu Gly Phe Ile Leu Lys Gly Phe
60 65 70

tcc cca gat gcc aga gac ctg tct gct aaa gaa acc aag atg ctg atg 292
Ser Pro Asp Ala Arg Asp Leu Ser Ala Lys Glu Thr Lys Met Leu Met
75 80 85

gct gct gga gac aaa gat ggg gac ggc aaa att ggg gtt gac gaa ttc 340
Ala Ala Gly Asp Lys Asp Gly Asp Gly Lys Ile Gly Val Asp Glu Phe
90 95 100

tcc act ctg gtg gct gaa agc taagaagcac tgactgcccc tggtcttcca 391
Ser Thr Leu Val Ala Glu Ser
105 110

cctctctgcc ctcaacaccc aatctcggcc cctcttgcca ccctcctgca tttctgttca 451
gttcgtttat gttatttttt actcccccat cccctgtggc cctctaatga caccattctt 511
ctggaaaatg ctggagaagc aataaaggtt gtaccagtca aaaaaaaaaa aaaaa 566



<210> 139 
<211> 567 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 31..360



<220> 

<221> polyA_site 
<222> 552. .567 



<220> 

<221> polyA_signal 
<222> 531. .536 



<400> 139 

acttgctctg cccgctcaaa cagttgcagg atg tcg atg aca gac ttg ctg aac 54
Met Ser Met Thr Asp Leu Leu Asn
1 5

gct gag gac atc aag aag gcg gtg gga gcc ttt agc gct acc gac tcc 102
Ala Glu Asp Ile Lys Lys Ala Val Gly Ala Phe Ser Ala Thr Asp Ser
10 15 20

ttc gac cac aaa aag ttc ttc caa atg gtc ggc ctg aag aaa aag agt 150
Phe Asp His Lys Lys Phe Phe Gln Met Val Gly Leu Lys Lys Lys Ser
25 30 35 40

gcg gat gtg aag aag gtg ttt cac atg ctg gac aag gac aaa agt 198
Ala Asp Asp Val Lys Lys Val Phe His Met Leu Asp Lys Asp Lys Ser
45 50 55

ggc ttc atc gag gag gat gag ctg gga ttc atc cta ggc ttc tcc 246
Gly Phe Ile Glu Glu Asp Glu Leu Gly Phe Ile Leu Lys Gly Phe Ser
60 65 70

cca gat gcc aga gac ctg tct gct gaa acc aag atg ctg atg gct 294
Pro Asp Ala Arg Asp Leu Ser Ala Lys Glu Thr Lys Met Leu Met Ala
75 80 85

gct gga gac aaa gat ggg gac ggc aaa att ggg gtt gac gaa ttc tcc 342
Ala Gly Asp Lys Asp Gly Asp Gly Lys Ile Gly Val Asp Glu Phe Ser
90 95 100

act ctg gtg gct gaa agc taagaagcac tgactgcccc tggtcttcca 390
Thr Leu Val Ala Glu Ser
105 110

cctctctgcc ctgaacaccc aatctcggcc cctctcgcca ccctcctgca tttctgttca 450
gttcgtttat gttatttttt actcccccat cccctgtggc cctctaatga caccattctt 510
ctggaaaatg ctggagaagc aataaaggtt gtaccagtca caaaaaaaaa aaaaaaa 567



<210> 140 
<211> 566 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 32..361

<220> 

<221> polyA_site 
<222> 551. .566 

<220> 

<221> polyA_signal 
<222> 532. .537 

<400> 140 

aacttgctct gcccgctcaa acagttgcag g atg tcg atg aca gac ttg ctg 52
Met Ser Met Thr Asp Leu Leu
1 5

aac gct gag gac aag aag gcg gtg gga gcc ttt agc gct acc gac 100
Asn Ala Glu Asp Ile Lys Lys Ala Val Gly Ala Phe Ser Ala Thr Asp
10 15 20

tcc ttc gac cac aaa aag ttc ttc caa atg gtc ggc ctg aag aaa aag 148
Ser Phe Asp His Lys Lys Phe Phe Gln Met Val Gly Leu Lys Lys Lys
25 30 35

agt gcg gat gat gtg aag aag gtg ttt cac atg ctg gac aag 196
Ser Ala Asp Asp Val Lys Lys Val Phe His Met Leu Asp Lys
40 45 50 55

agt ggc ttc atc gag gag gat gag ctg gga ttc atc cta aaa ggc ttc 244
Ser Gly Phe Ile Glu Glu Asp Glu Leu Gly Phe Ile Leu Lys Gly Phe
60 65 70

tcc cca gat gcc aga gac ctg tct gct aaa gaa acc aag atg ctg atg 292
Ser Pro Asp Ala Arg Asp Leu Ser Ala Lys Glu Thr Lys Met Leu Met
75 80 85

gct gct gga gac aaa gat ggg gac ggc aaa att ggg gtt gac gaa ttc 340
Ala Ala Gly Asp Lys Asp Gly Asp Gly Lys Ile Gly Val Asp Glu Phe
90 95 100

tcc act ctg gtg gct gaa agc taagaagcac tgactgcccc tggtcttcca 391
Ser Thr Leu Val Ala Glu Ser
105 110

cctctctgcc ctcaacaccc aatctcggcc cctcttgcca ccctcctgca tttctgttca 451
gttcgtttat gttatttttt actcccccat cccctgtggc cctctaatga caccattctt 511
ctggaaaatg ctggagaagc aataaaggtt gtaccagtca aaaaaaaaaa aaaaa 566

<210> 141 
<211> 625 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 85. .420 

<220> 

<221> polyA_site 
<222> 610..625

<400> 141 

cccttctgcg cggtcacgcc gagccagcgc ctgggcctgg aaccgggccg tagccccccc 60
agtttcgccc accacctccc tacc atg gac ccc cgc aaa gtg aac gak ctt 111
Met Asp Pro Arg Lys Val Asn Xaa Leu
1 5

cgg gcc ttt gtg aaa atg tgt aag cag gat ccg agc gtt ctg cac acc 159
Arg Ala Phe Val Lys Met Cys Lys Gln Asp Pro Ser Val Leu His Thr
10 15 20 25

gag gaa atg cgc ttc ctg agg gas tgg gtg gag agc atg ggt ggt aaa 207
Glu Glu Met Arg Phe Leu Arg Xaa Trp Val Glu Ser Met Gly Gly Lys
30 35 40

gta cca cct gct act caa aaa gct aaa tca gaa gaa aat acc aag gaa 255
Val Pro Pro Ala Thr Gln Lys Ala Lys Ser Glu Glu Asn Thr Lys Glu
45 50 55

gat aag aag gtg gag tta gac gaa 303
Glu Lys Pro Asp Ser Lys Lys Val Glu Glu Asp Leu Lys Ala Asp Glu
60 65 70

cca tca agt gag gaa agt gat cta gaa att gaa ggt gtg att 351
Pro Ser Ser Glu Glu Ser Asp Leu Glu Ile Asp Lys Glu Gly Val Ile
75 80 85

cca gac act gat gct cct caa gaa atg gga gat gaa aat gcg gag 399
Glu Pro Asp Thr Asp Ala Pro Gln Glu Met Gly Asp Glu Asn Ala Glu
90 95 100 105

ata acg gct caa cat ttt ttg tagccacagg cttccacgga cttcacgtca 450
Ile Thr Ala Gln His Phe Leu
110

ttattggctc aactttcctc actatctgct tcatccgcca actaatattt cactttacat 510
ccaaacatca ctttggcttc gaagccgccg cctgatactg gcattttgta gatgtggttt 570
gactatttct gtatgtctcc atctattgat gagggtctta aaaaaaaaaa aaaat 625

<210> 142 
<211> 685 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 22. .375 

<220> 

<221> polyA_site
<222> 670. .685 

<220> 

<221> polyA_signal 
<222> 650. .655 

<400> 142 



agtcgagcca gcgtcgccgc g atg gtg ttg ttg gag agc gag cag ttc ctg 51
Met Val Leu Leu Glu Ser Glu Gln Phe Leu
1 5 10

acg gag ctg acc aga ctt ttc cag aag tgc cgg acg tcg ggc agc gtc 99
Thr Glu Leu Thr Arg Leu Phe Gln Lys Cys Arg Thr Ser Gly Ser Val
15 20 25

tat atc acc ttg aag aag tat gac ggt cga acc aaa ccc att cca aag 147
Tyr Ile Thr Leu Lys Lys Tyr Asp Gly Arg Thr Lys Pro Ile Pro Lys
30 35 40

aaa ggt act gtg gag ggc ttt gag ccc gca gac aac aag tgt ctg tta 195
Lys Gly Thr Val Glu Gly Phe Glu Pro Ala Asp Asn Lys Cys Leu Leu
45 50 55

ggg aag aag atc act gtg gtg tcc 243
Arg Ala Thr Asp Gly Lys Lys Lys Ile Ser Thr Val Val Ser Ser Lys
60 65 70

gaa gtg aag ttt cag atg gct tat tca ctc ctt aga gct 291
Glu Val Asn Lys Phe Gln Met Ala Tyr Ser Asn Leu Leu Arg Ala Asn
75 80 85 90

gat ggg ttg aag aag aga aaa aag aac gaa act aag aag acc 339
Met Asp Gly Leu Lys Lys Arg Asp Lys Lys Asn Glu Thr Lys Lys Thr
95 100 105

aaa gca gca gca gca gca gca gca aca gca gca cag taaagggcat 385
Lys Ala Ala Ala Ala Ala Ala Ala Thr Ala Ala Gln
110 115

acatttcctg ctttcaccaa ttaaccactg aattgctatt ttttcctttt ggccagatag 445 

ctaggtttct ggttccccca cagtaggtgt tttcacataa gattagggtc cttttggaaa 505 

gaatagttgc agtgtttata ggatagttgt ggtaagaatc tagtttattt tgcatttggc 565 

taattggtct gtgctgcatg gttatatact cctggattat agattaaaag tctctgtaga 625 

catctctgtg aagagcaagc tatcattaaa catgtctgtt tatcaaaaaa aaaaaaaaaa 685 

<210> 143 
<211> 553 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 9. .335 

<220> 

<221> polyA_site 
<222> 538. .553 

<220> 

<221> polyA_signal 
<222> 520. .525 

<400> 143 

tcaagaat atg gat agt tta cct tct ggt aaa ata cat cga aaa gtg aaa 50
Met Asp Ser Leu Pro Ser Gly Lys Ile His Arg Lys Val Lys
1 5 10

ata ata tta gga cga aga gaa aat ctg gaa cca aat gct gaa 98
Ile Ile Leu Gly Arg Asn Arg Lys Glu Asn Leu Glu Pro Asn Ala Glu
15 20 25 30

ttt gat aaa aga act gaa ttt att gaa gaa aac aga att tgt 146
Phe Asp Lys Arg Thr Glu Phe Ile Thr Gln Glu Glu Asn Arg Ile Cys
35 40 45

tca ccg gta cag tct tta cta gac ttg ttt cag act agt gag 194
Ser Ser Pro Val Gln Ser Leu Leu Asp Leu Phe Gln Thr Ser Glu Glu
50 55 60

aaa tca gaa ttt ttg ggt ttc aca agc tac aca gaa aag agt ggt ata 242
Lys Ser Glu Phe Leu Gly Phe Thr Ser Tyr Thr Glu Lys Ser Gly Ile
65 70 75

tgc aat gtt tta gat att tgg gaa gag gaa aat tca gat aat ctg tta 290
Cys Asn Val Leu Asp Ile Trp Glu Glu Glu Asn Ser Asp Asn Leu Leu
80 85 90

aca gcg ttt ttc tcg tcc cct tca act tct aca ttt act ggc ttt 335
Thr Ala Phe Phe Ser Ser Pro Ser Thr Ser Thr Phe Thr Gly Phe
95 100 105

tagaatttaa aaaatgcata cttttcagaa gtgataagga tcatattctt gaaattttta 395
taaatatgta tggaaattct taggattttt ttaccagctt tgtttacaga cccaaatgta 455
aatattaaaa ataaatattt gcaattttct acagaattga atacctgtta aagaaaaatt 515
acagaataaa cttgtgactg gtaaaaaaaa aaaaaaat 553

<210> 144 
<211> 631 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 156. .464 



<220>
<221> polyA_site
<222> 610..631

<220>
<221> polyA_signal
<222> 559..564

<400> 144 

aattcatatc tgagtgggcg gtggcgattg gtgttggcgg tctggctcag ctgggcaggg 60 
ggtaacttta ctgatttggg ggtggttttt agtttaattt ttcttttcta gcttcccatc 120 
gacggtcagt gcgcacgttg taatcagctg aggcc atg tca gga gac gga gcc 173
Met Ser Gly Asp Gly Ala
1 5

gag cag gca gct gag gtc gag aag gtg aag gcg gaa 221
Thr Glu Gln Ala Ala Glu Tyr Val Pro Glu Lys Val Lys Lys Ala Glu
10 15 20

aag tta gaa gag aat cca tat gac ctt gat gct tgg agc att ctc 269
Lys Lys Leu Glu Glu Asn Pro Tyr Asp Leu Asp Ala Trp Ser Ile Leu
25 30 35

att cga gag gca cag aat caa cct ata gac aaa gca cgg aag act tat 317
Ile Arg Glu Ala Gln Asn Gln Pro Ile Asp Lys Ala Arg Lys Thr Tyr
40 45 50

gaa cgc ctt gtt gcc cag ttc ccc agt tct ggc aga ttc tgg aaa ctg 365
Glu Arg Leu Val Ala Gln Phe Pro Ser Ser Gly Arg Phe Trp Lys Leu
55 60 65 70

tac att gaa gca gag gtt act att tta ttt ttt ttc tta tat cag 413
Tyr Ile Glu Ala Glu Val Thr Ile Leu Phe Tyr Phe Phe Leu Tyr Gln
75 80 85

tat tgc agc att cac tgt agt gtt asg ata gcc 461
Tyr Cys Ser Ile His Cys Ser Asp Gln Val Xaa Asn Ile Ala
90 95 100

mat taggacaagg aggatttaaa tgtgtcttac ctttattttg taaaataggt 514
Xaa

ataaaggagt aattaaaatg aatttttgaa atttgggtct tttacaagct gatgattgtt 574
gcattttgga gttgcaacaa cattaaaaca gtttcaatga taaaaaaaaa aaaaaaa 631



<210> 145 
<211> 635 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 160..375

<220>
<221> polyA_site
<222> 620..635

<220>
<221> polyA_signal
<222> 569..574

<400> 145 

ttttaattca tatctgagtg ggcggtggcg attggtgttg gcggtctggc tcagctgggc 60
agggggtaac tttactgatt tgggggtggt ttttagttta atttttcttt tctagcttcc 120
catcgacggt cagtgcgcac gttgtaatca gctgaggcc atg tca gga gac gga 174
Met Ser Gly Asp Gly
1 5

gcc acg gag cag gca gct gag tat gtc tca gag aag gtg aag aaa gcg 222
Ala Thr Glu Gln Ala Ala Glu Tyr Val Ser Glu Lys Val Lys Lys Ala
10 15 20

gaa aag aaa tta gaa gag aat cca tat gac ctt gat gct tgg agc att 270
Glu Lys Lys Leu Glu Glu Asn Pro Tyr Asp Leu Asp Ala Trp Ser Ile
25 30 35

ctc att cga gag gca cag aat caa cct ata gac aaa gca cgg aag act 318
Leu Ile Arg Glu Ala Gln Asn Gln Pro Ile Asp Lys Ala Arg Lys Thr
40 45 50

tat gaa cgc ctt gtt gcc cag ttc ccc agt tct ggc aga ttc tgg aaa 366
Tyr Glu Arg Leu Val Ala Gln Phe Pro Ser Ser Gly Arg Phe Trp Lys
55 60 65

ctg tac att taagcagagg ttactatttt attttatttt ttcttatatc 415
Leu Tyr Ile
70

agtattgcag cattcactgt agtgatagaa aamaagttas gaacatagcc aattaggaca 475 
aggaggattt aaatttgtct tacctttatt ttgtaaaata ggtataaagg agtaattaaa 535 
atgaattttt gaaatttggg tcttttacaa gctgatgatt gttgcatttt ggagttgcaa 595 
caacattaaa acagtttcaa tggcaaaaaa aaaaaaaaaa 635 

<210> 146 
<211> 733 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 32..349

<220>
<221> polyA_site
<222> 718..733

<220>
<221> polyA_signal
<222> 711..716

<400> 146 

actaactagc aaacggggac tagaaatagg g atg ctg aaa agc aac ggg gag 52

Met Leu Lys Ser Asn Gly Glu 

1 5 



aga 
Arg 


cgc 
Arg 


agt 
Ser 
10 


cgt 
Arg 


aac 
Asn 


gca 
Ala 


ctt 
Leu 


cog 
Pro 
15 


gcg 
Ala 


gtc 
Val 


tac 
Tyr 


gcg 
Ala 


agg 
Arg 
20 


aag 
Lys 


atg 
Met 


get 
Ala 


100 


gca 
Ala 


tec 
Ser 
25 


cag 
Gin 


cag 
Gin 


caa 
Gin 


get 
Ala 


tea 
Ser 
30 


geg 
Ala 


get 
Ala 


tec 
Ser 


tea 
Ser 


get 
Ala 
35 


get 
Ala 


ggt 
Gly 


gta 
Val 


teg 
Ser 


148 


ggt 
Gly 
40 


cct 
Pro 


agt 
Ser 


teg 
Ser 


get 
Ala 


ggc 
Gly 
45 


ggc 
Gly 


ceg 
Pro 


ggt 
Gly 


ccc 
Pro 


cag 
Gin 
50 


cag 
Gin 


cag 
Gin 


ceg 
Pro 


Gin 


Pro 
55 


196 


cea 
Pro 


gca 
Ala 


caa 
Gin 


ctg 
Leu 


gtg 
Val 
60 


ggc 
Gly 


cct 
Pro 


gcc 
Ala 


Gin 


age 
Ser 
65 


ggc 
Gly 


etc 
Leu 


ctg 
Leu 


Gin 


caa 
Gin 
70 


cag 
Gin 


244 


caa 
Gin 


cag 
Gin 


gac 
Asp 


ttc 
Phe 
75 


Asp 


cct 
Pro 


gtg 
Val 


cag 
Gin 


cgt 
Arg 
80 


tat 
Tyr 


aag 
Lys 


atg 
Met 


etc 
Leu 


ate 
He 
85 


ceg 
Pro 


cag 
Gin 


292 


ctg 
Leu 


aag 


gag 
Glu 
90 


agt 
Ser 


eta 
Leu 


cag 
Gin 


gtg 

Val 


att 
He 
95 


ggc 
Gly 


ctt 
Leu 


aag 

Lys 


cag 
Gin 


cga 

Arg 
100 


gaa 

Glu 


gca 
Ala 


aac 
Asn 


340 



tgg att tgg tagtctagag gggcaggccg rgggccaggt tagtcggaga 389 
Trp Ile Trp
105 

gagcgctaag cagaaagagg aatagaacct gctgttttgg cctggctggt gcacgcctgt 449 

gttcccagca ctttgggagg ccgaggcggg cggatcactt gaggtcagga gttcaagacc 509 

agcctggtga aacctcgtct ctactaaaaa tacaaaaatt agccgggcat ggtggcgcgc 569 

gcctgtaatt acagctactc gggaagctga ggcaggagaa tcgcttgaac ccaggaggcg 629

gaggttgcag tgagccgaca tcgtgccatt gcactccagc ctgggcaaca gagcgagctc 689

cgtcaawaaa ataaataaat aaataaataa aaaaaaaaaa aaaa 733

<210> 147
<211> 521 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 111..368

<220>
<221> polyA_site
<222> 506..521

<220>
<221> polyA_signal
<222> 483..488

<400> 147 

tctcagcttc cggctggtag tagttccgct tcctgtccga ctgtggtgtc tttgctgagg 60 
gtcacattga gctgcaggtt gaatccgggg tgcctttagg attcagcacc atg gcg 116 

Met Ala 



gaa 


gac 


atg 


gag 


acc 


aaa 


ate 


aag 




tac 


aag 




gcc 


cot 


ttt 


gac 


164 


Glu 


Asp 


Met 
5 


Glu 


Thr 


Lys 


He 


Lys 
10 


Asn 


Tyr 


Lys 


Thr 


Ala 
15 


Pro 


Phe 


Asp 




age 


cgc 


ttc 






cag 






act 


aga 






tgg 


cag 


aac 


tac 


212 


Ser 


Arg 
20 


Phe 


Pro 


Asn 


Gin 


Asn 
25 


Gin 


Thr 


Arg 


Asn 


Cys 
30 


Trp 


Gin 


Asn 


Tyr 




ctg 


gac 


ttc 






tgt 


cag 


aag 


gca 


atg 


acc 


get 




gga 


ggc 


gat 


260 


Leu 


Asp 


Phe 


His 


Arg 


Cys 


Gin 


Lys 


Ala 


Met 


Thr 


Ala 


Lys 


Gly 


Gly 


Asp 




35 










40 










45 










50 




ate 


tct 


gtg 


tgc 


gaa 


tgg 


tac 


cag 


cgt 


gtg 


tac 


cag 


tec 


etc 


tgc 


ccc 


308 


He 


Ser 


Val 


Cys 


Glu 
55 


Trp 


Tyr 


Gin 


Arg 


Val 
60 


Tyr 


Gin 


Ser 


Leu 


Cys 
65 


Pro 




aca 


tec 


tgg 


gtc 


aca 


gac 


tgg 


gat 


gag 


caa 


egg 


get 


gaa 


ggc 


acg 


ttt 


356 


Thr 


Ser 


Trp 


Val 
70 


Thr 


Asp 


Trp 


Asp 


Glu 
75 


Gin 


Arg 


Ala 


Glu 


Gly 
80 


Thr 


Phe 





ccc ggg aag aty tgaactggct gcatctccct ttcctctgtc ctccatcctt 408 
Pro Gly Lys Ile
85 

ctcccaggat ggtgaagggg gacctggtac ccagtgatcc ccaccccagg atcctaaatc 468 
atgacttacc tgctaataaa aactcattgg aaaagtcaaa aaaaaaaaaa aaa 521 



<210> 148 
<211> 547 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 95..409

<220>
<221> polyA_site
<222> 533..547

<220>
<221> polyA_signal
<222> 509..514



<400> 148 

ttcgaagtgg ttgtctctgc actatttact gaaaagttca tcctttcccc agtgatttgt 60 






aacactgcct ctttcataaa ttaaattttt gtgt atg tgt gtg ggt ctt ttg atg 115 
Met Cys Val Gly Leu Leu Met 
1 5 



gtt 


tct 


att 


ctg 


act 


gac 


ate 


aat 


ttg 


tet 


aat 


ctt 


gta 


gea 


gta 


cag 


163 


Val 


Ser 


He 
10 


Leu 


Thr 


Asp 


He 


Asn 
15 


Leu 


Ser 


Asn 




Val 
20 


Ala 


Val 


Gin 




tac 


agt 


cct 


gat 


tat 


tgc 


aac 


ttt 


agg 




agg 


tct 


gat 


aaa 


aac 


caa 


211 


Tyr 


Ser 
25 


Pro 


Asp 


Tyr 


Cys 


Asn 
30 


Phe 


Arg 


Lys 


Arg 


Ser 
35 


Asp 


Lys 


Asn 


Gin 




gat 


gcc 


tec 


aca 


ttt 


tgt 


eat 


aat 


tgt 


aae 


eaa 


ttt 


eae 


etc 


tgt 


etc 


259 












Cys 
























40 










45 










50 










55 






tat 


cac 


cac 






gtt 


ctt 


cct 


tgg 


agt 


gtc 


ttg 


get 




ctt 


307 


Gin 


Tyr 


His 


His 


Lys 
60 


He 


Val 


Leu 


Pro 


Trp 
65 


Ser 


Val 


Leu 


Ala 


He 
70 


Leu 




age 


caa 


ctg 


ttc 


etc 


eat 


att 


act 


tet 


aga 


att 


aga 


eea 


aea 


att 


tat 


355 


Ser 


Gin 


Leu 


Phe 

75 


Leu 


His 


He 


Thr 


Ser 
80 


Arg 


He 


Arg 


Pro 


Thr 

85 


He 


Tyr 




aaa 


tea 






ccc 


aag 


age 


att 


gaa 


att 


ttg 


att 


gga 


ttt 


gta 


ttg 


403 


Lys 


Ser 


Asn 


Lys 


Pro 


Lys 


Ser 


He 


Glu 


He 




He 


Gly 


Phe 


Val 


Leu 





90 95 100 



aat tta tagattaatc tggtaaacca tgtcatcttt acaatgttgt cttccaatac 459 
Asn Leu 
105 

atgaatatgg tacagctctt catttactta ggcctttgaa atatctttca ataaagttta 519 
taattttctc cataaaaaaa aaaaaaaa 547



<210> 149 
<211> 3159 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 103..405



<400> 149 

acttccggtg cgaaccgcct cggccgttcc ctcgcggact tactgagcgc ggccgccgag 60

cccagctccg ccgccgagcg cctgtgccgg cacggctaca cc atg gag cgc ccg 114

Met Glu Arg Pro 



gat 


aag 


gcg 


gcg 


ctg 




gea 


ctg 


cag 


cct 


cct 


gag 


ttc 


aga 


aat 




162 


Asp 


Lys 


Ala 


Ala 


Leu 


Asn 


Ala 


Leu 


Gin 


Pro


Pro 


Glu 


Phe 


Arg 


Asn 


Glu 




5 










10 










15 










20 




age 


tea 


tta 




tct 




ctg 


aag 


aeg 


etc 


ctg 


tte 


tte 




get 


tta 


210 


Ser 


Ser 


Leu 


Ala 


Ser 
25 


Thr 


Leu 


Lys 


Thr 


Leu 
30 


Leu 


Phe 


Phe 


Thr 


Ala 
35 


Leu 




atg 


ate 


act 


gtt 


cet 


att 


ggg 


tta 


tat 


tte 


aea 


aet 


aaa 


tet 


tae 


ata 


258 


Met 


He 


Thr 


Val 
40 


Pro 


He 


Gly 


Leu 


Tyr 
45 


Phe 


Thr 


Thr 




Ser 
50 


Tyr 


He 




ttt 


gaa 


ggc 


gcc 


ctt 


ggg 


atg 


tec 


aat 


agg 


gac 


age 


tat 


ttt 


tae 


get 


306 


Phe 


Glu 


Gly 
55 


Ala 




Gly 


Met 


Ser 
60 






Asp 


Ser 


Tyr 
65 


Phe 


Tyr 


Ala 




get 


att 


gtt 


gea 


gtg 


gtc 


gcc 


gtc 


cat 


gtg 


gtg 


ctg 


gee 


etc 


ttt 


gtg 


354 


Ala 


He 
70 


Val 


Ala 


Val 


Val 


Ala 
75 


Val 


His 


Val 


Val 


Leu 
80 


Ala 


Leu 


Phe 


Val 




tat 


gtg 


gee 


tgg 


aat 


gaa 


ggc 


tea 


cga 


cag 


tgg cgt 


gaa 


ggc 


aaa 


cag 


402 


Tyr 


Val 


Ala 


Trp 


Asn 


Glu 


Gly 


Ser 


Arg 


Gin 


Trp Arg 


Glu 


Gly 


Lys 


Gin 




85 










90 










95 










100 





gat taaagtgaac atcacctttt tatagcatta aattcatttt ttaaaatgat 455
Asp

aaatgctgga gggggccatc tgatttgaat aaagttgaaa gaacatgtta aagtcagtct 515

taaggagtca cgtttgagta tgtaaatttt gatctttcta atatgttggt ttgtatattc 575

agttttaact gtatgaatct gatttgcaaa tgagaatttg gaaaagttag ttacaaagaa 635

atatgttaat ttaattagac aatactctgg aaggaatttt atcttctttc aacaaaacat 695 

gttttatagt attctgactt acggttgctt ttgagtttta ctcatktgga tatattaaga 755 

tgcacacagt gaagcaaatt aaactccact ttacgctgga atgctttctt tagcatgaaa 815 

ataccaggtc cttggatttg ggattttaat ttcctatgga aagttgctta aattgtggac 875 

actggaatta atctgaatgt cactgaggaa tttcacatga agtgtaatcc ctagtcaata 935 

agaattatcc attacattat tttatgggaa aactaggcta aattacatcc attcaggtaa 995 

aaggacctta gcttactgaa ggatctaaag agcaaagcaa agatctcact actcaaacac 1055 

tcagcctgct tccttcaagt ccccttgcag gccagctttg tgctttgcag accaactttt 1115 

taatgagata ctttgcttcc tcattcaaca ttgaagctag gcttcaatta aaaggttcga 1175 

ggaagctcca tttaaaattg tttttttact attttttaaa attgtagtgt atatgatagg 1235 

aatttgcatt taaatatgtt catttttgca tatgttagga gtggaaacaa tctggaaaac 1295 

attttttttt catccaaaaa gtattctcct tgggcatatc tgatggaaaa aaaccttgat 1355 

tttattttcg tatctttagt ctgtgttctt tctagttatt tggtactaat tatgtgcaat 1415 

ctaaaaacac tcccacaagt atttgttttt taattataaa atcatagtat atgttctttg 1475

tagaaaactg gaaaaataca tattcaaaca gaaaaaaaaa tcaaaattcc ccataatgtt 1535 

gccatctaaa aataacctct attttagttg atatcccgta ttcatttttg aaagccattc 1595 

cttaatgcta gtttgataca cactaaaagt ttagcttaca agttcaaatt ctgccagctt 1655 

ttcctgacag ctatttgcat ttttttcaga tgagtgatta ttggccattt tctttttctt 1715 

tttctttatt ttatttattt atttttttga gacagagttt tgctctgttg cccaggctgg 1775 

agtgcagtgg tgcaatctcg gctcactgca acctctgcct cctgggttca agtgattctc 1835 

cacctcagcc tcccaagtag ctgggactac ggatgcctgc caccacgcct ggctaatttt 1895

tttttgtatt tttttgtaga gacggggttt caccatgttg tccaggctaa tcttgaactt 1955 

gtgacctcag gtgatccacc cgcctcggcc tccgaaagtg ctgggattac aggcgtgagc 2015 

taccacgccc ggccttattg accattttct aaataagcac attctatctt tattctctta 2075 

aaattcaaat tttctgttac tgataatcct aatactagga ttcttgctta agtatgtgaa 2135 

accattaccg atttgttgtt cacatttatt ttttatgttg tgaaactgga ctaaaggaat 2195 

agagggatga ttagtcataa aagtcaaata gcatttgtgt ttaactgttg agaaaagtga 2255 

aagatcagta tgattattat ggaactgttt ttaattcttg cttaaagact acaattttag 2315 

tataatgaca tttgagtcta gggtagtatg tggtagattt ctagatggtc cctaattaag 2375 

aagtattgtt gtatttagaa ttgtccacct aatttctttt tatataatgc caaggtattt 2435

cttgtgcttt tgggatctta tgctgtttgt aaaatgttac tgtccaatgt tggattattg 2495

ttttggtttc aggcatttgc tgaataggtg atgatacatg ggtatttttc tgcaagtatt 2555 

taaaccaggg gcatatgcaa aggcagttgt aatttcctct tggaaaaagc gccaaatgtt 2615 

tgaaggttaa aatcaaatgc tagggttgat atttaggctt ataacaaaat aggcttgttt 2675 

tcaaagcagt tttttcctag agttttaact gttaactcac tagtttgctg ctgtttttaa 2735 

ctatgttaaa taacatatgg tatttggcaa atagatttat ttttcaaaat gtctcactag 2795 

tttcctttta cacaatgtat atacttcaag atgtatagaa aggaaagcta cagttgagcc 2855 

cttatacatg ttttaaggta gaaatatgtt ccctattgtt tgaaaactga ttgtaagaat 2915 

aacctcagtt aggagatata acttgaagtg tcagtccaaa ctactgattt aaccctattt 2975 

acggtaacac attaccttcc tcacctcctg tttggccctg gagaatgtag tcctttttct 3035 

catttgtgtt gagaaatgaa aagtctgctg tagaatgtat ctgatgtcat tagttcttca 3095 

aatggatacc attgtacata taacagtaga atttggtttg gggttgttag tgaaaaaaaa 3155 

ttta 3159 

<210> 150 
<211> 1033 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 30..305

<400> 150 

agaatgtcgt actctcagga ggctccacc atg ttc agg gat ttc gga cgc cga 53
Met Phe Arg Asp Phe Gly Arg Arg
1 5

ctg cag agg gat ttg aag aga gtg gtg gat gct agg ctg agg ctc agc 101
Leu Gln Arg Asp Leu Lys Arg Val Val Asp Ala Arg Leu Arg Leu Ser

10 15 20

gag gag ctc ars age ggg agg atc aag ccg aag cct gtg gag gtc cag 149
Glu Glu Leu Xaa Xaa Gly Arg Ile Lys Pro Lys Pro Val Glu Val Gln
25










30 35 




40 




gtg 


gtc 


a eg 


cat 


cac 


atg cag cgc tac gcc gtg 


tgg 


ttc gga ggc tec 


197 


Val 


Val 


Thr 


His 


His 
45 


Met Gin Arg Tyr Ala Val 
50 


Trp 


Phe Gly Gly Ser 
55 




atg 


ctg 


gcc 


teg 


aet 


ccc gag ttc ttt cag gtc 


tgc 


cac ace aag aag 


245 


Met 


Leu 


Ala 


Ser 
60 


Thr 


Pro Glu Phe Phe Gin Val 
65 


Cys 


His Thr Lys Lys 
70 




gac 


tat 


gaa 


gag 


tac 


ggg ccc age ate tgc cgc 


cac 


aac ccc gtc ttt 


2 93 


Asp 


Tyr 


Glu 
75 


Glu 


Tyr 


Gly Pro Ser lie Cys Arg 
80 


His 


Asn Pro Val Phe 
85 




gga 


gtc 




tec 


tagtgtctgc ctgaacgcgt cgttcgatgg tgteaegttg 


345 


Gly 


Val 
90 


Met 


Ser 













gggaacaagt gtccttcaga acccagagaa ggccgccgtt ctgtaaatag cgacgtcggt 405

gttgctgccc agcagcgtgc ttgcattgcc ggtgcatgag gcgcggcgcg ggcccttcag 465

taaaagccat ttatccgtgt gccgaccgct gtctgccagc ctcctccttc tcccgccctc 525

ctcaccctcg ctctccctcc tcctcctcct ccgagctgct agctgacaaa tacaattctg 585

aaggaatcca aatgtgactt tgaaaattgt tagagaaaac aacattagaa aatggcgcaa 645

aatcgttagg tcccaggaga gaatgtgggg gcgcaaaccc ttttcctccc agcctatttt 705

tgtaaataaa atgtttaaac ttgaaataca aatcgatgtt tatatttcct atcattttgt 765

attttatggt atttggtaca actggctgat actaagcacg aatagatatt gatgttatgg 825

agtgctgtaa tccaaagttt ttaattgtga ggcatgttct gatatgttta taggcaaaca 885

aataaaacag caaacttttt tgccacatgt ttgctagaaa atgattatac tttattggag 945

tgacatgaag tttgaacact aaacagtaat gtatgagaat tactacagat acatgtatct 1005

tttagttttt tttgtttgaa ctttctgg 1033 

<210> 151 

<211> 1033 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 30..305

<400> 151 

agaatgtcgt actctcagga ggctccacc atg ttc agg gat ttc gga cgc cga 53 
Met Phe Arg Asp Phe Gly Arg Arg 
1 5 



ctg 


cag 


agg 


gat 


ttg 


aag 


aga 


gtg 


gtg gat get agg 


Ctg agg 


etc 


age 


101 


Leu 


Gin 
10 


Arg 


Asp 


Leu 


Lys 


Arg 

15 


Val 


Val Asp Ala Arg 
20 


Leu Arg 


Leu 


Ser 




gag 


gag 


etc 


age 


ggc 


ggg 


agg 


ate 


aag ccg aag cct 


gtg gag 


gtc 


cag 


149 


Glu 


Glu 


Leu 


Ser 


Gly 


Gly 


Arg 


He 


Lys Pro Lys Pro 


Val Glu 


Val 


Gin 




25 










30 






35 






40 




gtg 


gtc 


aeg 


cat 


cac 


atg 


cag 


cgc 


tac gee gtg tgg 


ttc gga 


ggc 


tee 


197 


Val 


Val 


Thr 


His 


His 


Met 


Gin 


Arg 


Tyr Ala Val Trp 


Phe Gly 


Gly Ser 












45 








50 




55 






atg 


ctg 


gcc 


teg 


act 


ccc 


gag 


ttc 


ttt cag gtc tgc 


cac ace 


aag 


aag 


245 


Met 


Leu 


Ala 


Ser 
60 


Thr 


Pro 


Glu 


Phe 


Phe Gin Val Cys 
65 


His Thr 
70 


Lys 


Lys 




gac 


tat 


gaa 


gag 




ggg 


ccc 


age 


ate tgc cgc cac 


aac ccc 


gtc 


ttt 


293 


Asp 


Tyr 


Glu 
75 


Glu 


Tyr 


Gly 


Pro 


Ser 
80 


He Cys Arg His 


Asn Pro 
85 


Val 


Phe 




gga 


gtc 


atg 


tec 


tagtgtctgc ctgaacgcgt cgttcgatgg tgteaegttg 




345 


Gly 


Val 
90 


Met 


Ser 





















gggaacaagt gtccttcaga acccagagaa ggccgccgtt ctgtaaatag cgacgtcggt 405

gttgctgccc agcagcgtgc ttgcattgcc ggtgcatgag gcgcggcgcg ggcccttcag 465

taaaagccat ttatccgtgt gccgaccgct gtctgccagc ctcctccttc tcccgccctc 525

ctcaccctcg ctctccctcc tcctcctcct ccgagctgct agctgacaaa tacaattctg 585

aaggaatcca aatgtgactt tgaaaattgt tagagaaaac aacattagaa aatggcgcaa 645

aatcgttagg tcccaggaga gaatgtgggg gcgcaaaccc ttttcctccc agcctatttt 705




tgtaaataaa atgtttaaac ttgaaataca aatcgatgtt tatatttcct atcattttgt 765 

attttatggt atttggtaca actggctgat actaagcacg aatagatatt gatgttatgg 825 

agtgctgtaa tccaaagttt ttaattgtga ggcatgttct gatatgttta taggcaaaca 885 

aataaaacag caaacttttt tgccacatgt ttgctagaaa atgattatac tttattggag 945 

tgacatgaag tttgaacact aaacagtaat gtatgagaat tactacagat acatgtatct 1005 

tttagttttt tttgtttgaa ctttctgg 1033 

<210> 152 
<211> 688 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 159..452

<220>
<221> polyA_site
<222> 678..688

<220>
<221> polyA_signal
<222> 670..675

<400> 152 

atacgtacac caatgcggac aagctcctag aagcagcaga gcagttggct cagacggggg 60 
aatgtgaccc cgaggagatc tacaaggcag ctcgacacct ggaggtgcgc atccaagact 120 
tcgtgcgcag ggtggagcag cggaagcttc tcctggac atg tct gtt tcc ttc cac 176

Met Ser Val Ser Phe His 

1 5 





cac 


acc 




gag 


ttg 


tgg 


aca 


tgg 


atg 


gaa 


gae 


ctt 


cag 


aag 


gag 


224 


Thr 


His 


Thr 


Lys 
10 


Glu 


Leu 


Trp 


Thr 


Trp 
15 


Met 


Glu 


Asp 


Leu 


Gin 
20 


Lys 


Glu 




atg 


ttg 


gag 


gat 


gtc 


tgt 


gca 


gat 


tet 


gtg 


gat 


gea 


gte 


cag 


gaa 


etg 


272 


Met 


Leu 


Glu 
25 


Asp 


Val 


Cys 


Ala 


Asp 

30 


Ser 


Val 


Asp 


Ala 


Val 

35 


Gin 


Glu 


Leu 




ate 


aag 


cag 


ttc 


cag 


cag 


cag 


cag 


acc 


gcc 


act 


eta 


gat 


gcc 


aca 


etc 


320 


He 


Lys 
40 


Gin 


Phe 


Gin 


Gin 


Gin 
45 


Gin 


Thr 


Ala 


Thr 


Leu 
50 


Asp 


Ala 


Thr 


Leu 




aat 


gtc 


ate 


aag 


gaa 


ggc 


gaa 


gae 


ctt 


ate 


cag 


cag 


etc 


agg 


tea 


geg 


368 


Asn 


Val 


He 


Lys 


Glu 


Gly 


Glu 


Asp 


Leu 


He 


Gin 


Gin 


Leu 


Arg 


Ser 


Ala 




55 










60 










65 










70 




cct 


COG 


tec 


etc 


ggg 


gag 


ccc 


age 


gag 


gee 


agg 


tea 


gea 


tgg 


gea 


gag 


416 


Pro 


Pro 


Ser 


Leu 


Gly 
75 


Glu 


Pro 


Ser 


Glu 


Ala 
80 


Arg 


Ser 


Ala 


Trp 


Ala 
85 


Glu 




ctt 


tec 


agt 


ggg 




tgc 


etc 


ggg 


eta 


gae 


gtg 




taagtectgt 




462 


Leu 


Ser 


Ser 


Gly 


Lys 


Cys 


Leu 


Gly 


Leu 


Asp 


Val 


Arg 













90 95 



cccagtcttg actcagcaaa atcttgcttt gtggtcttgt gtgagcttcg gtttctttat 522

ctgtaagatg taaaaatgcc tgcaccatac tcaacttgaa tctagaataa aatagtaaat 582

ggaaaagctt taatgtttca aagaattatt tatcattcag gatgttattc agccatcagt 642

aagtaacata aatacaacaa aaattaacat aaaacaaaaa aaaaaa 688

<210> 153 
<211> 420 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 63..329

<220>
<221> polyA_site
<222> 406..420

<400> 153 

atgctcgcca ggataccccc tcttcccatg gcgacctcag accgcgggcg tcagcgttgg 60 

cg atg gtc tcc gga gac ggc ttc ctc gtt tcc agg cct gaa gcg att 107

Met Val Ser Gly Asp Gly Phe Leu Val Ser Arg Pro Glu Ala Ile
1 5 10 15




cat 


eta 


gga 


ect 


egg 


eag 


gcg 


gtg 


cga 


eca 


age 


gtt 


egg 


gee gag age 


155 


His 


Leu 


Gly 


Pro 


Arg 
20 


Gin 


Ala 


Val 


Arg 


Pro 
25 


Ser 


Val 


Arg 


Ala Glu Ser 
30 




cgt 


cga 


gtg 


gat 


ggt 


gge 


gge 


egg 


age 


eca 


agg 


gaa 


eca 


gat gge egg 


203 


Arg 


Arg 


Val 


Asp 


Gly 


Gly 


Gly 


Arg 


Ser 


Pro 


Arg 


Glu 


Pro Asp Gly Arg 










35 










40 










45 




ggc 


egg 


age 


ege 


eaa 


gcg 


aga 


tte 


tea 


ect 


tae 


eca 


ate 


ect gee gtt 


251 


Gly 


Arg 


Ser 
50 


Arg 


Gin 


Ala 


Arg 


Phe 
55 


Ser 


Pro 


Tyr 


Pro 


He 
60 


Pro Ala Val 




gaa 


ccc 


gat 


etc 


eta 


aga 


agt 


gtg 


ctg 




cag 


cgt 


ttg 


att gca tta 


299 


Glu 


Pro 

65 


Asp 


Leu 


Leu 


Arg 


Ser 

70 


Val 


Leu 


Gin 


Gin 


Arg 

75 


Leu 


He Ala Leu 




gga 


ggt 


gtt 


ate 


gca 


get 


cga 


att 


tea 


gtt 


taaacgaaca cettteetct 


349 


Gly 


Gly 


Val 


He 


Ala 


Ala 


Arg 


He 


Ser 


Val 












80 










85 





















ggccctcact tagcttgtga acaggccttt ttaaaatcct ttcttggtgt agcagcaaaa 409
aaaaaaaaaa a 420 

<210> 154 
<211> 640 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 181..549

<220>
<221> polyA_site
<222> 626..640

<400> 154 

atgctcgcca gaatacccct cgttaaaggc aaggcggctt ctggctcttc cgcaggctca 60

gttatgggtt cctgtgtgct tagtcatcgg gtccgacaca ggctggactg atctggggag 120

ccgcgaaggg cctgccttca caaagggacg taacgcaagt attgcgggca gtgtttgaat 180



atg 


gcc 


ctg 


aac 


aat 


gtg 


tec 


ctg 


tec 


tec 


ggt 


gat 


cag 


agg 


age 


agg 


228 


Met 


Ala 


Leu 


Asn 


Asn 


Val 


Ser 


Leu 


Ser 


Ser 


Gly Asp 


Gin 


Arg 


Ser 


Arg 




1 








5 










10 










15 






gtg 


gcc 


tae 


ege 


tet 


tec 


eat 


ggc 


gac 


etc 


aga 


ccg 


egg 


gcg 


tea 


gcg 


276 


Val 


Ala 


Tyr 


Arg 

20 


Ser 


Ser 


His 


Gly 


Asp 

25 


Leu 


Arg 


Pro 


Arg 


Ala 

30 


Ser 


Ala 




ttg 


gcg 


atg 


gtc 


tec 


gga 


gac 


ggc 


ttc 


etc 


gtt 


tec 


agg 


cct 


gaa 


gcg 


324 


Leu 


Ala 


Met 
35 


Val 


Ser 


Gly 


Asp 


Gly 
40 


Phe 


Leu 


Val 


Ser 


Arg 
45 


Pro 


Glu 


Ala 




att 


cat 


eta 


gga 


cct 


egg 


cag 


gcg 


gtg 


cga 


eca 


age 


gtt 


egg 


gcc 


gag 


372 


He 


His 
50 


Leu 


Gly 


Pro 


Arg 


Gin 
55 


Ala 


Val 


Arg 


Pro 


Ser 
60 


Val 


Arg 


Ala 


Glu 




age 


cgt 


cga 


gtg 


gat 


ggt 


ggc 


ggc 


egg 


age 


eca 


agg 


gaa 


eca 


gat 


gge 


420 


Ser 


Arg 


Arg 


Val 


Asp 


Gly 


Gly 


Gly 


Arg 


Ser 


Pro 


Arg 


Glu 


Pro 


Asp 


Gly 




65 










70 










75 










80 




egg 


ggc 


egg 


age 


ege 




gcc 


aga 


ttc 


tea 


cct 


tac 


eca 


ate 


cct 


gcc 


4 68 


Arg 


Gly 


Arg 


Ser 


Arg 
85 


Gin 


Ala 


Arg 


Phe 


Ser 
90 


Pro 


Tyr 


Pro 


He 


Pro 
95 


Ala 




gtt 


gaa 


ccc 


gat 


etc 


eta 


aga 


agt 


gtg 


ctg 


caa 


cag 


cgt 


ttg 


att 


gca 


516 


Val 


Glu 


Pro 




Leu 






Ser 


Val 




Gin 


Gin 




Leu 


He 


Ala 
100 105 110

tta gga ggt gtt atc gca gct cga att tca gtt taaacgaaca cctttcctct 569
Leu Gly Gly Val Ile Ala Ala Arg Ile Ser Val

115 120 
ggccctcact tagcttgtga acaggccttt ttaaaatcct ttcttggtgt agcagcaaaa 629 
aaaaaaaaaa a 640 



<210> 155 
<211> 1582 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 8..523

<220>
<221> polyA_site
<222> 1567..1582



<400> 155 



agtcaag 


atg 


gcg 


gga 


gca 


get 


acc 


cag 


get 


tec 


Ctg 


gag 


teg 


gcc 


cea 


49 






Met 


Ala 


Gly Ala Ala 


Thr 


Gin 


Ala 


Ser 


Leu 


Glu 


Ser 


Ala 


Pro 








1 








5 










10 












egg 


ate 


atg 


egg 


ctg 


gtg 


gee 


gaa 


tgc 


age 


cgc 


tec 


agg 


gcc 


egg 


gca 


97 


Arg 


He 


Met 


Arg 


Leu 


Val 


Ala 


Glu 


Cys 


Ser 


Arg 


Ser Arg 


Ala 


Arg 


Ala 




15 










20 










25 










30 




gge 


gag 


Gtg 


tgg 


ctg 


ccg 


cat 


ggg 


aea 


gtg 


gcc 


act 




gtg 




atg 


145 


Gly 


Glu 


Leu 


Trp 


Leu 

35 


Pro 


His 


Gly 


Thr 


Val 
40 


Ala 


Thr 


Pro 


Val 


Phe 
45 


Met 




cea 


gtg 


gge 


aeg 




gee 


ace 




aag 


gge 


ate 


aeg 


acc 


gaa 


cag 


ctg 


193 


Pro 


Val 


Gly 


Thr 
50 


Gin 


Ala 


Thr 


Met 


Lys 
55 


Gly 


He 


Thr 


Thr 


Glu 
60 


Gin 


Leu 




gac 


get 


ctg 


ggt 


tgc 


cgc 


ate 


tgc 


ctg 


gge 


aat 


acc 


tac 


cat 


ctg 


ggt 


241 


Asp 


Ala 


Leu 


Gly 


Cys 


Arg 


He 


Cys 


Leu Gly Asn Thr 


Tyr 


His 


Leu 


Gly 








65 










70 










75 










eta 


agg 


ccg 


gga 




gag 


ctg 


ate 


cag 


aaa 


gcc 




ggt 


etc 


cac 


gge 


289 


Leu 


Arg 


Pro 


Gly 


Pro 


Glu 


Leu 


He 


Gin 


Lys 


Ala 


Asn Gly Leu 


His 


Gly 






80 










85 










90 












ttc 


atg 


aat 


tgg 


cct 


cat 


aat 


ctg 


eta 


aeg 


gac 


age 


gge ggt 


ttc 


cag 


337 


Phe 


Met 


Asn 


Trp 


Pro 


His 


Asn 


Leu 


Leu Thr Asp 


Ser 


Gly Gly 


Phe 


Gin 




95 










100 










105 










110 




atg 


gtg 


teg 


ctg 


gtg 


tct 


ctg 


tec 


gag 


gtg 


aeg 


gag 


gag 


gge 


gtc 


cgc 


385 


Met 


Val 


Ser 


Leu 


Val 


Ser 


Leu 


Ser 


Glu 


Val 


Thr 


Glu 


Glu Gly 


Val 


Arg 












115 










120 










125 






ttc 


cgc 


tec 


ccc 


tac 


gac 


gge 


aat 


gag 


acc 


ctg 


ctg 


age 


ccg 


gag 




433 


Phe 


Arg 


Ser 


Pro 


Tyr Asp Gly Asn 


Glu 


Thr 


Leu 


Leu 


Ser 


Pro 


Glu 


Lys 










130 










135 










140 








tec 


gtg 


cag 


ate 


cag 




gcg 


ctg 


gge 


teg 


gac 


ate 


ate 


atg 




ctg . 


481 


Ser 


Val 


Gin 


He 


Gin 


Asn 


Ala 


Leu 


Gly 


Ser Asp 


He 


He 


Met 


Gin 


Leu 








145 










150 










155 










gae 




gtg 


gtt 


age 


agt 


act 


gtg 


act 


ggg 






gtg 


gag 






523 


Asp 


Asp 
160 


Val 


Val 


Ser 


Ser 


Thr 
165 


Val 


Thr 


Gly 


Pro 


Arg 
170 


Val 


Glu 









taggccatgt acaggtcaat ccgctggctg gaccggtgca ttgcagccca tcagcggccg 583

gacaagcaga acctcttcgc cattatccag ggtgggctgg acgcagatct ccgggccacc 643

tgccttgaag agatgaccaa gcgagacgtg cctggcttcg ccatcggggg cctgagcggg 703

ggtgagagca agtcgcagtt ctggcggatg gtggcgctga gcacctctcg gctgccgaag 763

gacaagcccc gatatctgat gggggttggg tatgttgtgg atagggaagc cagagcccta 823

cctgtgggaa gtggattcct ggggaccccc taccctgctt ggggaggtgg catttggggg 883

aaacggacac aggtctgatc tgaggagact aggaagacat ggctgtccct tgggggccat 943

tctgagggaa tatggcccag tctggggcag tgtgagtgtt gggaggggcc ctgggaagcc 1003

cctgaggttc tctgccccct cccgtcatgg ctgcaacccc agctatgcca ctgatctggt 1063

agtctgcgtg gctcttggat gtgacatgtt cgactgcgtc ttccccacac ggacagcgcg 1123

ctttggctct gccctggtgc ccactgggaa cctgcagttg aggaagaagg tgtttgagaa 1183 

ggacttcggc cccatagacc cggagtgcac ctgccccacg tgccaaaagc acagccgcgc 1243 

cttcctgcac gcactgctgc acagtgacaa cacggccgcg ctgcaccacc tcacggtcca 1303 

caacatcgcc taccagctgc agctcatgag cgccgtccgc accagcatcg tggagaagcg 1363 

cttcccggac ttcgtgcggg acttcatggg cgccatgtac ggggatccca ccctctgtcc 1423 

cacctgggcc actgacgctc tggcctctgt gggaatcaca ctgggctgac ctggcattgg 1483 

gagagggagg gaggaaggaa gggagggagg ggctggaaga tactgaagga ttcctttttg 1543 

aaaggttttt tttattgtaa cttaaaaaaa aaaaaaaaa 1582 



<210> 156 
<211> 104 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 45..104

<400> 156 

agagctgagc tgaagcggga cccggagccc gagcagccgc cgcc atg gca atc aaa 56

Met Ala Ile Lys
1

ttt ctg gaa gtc atc aag ccc ttc tgt gtc atc ctg caa cta aca tct 104
Phe Leu Glu Val Ile Lys Pro Phe Cys Val Ile Leu Gln Leu Thr Ser
5 10 15 20

<210> 157 
<211> 150 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 2..70

<220>
<221> polyA_site
<222> 135..150

<220>
<221> polyA_signal
<222> 115..120

<400> 157 

t ttt ctc aac aat tcc tca ccg cag gag cct ctg aag ctc cca cca ggc 49

Phe Leu Asn Asn Ser Ser Pro Gln Glu Pro Leu Lys Leu Pro Pro Gly
1 5 10 15

cag ctc tcc tcc acc cac cgc taactctcag ccccagtcac cctcttggag 100

Gln Leu Ser Ser Thr His Arg
20

cttccctgct ttgaattaaa gaccactcat gcgcaaaaaa aaaaaaaaaa 150

<210> 158 
<211> 510 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 150..284

<220>
<221> polyA_site
<222> 495..510

<220>
<221> polyA_signal
<222> 475..480

<400> 158 

aaccaaagca tgggctcttt gtttcctctt tgccaggcct gggtccttcc cagtgacccc 60 
tcagccttca cttcctcctc actcccgggg agctgtgaca ggaagaggag atcccagaac 120 
tggagattga cgtggatgag ctcctggac atg gag agt gac gat gcc cgg gct 173

Met Glu Ser Asp Asp Ala Arg Ala

1 5

gcc agg gtc aag gag ctg ctg gtt gac tgt tac aaa ccc aca gag gcc 221
Ala Arg Val Lys Glu Leu Leu Val Asp Cys Tyr Lys Pro Thr Glu Ala

10 15 20

ttc att tyt ggc ctg ctg gac aag atc cgg ggc atg cag aag ctg agc 269

Phe Ile Xaa Gly Leu Leu Asp Lys Ile Arg Gly Met Gln Lys Leu Ser
25 30 35 40

aca ccc cag aag aag tgagggtccc cgacccagga gaacggtggc tcccacagga 324
Thr Pro Gln Lys Lys
45

caatcgctgc cccccaacct cgtagcaaca gcaataccgg gggaccctgc ggccaggcct 384
ggtgccatga gcagggctcc tcgtgcccct ggcccagggg tctcttcccc tgccccctca 444
gttttccact tttggggttt ttttattgtt attaaactga tgggacttct aaaaaaaaaa 504
aaaaaa 510 

<210> 159 
<211> 390 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 10..336

<220>
<221> polyA_site
<222> 373..390

<220>
<221> polyA_signal
<222> 354..359

<400> 159 

agtctgaag atg gcg gcc tca gca gcg cga ggt gct gcg gcg ctg cgt aga 51
Met Ala Ala Ser Ala Ala Arg Gly Ala Ala Ala Leu Arg Arg 







1 








5 










10 










agt 


ate 


aat 


cag 


ccg 


gtt 


get 


ttt 


gtg 


aga 




att 


cct 


tgg 


act 


geg 


99 


Ser 


He 




Gin 


Pro 


Val 


Ala 


Phe 


Val 


Arg 


Arg 


He 


Pro 


Trp 


Thr 


Ala 




15 










20 










25 










30 




geg 


teg 


agt 


cag 


ctg 




gaa 




ttt 


gca 


cag 


ttc 


ggc 


cat 


gtc 




147 


Ala 


Ser 


Ser 


Gin 


Leu 
35 


Lys 


Glu 


His 


Phe 


Ala 
40 


Gin 


Phe 


Gly 


His 


Val 
45 


Arg 




agg 


tgc 


att 


tta 


cct 


ttt 


gac 


aag 


gag 


act 


ggc 


ttt 


eac 


aga 


ggt 


ttg 


195 


Arg 


Cys 


He 


Leu 
50 


Pro 


Phe 


Asp 


Lys 


Glu 
55 


Thr 


Gly 


Phe 


His 


Arg 
60 


Gly 


Leu 




ggt 


tgg 


gtt 


eag 


ttt 


tet 


tea 


gaa 


gaa 


gga 


ett 


egg 


aat 


gea 


eta 


eaa 


243 


Gly 


Trp 


Val 
65 


Gin 


Phe 


Ser 


Ser 


Glu 

70 


Glu 


Gly 


Leu 


Arg 


75 


Ala 


Leu 


Gin 




cag 


gaa 




cat 


att 


ata 


gat 


gga 


gta 


aag 


gtc 


cag 


gtt 




act 




291 


Gin 


Glu 
80 


Asn 


His 


He 


He 


Asp 
85 


Gly 


Val 


Lys 


Val 


Gin 
90 


Val 


His 


Thr 


Arg 
agg cca aaa ctt ccg caa aca tct gat gat gaa aag aaa gat ttt 336
Arg Pro Lys Leu Pro Gln Thr Ser Asp Asp Glu Lys Lys Asp Phe
95 100 105

tgagactgca gcctattaat aaagttaaca taactgacaa aaaaaaaaaa aaaa 390

<210> 160 

<211> 398 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 23..175

<220>
<221> polyA_site
<222> 383..398

<220>
<221> polyA_signal
<222> 365..370

<400> 160 

gagaaggtgc tttagtctga ag atg gcg gcc tca gca gcg cga ggt gct gcg 52

Met Ala Ala Ser Ala Ala Arg Gly Ala Ala
1 5 10

gcg ctg cat aag tat caa tca gcc ggt tgc ttt tgt gag aag aat tcc 100
Ala Leu His Lys Tyr Gln Ser Ala Gly Cys Phe Cys Glu Lys Asn Ser

15 20 25

ttg gac tgc ggc gtc gag tca gct gaa aga aca ctt tgc aca gtt cgg 148
Leu Asp Cys Gly Val Glu Ser Ala Glu Arg Thr Leu Cys Thr Val Arg

30 35 40

cca tgt cag aag gtg cat ttt acc ttt tgacaaggag actggctttc 195
Pro Cys Gln Lys Val His Phe Thr Phe

45 50

acagaggttt gggttgggtt cagttttctt cagaagaagg acttcggaat gcactacaac 255
aggaaaatca tattatagat ggagtaaagg tccaggttca cactagaagg ccaaaacttc 315
cgcaaacatc tgatgatgaa aagaaagatt tttgagactg cagcctatta ataaagttaa 375
cataaccaaa aaaaaaaaaa aaa 398

<210> 161 
<211> 425 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 127..423

<220>
<221> polyA_site
<222> 394..425

<400> 161 

c cca gtc aga ggg ctg aac atg tcc gag ttt gtc tgt aac ctg tca gca 49

Pro Val Arg Gly Leu Asn Met Ser Glu Phe Val Cys Asn Leu Ser Ala
1 5 10 15

agg cct tat gtg tac gac ctc att gcc gtg tcc aat cat tat gga gcc 97
Arg Pro Tyr Val Tyr Asp Leu Ile Ala Val Ser Asn His Tyr Gly Ala

20 25 30 

atg ggg gtt ggc cac tac taaagcagct tatgtgctat tttaccaacg 145 
Met Gly Val Gly His Tyr 
35 



tcgagatgat gaattttata agacaccttc acttagcagt tctggttcct ctgatggagg 205

gacacgacca agcagctctc agcagggctt tggggatgat gaggcttgca gcatggacac 265 

caactaatgc tgactccacg atcctgccac cctgtagcgc cagtgtaatc ccccaggaga 325 

acatctttga cactctgcag actgctagtg ttctgtctaa aaaccagaca aggaaatacc 385 

cttcttttat gagcagaagg aawcaaaaaa aaaaaaaaaa 425 



<210> 162 
<211> 438 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 3..41

<220>
<221> polyA_site
<222> 419..438



<400> 162 

ag atg tta ggc ttt gcc aaa aac tgg atc ttt agt atc act tgagcactca 51
   Met Leu Gly Phe Ala Lys Asn Trp Ile Phe Ser Ile Thr
   1               5                   10

ttggacacaa gaggtcaact gttttgtcat ttgtgtagta agaacaaaat aaagtgtgtg 111
gcattaaaga aataacactg tctggccggg atcggtggct cacgcctgta atcccagcac 171
tttgggaggc cgaggcgggc agatcacgag gtcaggagtt cgagaccagc            231
tgatgaaacc ctgtctctac            aaaattagcc aggcatggtg gtgcacacct 291
gtagtcccag ctgctcagga ggctgaggca ggagaatcgc tcgaacccgg gaggtggagg 351
ttgcagtgag ctgagatggc gccactgcat tccaggcctg gggggtgaca gagcaagact 411
cagtctcaga aaaaaaaaaa aaaaaag 438



<210> 163 
<211> 436
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 3..41

<220>
<221> polyA_site
<222> 419..436



<400> 163 

ag atg tta ggc ttt gcc aaa aac tgg atc ttt agt atc act tgagcactca 51
   Met Leu Gly Phe Ala Lys Asn Trp Ile Phe Ser Ile Thr
   1               5                   10

ttggacacaa gaggtcaact gttttgtcat ttgtgtagta agaacaaaat aaagtgtgtg 111
gcattaaaga aataacactg tctggccggg atcggtggct cacgcctgta atcccagcac 171
tttgggaggc cgaggcgggc agatcacgag gtcaggagtt cgagaccagc ctgaccaaca 231
tgatgaaacc ctgtctctac taaaaataca aaaattagcc aggcatggtg gtgcacacct 291
gtagtcccag ctgctcagga ggctgaggca ggagaatcgc tcgaacccgg gaggtggagg 351
ttgcagtgag ctgagatggc gccactgcat tccaggcctg gggggtgaca gagcaagact 411
cagtctcaga aaaaaaaaaa aaaaa 436



<210> 166 
<211> 436 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 3..41

<220>
<221> polyA_site
<222> 421..436

<400> 166 

ag atg tta ggc ttt gcc aaa aac tgg atc ttt agt atc act tgagcactca 51
   Met Leu Gly Phe Ala Lys Asn Trp Ile Phe Ser Ile Thr
   1               5                   10

ttggacacaa gaggtcaact gttttgtcat ttgtgtagta agaacaaaat aaagtgtgtg 111
gcattaaaga aataacactg tctggccggg atcggtggct cacgcctgta atcccagcac 171
tttgggaggc cgaggcgggc agatcacgag gtcaggagtt cgagaccagc ctgaccaaca 231
tgatgaaacc ctgtctctac taaaaataca aaaattagcc aggcatggtg gtgcacacct 291
gtagtcccag ctgctcagga ggctgaggca ggagaatcgc tcgaacccgg gaggtggagg 351
ttgcagtgag ctgagatggc gccactgcat tccaggcctg gggggtgaca gagcaagact 411
cagtctcaga aaaaaaaaaa aaaaa 436

<210> 167 
<211> 436 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 3..41

<220>
<221> polyA_site
<222> 419..436

<400> 167 

ag atg tta ggc ttt gcc aaa aac tgg atc ttt agt atc act tgagcactca 51
   Met Leu Gly Phe Ala Lys Asn Trp Ile Phe Ser Ile Thr
   1               5                   10

ttggacacaa gaggtcaact gttttgtcat ttgtgtagta agaacaaaat taagtgtgtg 111
gcattaaaga aataacactg tctggccggg atcggtggct cacgcctgta atcccagcac 171
tttgggaggc cgaggcgggc agatcacgag gtcaggagtt cgagaccagc ctgaccaaca 231
tgatgaaacc ctgtctctac taaaaataca aaaattagcc aggcatggtg gtgcacacct 291
gtagtcccag ctgctcagga ggctgaggca ggagaatcgc tcgaacccgg gaggtggagg 351
ttgcagtgag ctgagatggc gccactgcat tccaggcctg gggggtgaca gagcaagact 411
cagtctcaga aaaaaaaaaa aaaaa 436

<210> 168 
<211> 394 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 10..336

<220>
<221> polyA_site
<222> 373..394

<220>
<221> polyA_signal
<222> 354..359

<400> 168 

agtctgaag atg gcg gcc tca gca gcg cga ggt gct gcg gcg ctg cgt aga 51
          Met Ala Ala Ser Ala Ala Arg Gly Ala Ala Ala Leu Arg Arg
          1               5                   10

agt atc aat cag ccg gtt gct ttt gtg aga aga atw cct tgg act gcg 99
Ser Ile Asn Gln Pro Val Ala Phe Val Arg Arg Ile Pro Trp Thr Ala
15                  20                  25                  30

gcg tcg agt cag ctg aaa gaa cac ttt gca cag ttc ggc cat gtc aga 147
Ala Ser Ser Gln Leu Lys Glu His Phe Ala Gln Phe Gly His Val Arg
                35                  40                  45

agg tgc att tta cct ttt gac aag gag act ggc ttt cac aga ggt ttg 195
Arg Cys Ile Leu Pro Phe Asp Lys Glu Thr Gly Phe His Arg Gly Leu
            50                  55                  60

ggt tgg gtt     ttt tct t               ctt     aat gca cta     243
Gly Trp Val Gln Phe Ser Ser Glu Glu Gly Leu Arg Asn Ala Leu Gln
        65                  70                  75

cag gaa aat cat att ata gat gga gta aag         gtt     act aga 291
Gln Glu Asn His Ile Ile Asp Gly Val Lys Val Gln Val His Thr Arg
    80                  85                  90

agg cca aaa ctt ccg caa aca tct gat gat gaa aag aaa gat ttt 336
Arg Pro Lys Leu Pro Gln Thr Ser Asp Asp Glu Lys Lys Asp Phe
95                  100                 105

tgagactgca gcctattaat aaagttaaca taactgaaaa aaaaaaaaaa aaggtcaa 394



<210> 169 
<211> 772 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 595..771



<400> 169 

atataggcgt ggctgaatac gggcctcctg cgtacgtgcg tggtgtacgt acgtgcgtga 60 
ttacgcgcac acgtacgttc ctcatgaaag ggacgacggg agctgcatga aagccgaagt 120 
tatggaccgc tagcatctgt cactggccac cggtttccgg gagtaagcgg cagctacctt 180 
acagccctga cacgagccgg gtgctctctc ttctcaccgc ggcccacgtc tcctcgctgg 240 
ctccggtggc ctcgctgggt cgcgaggagg cggaggactg tactctgagg ccaaaagcca 300 
gagtcggccc tgaacgccca cgactctcag ggtccagagg ccgtgagacc ggccgcggct 360 
gaaaggtaaa gaaaccaagt ggaagagtgt ttcctcctct ggccgtaaag caggtactct 420 
ctgcagcacc agctgtcccc gccctactcc ggaccgcccc aaagactcca tgggatggac 480 
ctgagtcagc cgaatcctag ccccttccct tgggcctgct gtggtgctcg acatcagtga 540 
cagacggaag cagcagacca tcaaggctac gggaggcccg gggcgcttgc gaag atg 597
                                                            Met
                                                            1

aag ttt ggc tgc ctc tcc ttc cgg cag cct tat gct ggc ttt gtc tta 645
Lys Phe Gly Cys Leu Ser Phe Arg Gln Pro Tyr Ala Gly Phe Val Leu
            5                   10                  15

aat gga atc aag act gtg gag acg cgc tgg cgt cct ctg ytg agc agc 693
Asn Gly Ile Lys Thr Val Glu Thr Arg Trp Arg Pro Leu Leu Ser Ser
        20                  25                  30

cag cgg aac tgt acc atc gcc gtc cac att gct cac agg gac tgg gaa 741
Gln Arg Asn Cys Thr Ile Ala Val His Ile Ala His Arg Asp Trp Glu
    35                  40                  45

ggc gat gcc tgt cgg gag ctg ctg gtg gag a 772
Gly Asp Ala Cys Arg Glu Leu Leu Val Glu



<210> 170 
<211> 128 
<212> PRT 

<213> Homo sapiens 



<220>
<221> SIGNAL
<222> -34..-1

<220>
<221> UNSURE
<222> 7
<223> Xaa = Ile,Thr

<400> 170

Met Ser Leu Arg Leu Asp Thr Thr Pro     Cys Asn Ser Ala Arg Pro
-30 -25 -20

Leu His Ala Leu Gln Val     Leu Leu Leu Ser Leu Leu Leu Thr Ala
-15 -10 -5

Leu Ala Ser Ser Thr Lys Gly Gln Xaa Lys Arg Asn Leu Ala Lys Gly
1 5 10

Lys Asp Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys Met
15 20 25 30

Cys Ile Lys Thr Thr Ser Gly Ile His Pro Lys Asn Ile Gln Ser Leu
35 40 45

Glu Val Ile Gly Lys Gly Thr His Cys Asn Gln Val Glu Val Ile Ala
50 55 60

Thr Leu Lys Asp Gly Arg Lys Ile Cys Leu Asp Pro Asp Ala Pro Arg
65 70 75

Ile Lys Lys Ile Val Gln Lys Lys Leu Ala Gly Asp Glu Ser Ala Asp
80 85 90

<210> 171
<211> 160
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -18..-1

<400> 171

Met Lys Ala Ala Val Leu     Leu Ala Val Leu Phe Leu Thr Gly
-15 -10 -5

Gln Ala Arg His Phe Trp Gln Gln Asp Glu Pro Pro Gln Ser Pro Trp
1 5 10

Asp Arg Val Lys     Ala Thr Val Tyr Val Asp Val Leu Lys Asp
15 20 25 30

Ser Gly Arg Asp Tyr Val Ser Gln Phe Glu Gly Ser Ala Leu Gly Lys
35 40 45

Gln Leu Asn Leu Lys Leu Leu Asp Asn Trp Asp Ser Val Thr Ser Ala
50 55 60

Phe Ser Asn Leu Arg Glu Gln Leu Gly Pro Val Thr Gln Glu Phe Trp
65 70 75

Asp Asn Leu Glu Lys Glu Thr Glu Gly Leu Arg Gln Glu Met Ser Lys
80 85 90

Asp Leu Glu Glu Val Lys Ala Lys Val Gln Pro Tyr Leu Asp Asp Phe
95 100 105 110

Gln Lys Lys Trp Gln Glu Glu Met Glu Leu Tyr Arg Gln Lys Ala Ala
115 120 125

Phe Leu Thr Ile Leu Thr Ser Leu Gly Pro Asn Gly Asn Lys Ala Phe
130 135 140

<210> 172 
<211> 80 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -25..-1

<220>
<221> UNSURE
<222> 52
<223> Xaa = Leu, Pro

<400> 172

Met Gly Arg Ala Met Val Ala Arg Leu Gly Leu Gly Leu Leu Leu Leu
-25 -20 -15

Ala Leu Leu Leu Pro Thr Gln Ile Tyr Ser Ser Glu Thr Thr Thr Gly
-10 -5 1 5

Thr Ser Ser Asn Ser Ser Gln Ser Thr Ser Asn Ser Gly Leu Ala Pro
10 15 20

Asn Pro Thr Asn Ala Thr Thr Lys Val Ala Gly Gly Ala Leu Gln Ser
25 30 35

Thr Ala Ser Leu Phe Val Val Ser Leu Ser Leu Leu His Xaa Tyr Ser
40 45 50



<210> 173 
<211> 101 
<212> PRT 

<213> Homo sapiens 



<220>
<221> SIGNAL
<222> -27..-1
<223> Von Heijne matrix
score 11.8486806626293
seq LFLGLLLLPLVVA/FA

<400> 173

Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro Gly Leu Leu Phe
-25 -20 -15

Leu Gly Leu Leu Leu Leu Pro Leu Val Val Ala Phe Ala Ser Ala Glu
-10 -5 1 5

Ala Glu Glu Asp Gly Asp Leu Gln Cys Leu Cys Val Lys Thr Thr Ser
10 15 20

Gln Val Arg Pro Arg His Ile Thr Ser Leu Glu Val Ile Lys Ala Gly
25 30 35

Pro His Cys Pro Thr Ala Gln Leu Ile Ala Thr Leu Lys Asn Gly Arg
40 45 50

Lys Ile Cys Leu Asp Leu Gln Ala Pro Leu Tyr Lys Lys Ile Ile Lys
55 60 65

Lys Leu Leu Glu Ser
70

<210> 174 
<211> 101 
<212> PRT 

<213> Homo sapiens 



<220>
<221> SIGNAL
<222> -27..-1
<223> Von Heijne matrix
score 11.8486806626293
seq LFLGLLLLPLVVA/FA

<400> 174

Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro Gly Leu Leu Phe
-25 -20 -15

Leu Gly Leu Leu Leu Leu Pro Leu Val Val Ala Phe Ala Ser Ala Glu
-10 -5 1 5

Ala Glu Glu Asp Gly Asp Leu Gln Cys Leu Cys Val Lys Thr Thr Ser
10 15 20

Gln Val Arg Pro Arg His Ile Thr     Leu Glu Val     Ala Gly
25 30 35

Pro His Cys Pro Thr Ala Gln Leu Ile Ala Thr Leu Lys Asn Gly Arg
40 45 50

Lys Ile Cys Leu Asp Leu Gln Ala Pro Leu Tyr Lys Lys Ile Ile Lys
55 60 65

Lys Leu Leu Glu Ser
70

<210> 175
<211> 86
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -18..-1

<400> 175

Met Ile Pro Ala Val Val Leu Leu Leu Leu Leu Leu Val Glu Gln Ala
-15 -10 -5

Ala Ala Leu Gly Glu Pro Gln Leu Cys Tyr Ile Leu Asp Ala Ile Leu
1 5 10

Phe Leu Tyr Gly Ile Val Leu Thr Leu Leu Tyr Cys Arg Leu Lys Ile
15 20 25 30

Gln Val Arg Lys Ala Ala Ile Thr Ser Tyr Glu Lys Ser Asp Gly Val
35 40 45

Tyr Thr Gly Leu Ser Thr Arg Asn Gln Glu Thr Tyr Glu Thr Leu Lys
50 55 60

His Glu Lys Pro Pro Gln
65

<210> 176
<211> 92
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -24..-1

<220>
<221> UNSURE
<222> 52
<223> Xaa = Arg, Thr

<400> 176

Met Val Met Gly Leu Gly Val Leu Leu Leu Val Phe Val Leu Gly Leu
-20 -15 -10

Gly Leu Thr Pro Pro Thr Leu Ala Gln Asp Asn Ser Arg Tyr Thr His
-5 1 5

Tyr Asp Ala Lys Pro Gln Gly Arg Asp Asp Arg
10 15 20

Tyr Cys Glu Ser Ile Met Arg Arg Arg Gly Leu Thr Ser Pro Cys Lys
25 30 35 40

Asp Ile Asn Thr Phe Ile His Gly Asn Lys Arg Xaa Ser Arg Pro Ser
45 50 55

Val Lys Thr Arg Met Glu Thr Leu Thr Glu Lys Thr
60 65
<210> 177
<211> 111
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -16..-1

<400> 177

Met Leu Leu Ile Leu Leu Ser Val Ala Leu Leu Ala Phe Ser Ser Ala
-15 -10 -5

Gln Asp Leu Asp Glu Asp Val Ser Gln Glu Asp Val Pro Leu Val Ile
1 5 10 15

Ser Asp Gly Gly Asp Ser Glu Gln Phe Ile Asp Glu Glu Arg Gln Gly
20 25 30

Pro Pro Leu Gly Gly Gln Gln Ser Gln Pro Ser Ala Gly Asp Gly Asn
35 40 45

Gln Asn Asp Gly Pro Gln Gln Gly Pro Pro Gln Gln Gly Gly Gln Gln
50 55 60

Gln Gln Gly Pro Pro Pro Pro Gln Gly Lys Pro Gln Gly Pro Pro Pro
65 70 75 80

Gln Gly Gly Arg Pro Gln Gly Pro Pro Gln Gly Gln Ser Pro Gln
85 90 95

<210> 178
<211> 106
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -16..-1

<400> 178

Met Leu Leu Ile Leu Leu Ser Val Ala Leu Leu Ala Phe Ser Ser Ala
-15 -10 -5

Gln Asp Leu Asp Glu Asp Val Ser Gln Glu Asp Val Pro Leu Val Ile
1 5 10 15

Ser Asp Gly Gly Asp Ser Glu Gln Phe Ile Asp Glu Glu Arg Gln Gly
20 25 30

Pro Pro Leu Gly Gly Gln Gln Ser Gln Pro Ser Ala Gly Asp Gly Asn
35 40 45

Gln Asp Asp Gly Pro Gln Gln Gly Pro Pro Gln Gln Gly Gly Gln Gln
50 55 60

Gln Gln Gly Pro Pro Pro Pro Gln Gly Lys Pro Gln Gly Pro Pro Gln
65 70 75 80

Gln Gly Gly Gln Ser Cys Cys Cys Asp Lys
85 90

<210> 179
<211> 141
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -24..-1

<400> 179

Met Pro Ser Ser Val Ser Trp Gly Ile Leu Leu Leu Ala Gly Leu Cys
-20 -15 -10

Cys Leu Val Pro Val Ser Leu Ala Glu Asp Pro Gln Gly Asp Ala Ala
-5 1 5
sp




er 




IS 


sp 


Gin 


Asp 




ro 


Th 


Phe 


Asn 


n ^ys r 








15^ 










20^ 










Lys lie Thr 


ro 


A n 
sn 




Al 


Glu 


Ph 


Ala 


Phe 


Ser 
er 


Leu 
eu 


yr 




Gin 


















35 










40 


L6U Ala His 






sn 


er 




sn 


He 


Phe 


Ph 


r 


Pr 


Val 


Ser 






45^ 










50 










55 




lie Ala Thr 




Ph 


Ala 


Met 


Leu 
eu 


Ser 


Leu 


Gly 


Thr 


ys 


Ala 


Asp Thr 




60^ 




















70 






His Asp Glu Ile Leu Glu Gly Leu Asn Phe Asn Leu Thr Glu Ile Pro
75 80 85

Glu Ala Gln Ile His Glu     Phe Gln Glu Leu     Arg Thr Leu Asn
90 95 100

Gln Pro Asp Ser Gln Leu Gln Leu Thr Thr Gly Lys Asn
105 110 115

<210> 180
<211> 124
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -30..-1

<400> 180

Met Pro Ala Cys Arg     Gly Pro Leu Ala Ala Ala Leu Leu     Ser
-30 -25 -20 -15

Leu Leu Leu Phe Gly     Thr Leu Val Ser Gly Thr Gly Ala Glu Lys
-10 -5 1

Thr Gly Val Cys Pro Gl Leu Gln Ala Asp Gln Asn Cys Thr Gln Glu
5 10 15

Cys Val Ser Asp Ser     Cys Ala Asp Asn Leu Lys Cys Cys Ser Ala
20 25 30

Gly Cys Ala Thr Phe     Ser Leu Pro Asn Asp Lys Glu Gly Ser Cys
35 40 45 50

Pro Gln Val Asn Ile Asn Phe Pro Gln Leu Gly Leu Cys Arg Asp Gln
55 60 65

Cys Gln Val Asp Ser Gl Cys Pro Gl Gln Met Lys Cys Cys Arg Asn
70 75 80

Gly Cys Gly Lys Val Ser Cys Val Thr Pro Asn Phe
85 90

<210> 181
<211> 124
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -30..-1

<400> 181

Met Pro Ala Cys     Leu Gly Pro Leu Ala Ala Ala Leu     Leu Ser
-30 -25 -20 -15

Leu Leu Leu Phe Gly Phe Thr Leu Val Ser Gly Thr Gly Ala Glu Lys
-10 -5 1

Thr Gly Val Cys Pro Glu     Gln Ala Asp Gln Asn Cys Thr Gln Glu
5 10 15

Cys Val Ser Asp Ser Glu Cys Ala Asp Asn Ile Lys Cys Cys Ser Ala
20 25 30

Gly Cys Ala Thr Phe Cys Ser Leu Pro     Asp     Glu Gly Ser Cys
35 40 45 50

Pro Gln Val Asn Ile Asn Phe Pro Gln Leu Gly Leu Cys Arg Asp Gln
55 60 65

Cys Gln Val Asp Ser Gln Cys Pro Gly Gln Met Lys Cys Cys Arg Asn
70 75 80

Gly Cys Gly Lys Val Ser Cys Val Thr Pro Asn Phe
85 90

<210> 182 
<211> 69 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL
<222> -17..-1
<223> Von Heijne matrix
score 9.5148977432302
seq FVFLLFVISLAAA/AH

<400> 182

Met Phe Pro Gly Phe Val Phe Leu Leu Phe Val Ile Ser Leu Ala Ala
-15 -10 -5

Ala Ala His Leu Trp Val Leu Ala Ala Phe Met Gly Arg Ile Thr Val
1 5 10 15

Lys Val Cys Ser Phe Thr Pro Glu Ala Ser Lys Thr Val Ser Pro Pro
20 25 30

Glu Gly Ala Asn Asn Ser Arg Arg Thr Ala Phe Lys Ser Cys Asn Thr
35 40 45

His His Lys Gly Leu
50

<210> 183 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL
<222> -33..-1

<220>
<221> UNSURE
<222> 6
<223> Xaa = Lys, Met

<400> 183

Met Gly Pro Val Lys Gln Leu Lys Arg Met Phe Glu Pro Thr Arg Leu
-30 -25 -20

Ile Ala Thr Ile Met Val Leu Leu Ser Phe Ala Leu Thr Leu Cys Ser
-15 -10 -5

Ala Phe Trp Trp His Asn Xaa Gly Leu Ala Leu Ile Phe Cys Ile Leu
1 5 10 15

Gln Ser Leu Ala Leu Thr Trp Tyr Ser Leu Ser Phe Ile Pro Phe Ala
20 25 30

Arg Asp Ala Val Lys Lys Cys Phe Ala Val Cys Leu Ala
35 40

<210> 184 
<211> 146 
<212> PRT 

<213> Homo sapiens 



<220> 
<221> SIGNAL
<222> -26..-1

<400> 184

Met Thr Met Arg Ser Leu Leu     Thr Pro Phe Leu Cys Gly Leu Leu
-25 -20 -15

Trp Ala Phe Cys Ala Pro Gly Ala Arg Ala Glu Glu Pro Ala Ala Ser
-10 -5 1 5

Phe Ser Gln Pro Gly Ser Met Gly Leu Asp Lys Asn Thr Val His Asp
10 15 20

Gln Glu His     Met Glu His Leu Glu Gly Val Ile Asn     Pro Glu
25 30 35

Ala Glu Met Ser Pro Gln Glu Leu Gln Leu His Tyr Phe Lys Met His
40 45 50

Asp Tyr Asp Gly Asn Asn Leu Leu Asp Gly Leu Glu Leu Ser Thr Ala
55 60 65 70

Ile Thr His Val His Lys Glu Glu Gly Ser Glu Gln Ala Pro Leu Met
75 80 85

Ser Glu     Glu     Ile Asn Ile Ile Asp Gly Val Leu Arg Asp Asp
90 95 100

Asp Lys Asn Asn Asp Gly Tyr Ile Asp Tyr Ala Glu Phe Ala Lys Ser
105 110 115

Leu Gln
120



<210> 185 
<211> 62 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL
<222> -46..-1
<223> Von Heijne matrix
score 8.64329745298384
seq AVLLLILFAIVFG/LL

<400> 185

Met Asp Gln Leu Val Phe Lys Glu Thr Ile Trp Asn Asp Ala Phe Trp
-45 -40 -35

Gln Asn Pro Trp Asp Gln Gly Gly Leu Ala Val Ile Ile Leu Phe Ile
-30 -25 -20 -15

Thr Ala Val Leu Leu Leu Ile Leu Phe Ala Ile Val Phe Gly Leu Leu
-10 -5 1

Thr Ser Thr Glu Asn Thr Gln Cys Glu Ala Gly Glu Glu Glu
5 10 15



<210> 186 
<211> 62 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL
<222> -46..-1
<223> Von Heijne matrix
score 8.64329745298384
seq AVLLLILFAIVFG/LL

<400> 186

Met Asp Gln Leu Val Phe Lys Glu Thr Ile Trp Asn Asp Ala Phe Trp
-45 -40 -35

Gln Asn Pro Trp Asp Gln Gly Gly Leu Ala Val Ile Ile Leu Phe Ile
-30 -25 -20 -15

Thr Ala Val Leu Leu Leu Ile Leu Phe Ala Ile Val Phe Gly Leu Leu
-10 -5 1

Thr Ser Thr Glu Asn Thr Gln Cys Glu Ala Gly Glu Glu Glu
5 10 15



<210> 187 
<211> 162 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL
<222> -47..-1

<400> 187

Met Arg Ile Ala Asn Arg Thr Arg Phe Ser Ser Pro Phe Leu Ala Arg
-45 -40 -35

Gly Ala Gly Trp Thr His Gly Arg Gly Met Met Val Val Gly Thr Gly
-30 -25 -20

Thr Ser Leu Ala Leu Ser Ser Leu Arg Ser Leu Leu Leu Phe Ala Gly
-15 -10 -5 1

Met Gln Met Tyr Ser Arg Gln Leu Ala Ser Thr Glu Trp     Thr Ile
5 10 15

Gln Gly Gly Leu Leu Gly Ser Gly Leu Phe Val Phe Ser Leu Thr Ala
20 25 30

Phe Asn Asn Leu Glu Asn Leu Val Phe Gly Lys Gly Phe Gln Ala Lys
35 40 45

Ile Phe Pro Glu Ile Leu Leu Cys Leu Leu Leu Ala Leu Phe Ala Ser
50 55 60 65

Gly Leu Ile His Arg Val Cys Val Thr Thr Cys Phe Ile Phe Ser Met
70 75 80

Val Gly Leu Tyr Tyr Ile Asn     Ile Ser Ser Thr Leu Tyr Gln Ala
85 90 95

Ala Ala Pro Val Leu Thr Pro Ala Lys Val Thr Gly Lys Ser Lys Lys
100 105 110

Arg Asn
115

<210> 188 
<211> 90 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL
<222> -35..-1

<400> 188

Met Glu Thr Gly Ala Ser Ala Ser Ile Pro Glu Leu Ile Cys Glu Ala
-35 -30 -25 -20

Met Arg Arg Ile Trp Ser Leu Gly Leu Gly Leu Val Thr Leu Thr Ala
-15 -10 -5

Ser Trp Ala Ala Leu Phe His Asp Gly Phe Ala Val Leu Gly Gly Asn
1 5 10

Ile Val Ser Asp Leu Ser Thr Val Arg Phe Val Ala Gln Gln Gln His
15 20 25

Phe Gln Leu Leu Asp Val Trp Thr Arg Asn Phe Arg Lys Pro Leu Gly
30 35 40 45

Ser Met Cys Phe Val Phe Leu Leu Leu Pro
50 55

<210> 189 
<211> 146
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -25..-1

<220>
<221> UNSURE
<222> 60
<223> Xaa = Lys,Arg

<400> 189

Met Arg Leu Leu Gln Leu Leu Phe Arg Ala Ser Pro Ala Thr Leu Leu
-25 -20 -15 -10

Leu Val Leu Cys Leu Gln Leu Gly Ala Asn Lys Ala Gln Asp Asn Thr
-5 1 5

Arg Lys Ile Ile Ile Lys Asn Phe Asp Ile Pro Lys Ser Val Arg Pro
10 15 20

Asn Asp Glu Val Thr Ala Val Leu Ala Val Gln Thr Glu Leu Lys Glu
25 30 35

Cys Met Val Val Lys Thr Tyr Leu Ile Ser Ser Ile Pro Leu Gln Gly
40 45 50 55

Ala Phe Asn Tyr Xaa Tyr Thr Ala Cys Leu Cys Asp Asp Asn Pro Lys
60 65 70

Thr Phe Tyr Trp Asp Phe Tyr Thr Asn Arg Thr Val Gln Ile Ala Ala
75 80 85

Val Val Asp Val Ile Arg Glu Leu Gly Ile Cys Pro Asp Asp Ala Ala
90 95 100

Val Ile Pro Ile Lys Asn Asn Arg Phe Tyr Thr Ile Glu Ile Leu Lys
105 110 115

Val Glu
120

<210> 190
<211> 120
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -32..-1
<223> Von Heijne matrix
score 7.54815044221151
seq VTLGIGFFALASA/LW

<400> 190

Met Ala Ser Pro Lys Gly Phe Phe Asn Tyr Leu Thr Tyr Phe Leu Ala
-30 -25 -20

Ala Gly Ala Val Thr Leu Gly Ile Gly Phe Phe Ala Leu Ala Ser Ala
-15 -10 -5

Leu Trp Phe Leu Ile Cys Lys Arg Arg Glu Ile Phe Gln Asn Ser Lys
1 5 10 15

Phe Lys Ala Ile Asp Glu Arg Cys Arg Gln     Pro Ser Met Ala Lys
20 25 30

Ile Lys Ser His Ser Gln Cys Val Phe Ile Ser Arg Asn Phe His Thr
35 40 45

Gly Arg Phe Gln     Gln Gln Leu Lys Ile Ile Leu Lys Met Asn Pro
50 55 60

Asn Leu Gln Gln Lys Ile Ser Phe Val Ile Pro Gln Arg Pro Ala Pro
65 70 75 80

Gln Gln Ile Ala Ala Val Leu His
<210> 191
<211> 78
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -19..-1

<400> 191

Met Lys Phe Phe Val Phe Ala Leu Val Leu Ala Leu Met Ile Ser Met
-15 -10 -5

Ile Ser Ala Asp Ser His Glu Lys Arg His His Gly Tyr Arg Arg Lys
1 5 10

Phe His Glu Lys His His Ser Tyr His Ile Thr Leu Leu Pro Leu Phe
15 20 25

Glu Glu Ser Ser Lys Ser Asn Ala Asn Glu Lys His Tyr Asn Leu Leu
30 35 40 45

Tyr Thr Leu Cys Phe Arg Ile Leu Ala Phe Ser Ile Val Thr
50 55



<210> 192 
<211> 87 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL
<222> -20..-1
<223> Von Heijne matrix
score 7.42708462116258
seq LLVFLAGFPVLDA/ND

<400> 192

Met Gln Lys Val Thr Leu Gly Leu Leu Val Phe Leu Ala Gly Phe Pro
-20 -15 -10 -5

Val Leu Asp Ala Asn Asp Leu Glu Asp Lys Asn Ser Pro Phe Tyr Tyr
1 5 10

Asp Trp His Ser Leu Gln Val Gly Gly Leu Ile Cys Ala Gly Val Leu
15 20 25

Cys Ala Met Gly Ile Ile Ile Val Met Ser Ala Lys Cys Lys Cys Lys
30 35 40

Phe Gly Gln Lys Ser Gly His His Pro Gly Glu Thr Pro Pro Leu Ile
45 50 55 60

Thr Pro Gly Ser Ala Gln Ser
65



<210> 193 
<211> 105 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL
<222> -19..-1
<223> Von Heijne matrix
score 7.20796835452081
seq VSIMLLLVTVSDC/AV

<220>
<221> UNSURE
<222> 37
<223> Xaa = *, Glu

<400> 193

Met Arg Gly Ala Thr Arg Val Ser Ile Met Leu Leu Leu Val Thr Val
-15 -10 -5

Ser Asp Cys Ala Val Ile Thr Gly Ala Cys Glu Arg Asp Val Gln Cys
1 5 10

Gly Ala Gly Thr Cys Cys Ala Ile Ser Leu Trp Leu Arg Gly
15 20 25

Met Cys Thr Pro Leu Gly Arg Xaa Gly Glu Glu Cys His Pro Gly Ser
30 35 40 45

His Lys Ile Pro Phe Phe Arg Lys Arg Lys His His Thr Cys Pro Cys
50 55 60

Leu Pro Asn Leu Leu Cys Ser     Phe Pro Asp Gly Arg Tyr Arg Cys
65 70 75

Ser Met Asp Leu Lys Asn Ile Asn Phe
80 85

<210> 194 
<211> 105 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL
<222> -19..-1
<223> Von Heijne matrix
score 7.20796835452081
seq VSIMLLLVTVSDC/AV

<400> 194

Met Arg Gly Ala Thr Arg Val Ser Ile Met Leu Leu Leu Val Thr Val
-15 -10 -5

Ser Asp Cys Ala Val Ile Thr Gly Ala Cys Glu Arg Asp Val Gln Cys
1 5 10

Gly Ala Gly Thr Cys Cys Ala Ile Ser Leu Trp Leu Arg Gly Leu Arg
15 20 25

Met Cys Thr Pro Leu Gly Arg Glu Gly Glu Glu Cys His Pro Gly Ser
30 35 40 45

His Lys Ile Pro Phe Phe Arg Lys Arg Lys His His Thr Cys Pro Cys
50 55 60

Leu Pro Asn Leu Leu Cys Ser Arg Phe Pro Asp Gly Arg Tyr Arg Cys
65 70 75

Ser Met Asp Leu Lys Asn Ile Asn Phe
80 85



<210> 195 
<211> 102 
<212> PRT 
<213> Homo sapiens 

<220>
<221> SIGNAL
<222> -36..-1

<220>
<221> UNSURE
<222> 21
<223> Xaa = Ile,Met

<220>
<221> UNSURE
<222> 52
<223> Xaa = Ile,Ser










<400> 195 














Met Ser Asn 


Thr 


Thr 


Val 


Pro 


Asn 


Ala 


-35 








-30 






Met Val Gly 


Tyr Val 


Leu 


Gly 


Pro 


Phe 


-20 














Val Val Val 


Ala 


Val 


Val 


Met 


Tyr 


Val 






1 










Arg Leu Arg 


His 


His 


Leu 


Leu 


Pro 


Xaa 


15 










20 




Glu Leu His 


Glu 


Ala 


Glu 


Gin 


Gin 


Leu 


30 








35 






Lys Val Val 


His 


Gly 




Gin 


Xaa 


Gly 


45 






50








Leu Leu Asp 


Val 


Lys 


Thr 












65 










<210> 196
<211> 137
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -18..-1












<400> 196 














Met Lys Ala 


Leu 


He 


Val 


Leu 


Gly 


Leu 




-15 










-10 


Gin Gly Lys 


Val 


Phe 


Glu 


Arg 


Cys 


Glu 


1 














Leu Gly Met 


Asp 


Gly 


Tyr 


Arg 


Gly 


He 


15 






20 








Leu Ala Lys 


Trp 


Glu 


Ser 


Gly 


Tyr 


Asn 






35 










Ala Gly Asp Arg 


Ser 


Thr 


Asp 


Tyr 






50 










55


Tyr Trp Cys Asn 


Asp 


Gly 


Lys 


Thr 


Pro 


65 










70 




Leu Ser Cys 


Ser 


Gly 


Trp 


His 


Gly 


Glu 


80 














Ser Val Ser 


Met 


Phe 


Lys 


Val


Val


Glu


95 






100 








Leu Gin Leu 


He 


Leu 


Ser 


Leu 


Ser 


His 






115 










<210> 197
<211> 101
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -29..-1












<400> 197 














Met Arg Glu 


Glu 


Lys 


Lys 


Pro 


Phe 


Glu 






-25 










Val Cys Met 


Cys 


Val 


Phe 


Ser 


Thr 


Gin 




-10 










-5 


Ala His Phe 


He 


Asp 


Glu 


Lys 


Leu 


Arg 
Gln


Ala 


Asn 


Ser 


Asp Ser 






-25 










Phe 


Leu 


He 


Thr 


Leu 


Val 


Gly 




-10 










-5 


Gin 


Lys 


Lys 


Lys 


Arg 


Val 


Asp 










10 






Tyr 


Ser 


Tyr 


Asp 


Pro 


Ala 


Glu 








25 








Leu 


Ser Asp 


Met 


Gly 


Asp 


Pro 






40 










Tyr 


Gin 


His 


Lys Arg 


Met 


Pro 




55 










60 


Val 




Leu 


Ser 


Val 


Thr 


Val 










-5 






Leu 


Ala 


Arg 


Thr 


Leu 


Lys 


Arg 






10 












Leu 


Ala 


Asn 


Trp 


Met 


Cys 




25 










30 


Thr 


Arg 


Ala 


Thr 


Asn 


Tyr Asn 


40 










45 




He 


Phe 


Gin 


He 


Asn 


Ser 


Arg 










60 






Gly 


Ala 


Val 


Asn 


Ala 


Cys 


His 








75 








He 


Val 


Val 


Lys 


Thr 


Glu 


Met 


Cys 


Asn 


90 


Arg 


He 


Phe 






105 










110 


Arg 


Glu 


Arg 


Glu 


Ser 


Val 


Cys 


-20 










-15 




Gly 


Ala 


Leu 


Gly Glu 


Met 


Ala 










1 






Pro 


Ser 


Glu 


Gly Asn 


Gly His 
5 10
Arg Gly Thr Leu Asp Ser Leu Ser Ser
20 25
Ser Thr Ala Asp Pro Thr Gln Ala Gly
40

Leu Pro Val Thr Ser Arg Ser Gln Pro

55 60
Pro Ala Ala Ala Leu

70



15

Asp Gln Glu Ser Tyr Ile Pro

30 35
Pro Glu Leu Leu His Lys Asn
45 50
Leu Pro Ser Asp Leu Ala Ile
65



<210> 198
<211> 128
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -26..-1

<220>
<221> UNSURE
<222> 49
<223> Xaa = Glu,Lys

<400> 198

Met Cys Trp Leu Arg Ala Trp Gly Gln Ile Leu Leu Pro Val Phe Leu
    -25                 -20                 -15

Ser Leu Phe Leu Ile Gln Leu Leu Ile Ser Phe Ser Glu Asn Gly Phe
-10                  -5                   1               5

Ile His Ser Pro Arg Asn Asn Gln Lys Pro Arg Asp Gly Asn Glu Glu
             10                  15                  20

Glu Cys Ala Val Lys Lys Ser Cys Gln Leu Cys Thr Glu Asp Lys Lys
         25                  30                  35

Cys Val Trp Cys Ser Glu Glu Lys Ala Cys Xaa Lys Tyr Cys Phe Pro
     40                  45                  50

Tyr Phe Gly Cys Arg Phe Ser Ser Ile Tyr Trp Leu Asn Cys Lys Val
 55                  60                  65                  70

Asp Met Phe Gly Ile Met Met Leu Leu Leu Ile Ala Val Leu Ile Thr
                 75                  80                  85

Gly Phe Val Trp Tyr Cys Cys Ala Tyr His Phe Tyr Leu Gln Asp Ile
             90                  95                 100



<210> 199
<211> 142
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -24..-1

<220>
<221> UNSURE
<222> 90
<223> Xaa = Asp,Glu

<220>
<221> UNSURE
<222> 80
<223> Xaa = Leu,Arg,Trp

<220>
<221> UNSURE
<222> 79
<223> Xaa = Lys,Asn

<400> 199

Met Arg Leu Ser Trp Phe Arg Val Leu Thr Val Leu Ser Ile Cys Leu
                -20                 -15                 -10

Ser Ala Val Ala Thr Ala Thr Gly Ala Glu Gly Lys Arg Lys Leu Gln
             -5                   1               5

Ile Gly Val Lys Lys Arg Val Asp His Cys Pro Ile Lys Ser Arg Lys
     10                  15                  20

Gly Asp Val Leu His Met His Tyr Thr Gly Lys Leu Glu Asp Gly Thr
 25                  30                  35                  40

Glu Phe Asp Ser Ser Leu Pro Gln Asn Gln Pro Phe Val Phe Ser Leu
                 45                  50                  55

Gly Thr Gly Gln Val Ile Lys Gly Trp Asp Gln Gly Leu Leu Gly Met
             60                  65                  70

Cys Glu Gly Glu Lys Arg Xaa Xaa Val Ile Pro Ser Glu Leu Gly Tyr
         75                  80                  85

Gly Xaa Arg Gly Ala Pro Pro Lys Ile Pro Gly Gly Ala Thr Leu Val
     90                  95                 100

Phe Glu Val Glu Leu Leu Lys Ile Glu Arg Arg Thr Glu Leu
105                 110                 115












<210> 200
<211> 70
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -24..-1
<223> Von Heijne matrix
score 6.54728084640774
seq ILLTRPACLGSWA/EI

<400> 200

Met Ala Ser Ser Thr Ser Val Ser Thr Gly Gln Ile Leu Leu Thr Arg
                -20                 -15                 -10

Pro Ala Cys Leu Gly Ser Trp Ala Glu Ile Arg Ser Pro Val Arg Thr
             -5                   1               5

Ile Ser Ile Ala Ser Asp Phe Pro Thr Ala Arg Val Ser Leu Trp Val
     10                  15                  20

Pro Pro Ala Pro Gly Met Val Pro Ile Lys Ile Ser Gly Cys Ala Asn
 25                  30                  35                  40

Trp Ala Phe Ser Pro Ala
                 45



<210> 201
<211> 113
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -23..-1

<220>
<221> UNSURE
<222> 8
<223> Xaa = His,Lys,Asn,Gln

<400> 201

Met Ala Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly Ala Leu Ser Gly
            -20                 -15                 -10

Ala Ala Ala Leu Gly Phe Ala Ser Tyr Gly Ala His Gly Ala Xaa Phe
         -5                   1               5

Pro Asp Ala Tyr Gly Lys Glu Leu Phe Asp Lys Ala Asn Lys His His
 10                  15                  20                  25

Phe Leu His Ser Leu Ala Leu Leu Gly Val Pro His Cys Arg Lys Pro
                 30                  35                  40

Leu Trp Ala Gly Leu Leu Leu Ala Ser Gly Thr Thr Leu Phe Cys Thr
             45                  50                  55

Ser Phe Tyr Tyr Gln Ala Leu Ser Gly Asp Pro Ser Ile Gln Thr Leu
         60                  65                  70

Ala Pro Ala Gly Gly Thr Leu Leu Leu Leu Gly Trp Leu Ala Leu Ala
     75                  80                  85

Leu
 90














<210> 202
<211> 113
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -23..-1

<220>
<221> UNSURE
<222> 8
<223> Xaa = Lys,Gln

<400> 202

Met Ala Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly Ala Leu Ser Gly
            -20                 -15                 -10

Ala Ala Ala Leu Gly Phe Ala Ser Tyr Gly Ala His Gly Ala Xaa Phe
         -5                   1               5

Pro Asp Ala Tyr Gly Lys Glu Leu Phe Asp Lys Ala Asn Lys His His
 10                  15                  20                  25

Phe Leu His Ser Leu Ala Leu Leu Gly Val Pro His Cys Arg Lys Pro
                 30                  35                  40

Leu Trp Ala Gly Leu Leu Leu Ala Ser Gly Thr Thr Leu Phe Cys Thr
             45                  50                  55

Ser Phe Tyr Tyr Gln Ala Leu Ser Gly Asp Pro Ser Ile Gln Thr Leu
         60                  65                  70

Ala Pro Ala Gly Gly Thr Leu Leu Leu Leu Gly Trp Leu Ala Leu Ala
     75                  80                  85

Leu
 90














<210> 203
<211> 54
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -17..-1
<223> Von Heijne matrix
score 6.48133858229514
seq LTFAQLLFATVLG/IA

<400> 203

Met Phe Arg Arg Leu Thr Phe Ala Gln Leu Leu Phe Ala Thr Val Leu
        -15                 -10                  -5










Gly Ile Ala Gly Gly Val Tyr Ile Phe
      1               5

Ala Lys Asp Gln Lys Glu Leu Lys Glu
                 20

Ser Glu Glu Lys Lys Ser
             35



<210> 204
<211> 83
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -15..-1

<400> 204








Met Arg Leu Phe Leu Ser 


Leu 


Pro 


Val 










Val Leu Glu Gly Pro Ala 


Pro 


Ala 


Gin 


5 






10 


Ala Leu Asp Lys Leu Lys 


Glu 




Gly 






25




Arg Glu Leu He Ser Arg 


He 


Lys 


Gin 


35 


40 






Arg Glu Trp Phe Ser Glu 


Thr 


Phe 


Gin 


50 55 








He Asp Ser 








<210> 205
<211> 98
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -43..-1

<400> 205








Met Phe Arg He Glu Gly 


Leu 


Ala 


Pro 


-40 






-35 


Lys Arg Lys Met Arg Glu 


Asp 


Val 


He 


-25 




-20 




He Tyr Val Ala Leu Leu Arg Val 


Ser 


-10 


-5 






Cys Asp Thr Ser Gly Glu 


Leu 


Thr 


Asp 


10 








Gly His Arg Gly Leu Arg 


Thr 


Glu 




25 






30 


Gly Ser He Tyr Leu Lys 


Glu 


He 


Gly 


40 




45 





His He 
55 



<210> 206 

<211> 82 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -43..-1
<223> Von Heijne matrix 




Gln Pro Val Phe Glu Gln Tyr

10 15
Lys Met Gln Leu Val Gln Glu
25 30



Leu Val Val Val Leu Ser Ile

-5 1
Gly Thr Pro Asp Val Ser Ser
15

Asn Thr Leu Glu Asp Lys Ala
30

Ser Glu Leu Ser Ala Lys Met
45

Lys Val Lys Asp Lys Leu Lys
60 65



Lys Leu Asp Pro Glu Glu Met
-30

Ser Ser Ile Arg Asn Phe Leu
-15

Glu Cys Leu Pro Gly Cys Asp

1 5
Gly His Pro Leu Thr Leu Arg
15 20
Asn Gly Ser Gly Glu Gln Gly
35

Gln His Met Lys Thr Gly His
50
score 5.65876793443964
seq VVSFALIATLVYA/LF

<400> 206

Met Ala Lys Tyr Gln Gly Glu Val Gln Ser Leu Lys Leu Asp Asp Asp
            -40                 -35                 -30

Ser Val Ile Glu Gly Val Ser Asp Gln Val Leu Val Ala Val Val Val
        -25                 -20                 -15

Ser Phe Ala Leu Ile Ala Thr Leu Val Tyr Ala Leu Phe Arg Asn Val
    -10                  -5                   1               5

His Gln Asn Ile His Pro Glu Asn Gln Glu Leu Val Arg Val Leu Arg
                 10                  15                  20

Glu Gln Leu Gln Thr Glu Gln Asp Ala Pro Ala Asp Ser Thr Ala Val
             25                  30                  35

Leu His



<210> 207
<211> 82
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -43..-1

<400> 207

Met Ala Lys Tyr Gln Gly Glu Val Gln Ser Leu Lys Leu Asp Asp Asp
            -40                 -35                 -30

Ser Val Ile Glu Gly Val Ser Asp Gln Val Leu Val Ala Val Val Val
        -25                 -20                 -15

Ser Phe Ala Leu Ile Ala Thr Leu Val Tyr Ala Leu Phe Arg Asn Val
    -10                  -5                   1               5

His Gln Asn Ile His Pro Glu Asn Gln Glu Leu Val Arg Val Leu Arg
                 10                  15                  20

Glu Gln Leu Gln Thr Glu Gln Asp Ala Pro Ala Asp Ser Thr Ala Val
             25                  30                  35

Leu His



<210> 208
<211> 82
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -43..-1

<220>
<221> UNSURE
<222> 32
<223> Xaa = Ala,Gly

<400> 208

Met Ala Lys Tyr Gln Gly Glu Val Gln Ser Leu Lys Leu Asp Asp Asp
            -40                 -35                 -30

Ser Val Ile Glu Gly Val Ser Asp Gln Val Leu Val Ala Val Val Val
        -25                 -20                 -15

Ser Phe Ala Leu Ile Ala Thr Leu Val Tyr Ala Leu Phe Arg Asn Val
    -10                  -5                   1               5

His Gln Asn Ile His Pro Glu Asn Gln Glu Leu Val Arg Val Leu Arg
                 10                  15                  20

Glu Gln Leu Gln Thr Glu Gln Asp Ala Pro Xaa Asp Ser Thr Ala Val
             25                  30                  35

Leu His

<210> 209
<211> 82
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -43..-1

<400> 209

Met Ala Lys Tyr Gln Gly Glu Val Gln Ser Leu Lys Leu Asp Asp Asp
            -40                 -35                 -30

Ser Val Ile Glu Gly Val Ser Asp Gln Val Leu Val Ala Val Val Val
        -25                 -20                 -15

Ser Phe Ala Leu Ile Ala Thr Leu Val Tyr Ala Leu Phe Arg Asn Val
    -10                  -5                   1               5

His Gln Asn Ile His Pro Glu Asn Gln Glu Leu Val Arg Val Leu Arg
                 10                  15                  20

Glu Gln Leu Gln Thr Glu Gln Asp Ala Pro Ala Asp Ser Thr Ala Val
             25                  30                  35

Leu His




















<210> 210
<211> 99
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -28..-1

<220>
<221> UNSURE
<222> 2,16
<223> Xaa = Phe,Ser

<400> 210

Met Thr Ser Ala Leu Thr Gln Gly Leu Glu Arg Ile Pro Asp Gln Leu
            -25                 -20                 -15

Gly Tyr Leu Val Leu Ser Glu Gly Ala Val Leu Ala Ser Xaa Gly Asp
        -10                  -5                   1

Leu Glu Asn Asp Glu Gln Ala Ala Ser Ala Ile Xaa Glu Leu Val Ser
  5                  10                  15                  20

Thr Ala Cys Gly Phe Arg Leu His Arg Gly Met Asn Val Pro Phe Lys
                 25                  30                  35

Arg Leu Ser Val Val Phe Gly Glu His Thr Leu Leu Val Thr Val Ser
             40                  45                  50

Gly Gln Arg Val Phe Val Val Lys Arg Gln Asn Arg Gly Arg Glu Pro
         55                  60                  65

Ile Asp Val
     70





















<210> 211
<211> 99
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -28..-1

<220>
<221> UNSURE
<222> 16
<223> Xaa = Phe,Ser

<400> 211

Met Thr Ser Ala Leu Thr Gln Gly Leu Glu Arg Ile Pro Asp Gln Leu
            -25                 -20                 -15

Gly Tyr Leu Val Leu Ser Glu Gly Ala Val Leu Ala Ser Ser Gly Asp
        -10                  -5                   1

Leu Glu Asn Asp Glu Gln Ala Ala Ser Ala Ile Xaa Glu Leu Val Ser
  5                  10                  15                  20

Thr Ala Cys Gly Phe Arg Leu His Arg Gly Met Asn Val Pro Phe Lys
                 25                  30                  35

Arg Leu Ser Val Val Phe Gly Glu His Thr Leu Leu Val Thr Val Ser
             40                  45                  50

Gly Gln Arg Val Phe Val Val Lys Arg Gln Asn Arg Gly Arg Glu Pro
         55                  60                  65

Ile Asp Val
     70























<210> 212
<211> 186
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -15..-1

<220>
<221> UNSURE
<222> 37
<223> Xaa = Cys,Gly

<220>
<221> UNSURE
<222> 11
<223> Xaa = Cys,Trp

<220>
<221> UNSURE
<222> 149
<223> Xaa = Gly,Arg

<400> 212






















Met 


His 


Val 


Met 


Ala 


Ala 


Ser 


Met 


Ala 


Arg 


Gly Gly Val 


Ser Ala 


Arg 


-15 










-10 










-5 




1 


Val 


Leu 


Leu 


Gin 


Ala 


Ala 


Arg 


Gly 


Thr 


Xaa 


Trp Asn Arg 


Pro Gly 


Gly 








5 










10 






15 




Thr 


Ser 


Gly 


Ser 


Gly 


Glu 


Gly 


Val 


Ala 




Gly Thr Thr 


Arg Lys 


Phe 






20 










25 






30 






Gin 


Ala 


Thr 


Xaa 


Ser 


Arg 


Pro 


Ala 


Gly Glu 


Glu Asp Ala 


Gly Gly 


Pro 




35 










40 








45 






Glu 


Arg 


Pro 


Gly 


Asp 


Val 


Val 


Asn 


Val 


Val 


Phe Val Asp 


Arg Ser 


Gly 


50 










55 










60 




65 


Gin 




He 


Pro 


Val 


Ser 


Gly Arg 


Val 


Gly 


Asp Asn Val 


Leu His 


Leu 










70 










75 




80 




Ala 


Gin 


Arg 


His 


Gly 


Val 




Leu 


Glu 


Gly Ala Cys Glu 


Ala Ser 










85 










90 






95 




Ala 


Cys 


Ser 


Thr 


Cys 


His 


Val 


Tyr 


Val 


Ser 


Glu Asp His 


Leu Asp 


Leu 






100 










105 






110 






Leu 


Pro 


Pro 


Pro 


Glu 


Glu 




Glu 


Asp 


Asp 


Met Leu Asp 


Met Ala 


Pro 
115 120 125

Leu Leu Gln Glu Asn Ser Arg Leu Gly Cys Gln Ile Val Leu Thr Pro
130                 135                 140                 145

Glu Leu Glu Xaa Ala Glu Phe Thr Leu Pro Lys Ile Thr Arg Asn Phe
                150                 155                 160

Tyr Val Asp Gly His Val Pro Lys Pro His
            165                 170

<210> 213
<211> 123
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -48..-1

<220>
<221> UNSURE
<222> 75
<223> Xaa = Ala,Asp

<400> 213

Met Glu Ala Leu Gly Lys Leu Lys Gln Phe Asp Ala Tyr Pro Lys Thr
            -45                 -40                 -35

Leu Glu Asp Phe Arg Val Lys Thr Cys Gly Gly Ala Thr Val Thr Ile
        -30                 -25                 -20

Val Ser Gly Leu Leu Met Leu Leu Leu Phe Leu Ser Glu Leu Gln Tyr
    -15                 -10                  -5

Tyr Leu Thr Thr Glu Val His Pro Glu Leu Tyr Val Asp Lys Ser Arg
  1               5                  10                  15

Gly Asp Lys Leu Lys Ile Asn Ile Asp Val Leu Phe Pro His Met Pro
             20                  25                  30

Cys Ala Tyr Leu Ser Ile Asp Ala Met Asp Val Ala Gly Glu Gln Gln
         35                  40                  45

Leu Asp Val Glu His Asn Leu Phe Lys Gln Arg Leu Asp Lys Asp Gly
     50                  55                  60

Ile Pro Val Ser Ser Glu Ala Glu Arg His Xaa
 65                  70                  75

<210> 214
<211> 77
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -29..-1

<400> 214

Met Ala Ala Thr Asp Phe Val Gln Glu Met Arg Ala Val Gly Glu Arg
                -25                 -20                 -15

Leu Leu Leu Lys Leu Gln Arg Leu Pro Gln Ala Glu Pro Val Glu Ile
            -10                  -5                   1

Val Ala Phe Ser Val Ile Ile Leu Phe Thr Ala Thr Val Leu Leu Leu
      5                  10                  15

Leu Leu Ile Ala Cys Ser Cys Cys Cys Thr His Cys Cys Cys Pro Glu
 20                  25                  30                  35

Arg Arg Gly Arg Lys Val Gln Val Gln Pro Thr Pro Pro
                 40                  45



<210> 215
<211> 107
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -27..-1

<400> 215


















Met Leu Phe Glu 


Glu 


Ala 


Leu 


Pro Leu 


Ser 


Cys 


Ser Asp Pro 


Val Leu 


-25 








-20 






-15 




Ser Thr Leu Ser 




Val 


Gin 


Phe Ser 


Pro 


Ser 


Gly Arg Thr 


Gin Asp 


-10 






-5 








1 




Leu Leu Ser Pro 


Gly Val 


Glu 


Asn Leu 


Ser 


Val 


Leu Asp Val 


Ser Pro 




10 








15 






20 


Leu Gly Leu Ala 


Cys 


Cys 


Leu 


Leu Thr 


Leu 


Thr 


Met Ser Cys 


Pro Gly 


25 








30 






35 




Pro Asp Pro Pro 


Glu Gly 


Pro 


Gly Thr Gin 


Arg 


Val Trp Gin 


Gly Ala 


40 








45 






50 




Leu Arg lie Leu 


Gin 


Leu 


Pro 


Gly Ala 


Pro Asp 


Gly Val Ser 


Pro Tyr 


55 






60 








65 




Gin Pro Val Trp 


Ser 


Arg 


Thr 


Pro Asp 


Leu 


Lys 






70 




75 








80 






<210> 216
<211> 112
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -39..-1

<400> 216

Met Leu Glu His Leu Ser Ser Leu Pro Thr Gln Met Asp Tyr Lys Gly
                -35                 -30                 -25

Gln Lys Leu Ala Glu Gln Met Phe Gln Gly Ile Ile Leu Phe Ser Ala
            -20                 -15                 -10

Ile Val Gly Phe Ile Tyr Gly Tyr Val Ala Glu Gln Phe Gly Trp Thr
         -5                   1               5

Val Tyr Ile Val Met Ala Gly Phe Ala Phe Ser Cys Leu Leu Thr Leu
 10                  15                  20                  25

Pro Pro Trp Pro Ile Tyr Arg Arg His Pro Leu Lys Trp Leu Pro Val
                 30                  35                  40

Gln Ala Gln Thr Thr Arg Asn Gln Gly Lys Glu Lys Leu Arg Gly Met
             45                  50                  55

Leu Lys Ile Ile Glu Val Phe Met Ile Gln His Leu Leu Leu Phe Leu
         60                  65                  70





<210> 217
<211> 71
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -23..-1
<223> Von Heijne matrix
score 5.918755252201
seq SALMLLPCRPVLT/SV

<400> 217

Met Met Asn Asn Gly Leu Leu Gln Gln Pro Ser Ala Leu Met Leu Leu
            -20                 -15                 -10

Pro Cys Arg Pro Val Leu Thr Ser Val Ala Leu Asn Ala Asn Phe Val
         -5                   1               5

Ser Trp Lys Ser Arg Thr Lys Tyr Thr Ile Thr Pro Val Lys Met Arg
 10                  15                  20                  25

Lys Ser Gly Gly Arg Asp His Thr Gly Ala Gly Asn Val Arg Ser Asn
                 30                  35                  40

Ser Arg Pro Ser Ile Gln Arg
             45





















<210> 218
<211> 71
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -23..-1
<223> Von Heijne matrix
score 5.918755252201
seq SALMLLPCRPVLT/SV

<400> 218

Met Met Asn Asn Gly Leu Leu Gln Gln Pro Ser Ala Leu Met Leu Leu
            -20                 -15                 -10

Pro Cys Arg Pro Val Leu Thr Ser Val Ala Leu Asn Ala Asn Phe Val
         -5                   1               5

Ser Trp Lys Ser Arg Thr Lys Tyr Thr Ile Thr Pro Val Lys Met Arg
 10                  15                  20                  25

Lys Ser Gly Gly Arg Asp His Thr Gly Ala Gly Asn Val Arg Ser Asn
                 30                  35                  40

Ser Arg Pro Ser Ile Gln Arg
             45

















<210> 219
<211> 92
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -15..-1
<223> Von Heijne matrix
score 4.68058603039205
seq VLAGSLLGPTSRS/AA

<400> 219

Met Ala Val Leu Ala Gly Ser Leu Leu Gly Pro Thr Ser Arg Ser Ala
-15                 -10                  -5                   1

Ala Leu Leu Gly Gly Arg Trp Leu Gln Pro Arg Ala Trp Leu Gly Phe
              5                  10                  15

Pro Asp Ala Trp Gly Leu Pro Thr Pro Gln Gln Ala Arg Gly Lys Ala
         20                  25                  30

Arg Gly Asn Glu Tyr Gln Pro Ser Asn Ile Lys Arg Lys Asn Lys His
     35                  40                  45

Gly Trp Val Arg Arg Leu Ser Thr Pro Ala Gly Xaa Gln Val Ile Leu
 50                  55                  60                  65

Arg Arg Met Leu Lys Gly Arg Lys Ser Leu Ser His
                 70                  75



<210> 220
<211> 92
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -15..-1

<220>
<221> UNSURE
<222> 56
<223> Xaa = Asn,Ser

<400> 220

Met Ala Val Leu Ala Gly Ser Leu Leu Gly Pro Thr Ser Arg Ser Ala
-15                 -10                  -5                   1

Ala Leu Leu Gly Gly Arg Trp Leu Gln Pro Arg Ala Trp Leu Gly Phe
              5                  10                  15

Pro Asp Ala Trp Gly Leu Pro Thr Pro Gln Gln Ala Arg Gly Lys Ala
         20                  25                  30

Arg Gly Asn Glu Tyr Gln Pro Ser Asn Ile Lys Arg Lys Asn Lys His
     35                  40                  45

Gly Trp Val Arg Arg Leu Xaa Thr Pro Ala Gly Val Gln Val Ile Leu
 50                  55                  60                  65

Arg Arg Met Leu Lys Gly Arg Lys Ser Leu Ser His
                 70                  75








<210> 221
<211> 92
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -15..-1

<220>
<221> UNSURE
<222> 56
<223> Xaa = Asn,Ser

<220>
<221> UNSURE
<222> 62
<223> Xaa = Lys,Gln,Arg

<400> 221

Met Ala Val Leu Ala Gly Ser Leu Leu Gly Pro Thr Ser Arg Ser Ala
-15                 -10                  -5                   1

Ala Leu Leu Gly Gly Arg Trp Leu Gln Pro Arg Ala Trp Leu Gly Phe
              5                  10                  15

Pro Asp Ala Trp Gly Leu Pro Thr Pro Gln Gln Ala Arg Gly Lys Ala
         20                  25                  30

Arg Gly Asn Glu Tyr Gln Pro Ser Asn Ile Lys Arg Lys Asn Lys His
     35                  40                  45

Gly Trp Val Arg Arg Leu Xaa Thr Pro Ala Gly Val Xaa Val Ile Leu
 50                  55                  60                  65

Arg Arg Met Leu Lys Gly Arg Lys Ser Leu Ser His
                 70                  75









<210> 222
<211> 92
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -15..-1

<220>
<221> UNSURE
<222> 61
<223> Xaa = *,Glu,Leu,Val

<220>
<221> UNSURE
<222> 62
<223> Xaa = Ala,Glu,Pro,Gln

<220>
<221> UNSURE
<222> 56
<223> Xaa = Asn,Ser

<220>
<221> UNSURE
<222> -6
<223> Xaa = Asp,Gly

<220>
<221> UNSURE
<222> 6
<223> Xaa = Gly,Ser

<220>
<221> UNSURE
<222> 63
<223> Xaa = Phe,Val

<400> 222

Met Ala Val Leu Ala Gly Ser Leu Leu Xaa Pro Thr Ser Arg Ser Ala
-15                 -10                  -5                   1

Ala Leu Leu Gly Xaa Arg Trp Leu Gln Pro Arg Ala Trp Leu Gly Phe
              5                  10                  15

Pro Asp Ala Trp Gly Leu Pro Thr Pro Gln Gln Ala Arg Gly Lys Ala
         20                  25                  30

Arg Gly Asn Glu Tyr Gln Pro Ser Asn Ile Lys Arg Lys Asn Lys His
     35                  40                  45

Gly Trp Val Arg Arg Leu Xaa Thr Pro Ala Gly Xaa Xaa Xaa Ile Leu
 50                  55                  60                  65

Arg Arg Met Leu Lys Gly Arg Lys Ser Leu Ser His
                 70                  75










<210> 223
<211> 87
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -39..-1

<400> 223


















Met Lys Val 




Leu Leu Thr Gly Leu Gly 


Ala Leu Phe 


Phe 


Ala 


Tyr 






-35 




-30 






-25 




Tyr Trp Asp 


Asp 


Asn 


Phe Asp Pro Ala 


Ser 


Leu Gin Gly 


Ala 


Arg 


Val 




-20 




-15 






-10 






Leu Leu Thr 


Gly 


Ala 


Asn Ala Gly Val 


Gly 


Glu Glu Leu 


Ala 


Tyr 


His 



-5 1 5

Tyr Ala Arg Leu Gly Ser His Leu Val Leu Thr Ala His Thr Glu Ala
 10                  15                  20                  25

Leu Leu Gln Lys Ala Arg Trp Leu Thr Leu Val Val Ser Thr Leu Gly
                 30                  35                  40

Gly Arg Gly Lys Trp Ile Thr
             45

<210> 224
<211> 125
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -15..-1

<220>
<221> UNSURE
<222> 100
<223> Xaa = Asn,Thr

<220>
<221> UNSURE
<222> 30
<223> Xaa = Ile,Met

<400> 224
























Met 


Lys 


Leu 


Leu 


Thr 


His 


Asn 


Leu 


Leu 


Ser 


Ser 


His 


Val Arg 


Gly Val 


-15 










-10 










-5 






1 


Gly 


Ser 


Arg 


Gly 


Phe 


Pro 


Leu Arg 




Gin 


Ala 


Thr 


Glu Val 


Arg He 








5 










10 








15 




Cys 


Pro 


Val 
20 


Glu 


Phe 


Asn 


Pro 


Asn 
25 


Phe 


Val 


Ala 


Arg 


Xaa He 
30 


Pro Lys 


Val 


Glu 


Trp 


Ser 


Ala 


Phe 


Leu 


Glu 


Ala 


Ala 


Asp 


Asn Leu Arg 


Leu He 




35 










40 










45 






Gin 


Val 


Pro 


Lys 


Gly 


Pro 


Val 


Glu 


Gly 


Tyr 


Glu 


Glu 


Asn Glu 


Glu Phe 


50 










55 










60 






65 


Leu 


Arg 


Thr 


Met 


His 
70 


His 


Leu 


Leu 


Leu 


Glu 
75 


Val 


Glu 


Val He 


Glu Gly 
80 


Thr 


Leu 


Gin 


Cys 


Pro 


Glu 


Ser 


Gly 


Arg Met 


Phe 


Pro 


He Ser 


Arg Gly 








85 










90 








95 




He 


Pro 


Xaa 
100 


Met 


Leu 


Leu 


Ser 


Glu 
105 


Glu 


Glu 


Thr 


Glu 


Ser 
110 





<210> 225
<211> 93
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -22..-1

<400> 225

Met Lys Ile Ser Val Ala Ala Ile Pro Phe Phe Leu Leu Ile Thr Ile
        -20                 -15                 -10

Ala Leu Gly Thr Lys Thr Glu Ser Ser Ser Arg Gly Pro Tyr His Pro
     -5                   1               5                  10

Ser Glu Cys Cys Phe Thr Tyr Thr Thr Tyr Lys Ile Pro Arg Gln Arg
                 15                  20                  25

Ile Met Asp Tyr Tyr Glu Thr Asn Ser Gln Cys Ser Lys Pro Gly Ile
             30                  35                  40

Val Phe Ile Thr Lys Arg Gly His Ser Val Cys Thr Asn Pro Ser Asp
         45                  50                  55

Lys Trp Val Gln Asp Tyr Ile Lys Asp Met Lys Glu Asn
     60                  65                  70



<210> 226
<211> 72
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -42..-1

<400> 226

Met Asn Arg Leu Ala Gly Val Gly Trp Arg Val Asp Tyr Thr Leu Ser
        -40                 -35                 -30

Ser Ser Leu Leu Gln Ser Val Glu Glu Pro Met Val His Leu Arg Leu
    -25                 -20                 -15

Glu Val Ala Ala Ala Pro Gly Thr Pro Ala Gln Pro Val Ala Met Ser
-10                  -5                   1               5

Leu Ser Ala Asp Lys Phe Gln Val Leu Leu Ala Glu Leu Lys Gln Ala
             10                  15                  20

Gln Thr Leu Met Ser Ser Leu Gly
         25                  30



<210> 227
<211> 84
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -20..-1
<223> Von Heijne matrix
score 4.37180298395146
seq SLLLSLPPHQGLT/FS

<400> 227

Met Ala Ala Ala Ala Val Pro Ser Leu Leu Leu Ser Leu Pro Pro His
-20                 -15                 -10                  -5

Gln Gly Leu Thr Phe Ser Asn Lys Ile Gln Pro Phe Gly Ala Gln Gly
                  1               5                  10

Val Leu His Pro Glu Pro Gly Leu Arg Asp Trp Leu Leu Pro Thr Cys
         15                  20                  25

Ser Arg Gln Leu Arg Val Ala Leu Pro Glu Lys Gly Ser Glu Gly Ser
     30                  35                  40

Leu Cys Gln Thr Gln Leu Pro Ala Thr Pro Cys Phe Leu Pro Ser Asn
 45                  50                  55                  60

Thr Val Arg Thr

<210> 228
<211> 54
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -19..-1

<400> 228

Met Lys Phe Asn Phe Val Leu Leu Ile Leu Ser Lys Val Leu Asp Asp
                -15                 -10                  -5

Thr Phe Gln Ser Val Lys Lys Trp Leu Asn Tyr Phe Gln Phe Thr Leu
              1               5                  10

Arg Asn Gly Leu Met Trp Pro Gly Ala Val Ala His Ala Cys Asn Pro
     15                  20                  25

Ser Thr Gly Ser Arg Leu
 30                  35



<210> 229 
<211> 54 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -19. .-1 

<223> Von Heijne matrix 

score 4.24406450728052 
seq LLILSKVLDDTFQ/SV 



<400> 229 

Met Lys Phe Asn Phe Val Leu Leu Ile Leu Ser Lys Val Leu Asp Asp
-15 -10 -5

Thr Phe Gln Ser Val Lys Lys Trp Leu Asn Tyr Phe Gln Phe Thr Leu
1 5 10

Arg Asn Gly Leu Met Trp Pro Gly Ala Val Ala His Ala Cys Asn Pro
15 20 25

Ser Thr Gly Ser Arg Leu
30 35



<210> 230 
<211> 54 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -19. .-1 

<223> Von Heijne matrix 

score 4.24406450728052 
seq LLILSKVLDDTFQ/SV 



<400> 230 

Met Lys Phe Asn Phe Val Leu Leu Ile Leu Ser Lys Val Leu Asp Asp
-15 -10 -5

Thr Phe Gln Ser Val Lys Lys Trp Leu Asn Tyr Phe Gln Phe Thr Leu
1 5 10

Arg Asn Gly Leu Met Trp Pro Gly Ala Val Ala His Ala Cys Asn Pro
15 20 25

Ser Thr Gly Ser Arg Leu
30 35

<210> 231 
<211> 91 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19. .-1 



<400> 231 

Met Tyr Gly Lys Ile Ile Phe Val Leu Leu Leu Ser Glu Ile Val Ser
-15 -10 -5

Ile Ser Ala Leu Ser Thr Thr Glu Val Ala Met His Thr Ser Thr Ser
1 5 10

Ser Ser Val Thr Lys Ser Tyr Ile Ser Ser Gln Thr Asn Gly Glu Met
15 20 25

Gly Gln Leu Val His Arg Phe Thr Val Pro Ala Pro Val Val Ile Ile
30 35 40 45

Leu Ile Ile Leu Cys Val Met Ala Gly Ile Ile Gly Thr Ile Leu Leu
50 55 60

Ile Ser Tyr Ser Ile Arg Arg Leu Ile Lys Ala
65 70



<210> 232 
<211> 114 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -32. .-1 



<400> 232 












Met 


Gin 


Met 


Tyr 


Ser 


Arg 


Gin 








-30 










-25 


Gin 


Gly 


Gly 


Leu 


Leu 


Gly 


Ser Gly 




-15 










-10 




Phe 


Asn 


Asn 


Leu 


Glu 


Asn 


Leu 


Val 


1 








5 








He 


Phe 


Pro 


Glu 


He 


Leu 


Leu 


Cys 








20 










Gly 


Leu 


He 


His 


Arg 


Val 


Cys 


Val 






35 










40 


Val 


Gly 


Leu 


Tyr 


Tyr 


He 


Asn 


Lys 




50 










55 




Ala 


Ala 


Pro 


Val 


Leu 


Thr 


Pro 


Ala 



65 70 

Arg Asn 



Ala 


Ser 


Thr 


Glu 


Trp 


Leu 


Thr 


He 










-20 








Leu 


Phe 


Val 


Phe 
-5 


Ser 


Leu 


Thr 


Ala 


Phe 


Gly Lys 


Gly 


Phe 


Gin 


Ala 


Lys 




10 










15 




Leu 


Leu 


Leu 


Ala 


Leu 


Phe 


Ala 


Ser 


25 










30 






Thr 


Thr 


Cys 


Phe 


He 


Phe 


Ser 


Met 










45 








He 


Ser 


Ser 


Thr 


Leu 


Tyr 


Gin 


Ala 








60 










Lys 


Val 


Thr 


Gly 


Lys 


Ser 


Lys 


Lys 



<210> 233 
<211> 75 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -35. .-1 



<400> 233 



Met 


Ala 


Pro Glu 


Val 


Leu 


Pro 


Lys 


Pro 


Arg 


Met 


Arg Gly Leu Leu Ala 


-35 








-30 










-25 


-20 


Arg 






Asn 
-15 


His 


Met 


Ala 


Val 


Ala 
-10 


Phe 


Val Leu Ser Leu Gly 
-5 


Val 


Ala 


Ala Leu 

1 


Tyr 


Lys 


Phe 


Arg 
5 


Val 


Ala 


Asp 


Gin Arg Lys Asn Ala 
10 


Tyr 


Ala 
15 


Asp Phe 


Tyr 


Arg 


Asn 
20 


Tyr 


Asp 


Val 


Met 


Lys Asp Phe Glu Glu 
25 


Met 


Arg 


Lys Ala 


Gly 


He 


Phe 


Gin 


Ser 


Val 


Lys 




30 








35 










40 





<210> 234 
<211> 108 
<212> PRT 

<213> Homo sapiens 



<220>
<221> SIGNAL
<222> -40. .-1 



<400> 234 



Met Gin Lys Phe 


Arg 


Lys 


Met 


Ser 


Glu 


Thr 


His His 


Ser Val He Ser 


-40 




-35 










-30 


-25 


Val Lys Ala Ser 


Ser 


Pro 


Trp 


Leu 


Ser 


Ser 


Ser Val 


Thr Ala Pro Ser 




-20 










-15 




-10 


Met Val Ala Pro 


Val 


Thr 


Phe 


Ala 


Ser 


He 


Val Glu 


Glu Glu Leu Gin 


-5 










1 






5 


Gin Glu Ala Ala 


Leu 


He 


Arg 


Ser 


Arg Glu 


Lys Pro 


Leu Ala Leu He 


10 






15 








20 




Gin He Glu Glu 


His 


Ala 


He 


Gin 


Asp 




Leu Val 


Phe Tyr Glu Ala 


25 




30 










35 


40 


Phe Gly Asn Pro 


Glu 


Glu 


Phe 


Val 


He 


Val 


Glu Arg 


Thr Pro Gin Gly 




45 










50 




55 


Pro Leu Ala Val 


Pro 


Met 


Trp 


Asn 


Lys 


His 


Gly Cys 




60 










65 








<210> 235
<211> 127
<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -47..-1

<400> 235

Met Phe Thr Pro 


Leu 


Thr 


Val 


Lys 


Tyr Ala 


Tyr Tyr 


Asp Thr Glu Arg 


-45 








-40 








-35 


He Gly Val Asp 


Leu 


He 


Met 


Lys 


Thr Cys 


Phe Ser 


Pro Asn Arg Val 


-30 






-25 








-20 




He Gly Leu Ser 


Ser 


Asp 


Leu 


Gin 


Gin 


Val 


Gly Gly 


Ala Ser Ala Arg 


-15 




-10 










-5 


1 


He Gin Asp Ala 


Leu 


Ser 


Thr 


Val 


Leu 


Gin 


Tyr Ala 


Glu Asp Val Leu 


5 










10 






15 


Ser Gly Lys Val 


Ser 


Ala 


Asp Asn 


Thr 


Val 


Gly Arg 


Phe Leu Met Ser 


20 








25 








30 


Leu Val Asn Gin 


Val 


Pro 


Lys 


He 


Val 


Pro Asp Asp 


Phe Glu Thr Met 


35 






40 








45 




Leu Asn Ser Asn 


He 


Asn 


Asp 




Leu 


Met 


Val Thr 


Tyr Leu Ala Asn 


50 




55 










60 


65 


Leu Thr Gin Ser 


Gin 


He 


Ala 


Leu 


Asn 


Glu Lys Leu 


Val Asn Leu 




70 










75 




80 


<210> 236
<211> 85
<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -32..-1

<400> 236

Met Lys Ala Ala 


Leu 


Glu 


Asp 


Thr 


Leu 


Ala 


Glu Thr 


Glu Ala Arg Phe 


-30 








-25 








-20 


Gly Ala Gin Leu 


Ala 


His 


He 


Gin 


Ala 


Leu 


He Ser 


Gly He Glu Ala 


-15 






-10 








-5 




Gin Leu Gly Asp 


Val 


Arg 


Ala 


Asp 


Ser 


Glu 


Arg Gin 


Asn Gin Glu Tyr 


1 


5 










10 




15 


Gin Arg Leu Met 


Asp 


He 


Lys 


Ser 


Arg 




Glu Gin 


Glu Ile Ala Thr
20 25 30

Tyr Arg Ser Leu Leu Glu Gly Gln Glu Asp His Tyr Asn Asn Leu Ser

35 40 45

Ala Ser Lys Val Leu
50 

<210> 237 
<211> 101 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -25. .-1 



<400> 237 



Met 


Val 


Asp 


Arg 


Glu 


Leu 


Ala Asp 


He 


His Glu Asp Ala 


Lys Thr 


Cys 


-25 










-20 






-15 




-10 


Leu 


Val 


Leu 


Cys 


Ser 
-5 


Arg 


Val Leu 


Ser 


Val He Ser Val 
1 


Lys Glu 
5 


He 


Lys 


Thr 


Gin 




Ser 


Leu 


Gly Arg 


His 


Pro He He Ser 


Asn Trp 


Phe 






10 








15 




20 






Asp 


Tyr 


He 


Pro 


Ser 


Thr 


Arg Tyr 


Lys 


Asp Pro Cys Glu 


Leu Leu 


His 




25 










30 




35 






Leu 


Cys 


Arg 


Leu 


Thr 


He Arg Asn Gin Leu Leu Thr Asn Asn Met 


Leu 


40 










45 






50 




55 


Pro 


Asp 


Gly 


He 


Phe 


Ser 


Leu Leu 


He 


Pro Ala Arg Leu 


Gin Asn 


Tyr 










60 








65 


70 




Leu 


Asn 


Leu 


Glu 


He 















75 

<210> 238 
<211> 68 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -47. .-1 

<400> 238 



Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn Ser His Cys Phe Ala
-45 -40 -35

Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr Ile His Ala Ser Arg
-30 -25 -20

Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly Gly Ser Ser Ser Glu
-15 -10 -5 1

Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn Pro Gly Pro Gly Pro
5 10 15

Arg Trp Cys Ser
20

<210> 239 

<211> 68 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -47. .-1 



<400> 239 

Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn Ser His Cys Phe Ala
-45 -40 -35

Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr Ile His Ala Ser Arg
-30 -25 -20

Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly Gly Ser Ser Ser Glu
-15 -10 -5 1

Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn Pro Gly Pro Gly Pro
5 10 15

Arg Trp Cys Ser
20



<210> 240 
<211> 68 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -47. .-1 

<223> Von Heijne matrix 

score 3.72491712201476 
seq PLLSLHSRGGSSS/ES 



<400> 240 

Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn Ser His Cys Phe Ala
-45 -40 -35

Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr Ile His Ala Ser Arg
-30 -25 -20

Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly Gly Ser Ser Ser Glu
-15 -10 -5 1

Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn Pro Gly Pro Gly Pro
5 10 15

Arg Trp Cys Ser
20



<210> 241 

<211> 68 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -47. .-1 

<223> Von Heijne matrix 

score 3.72491712201476 
seq PLLSLHSRGGSSS/ES 



<220> 

<221> unsure 
<222> 12 

<223> Xaa = Lys, Asn 



<400> 241 

Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn Ser His Cys Phe Ala
-45 -40 -35

Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr Ile His Ala Ser Arg
-30 -25 -20

Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly Gly Ser Ser Ser Glu
-15 -10 -5 1

Ser Ser Arg Val Ser Leu Xaa Xaa Cys Ser Xaa Pro Gly Pro Gly Pro
5 10 15

Arg Trp Cys Ser
20

<210> 242
<211> 129 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16. .-1 
<223> Von Heijne matrix 

score 3.61791418904325 

seq LLLLLRALRRGPG/PG 

<220> 

<221> unsure 
<222> 92 

<223> Xaa = His, Gin 
<220> 

<221> unsure 
<222> 84 

<223> Xaa = Phe,Tyr 



<400> 242 



Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly Pro Gly
-15 -10 -5

Pro Gly Pro Arg Pro Leu Trp Gly Pro Gly Pro Ala Trp Ser Pro Gly
1 5 10 15

Phe Pro Ala Arg Pro Gly Arg Gly Arg Pro Tyr Met Ala Ser Arg Pro
20 25 30

Pro Gly Asp Leu Ala Glu Ala Gly Gly Arg Ala Leu Gln Ser Leu Gln
35 40 45

Leu Arg Leu Leu Thr Pro Thr Phe Glu Gly Ile Asn Gly Leu Leu Leu
50 55 60

Lys Gln His Leu Val Gln Asn Pro Val Arg Leu Trp Gln Leu Leu Gly
65 70 75 80

Gly Thr Phe Xaa Phe Asn Thr Ser Arg Leu Lys Xaa Lys Asn Lys Glu
85 90 95

Lys Asp Lys Ser Lys Gly Lys Ala Pro Glu Glu Asp Glu Gly Ile Phe
100 105 110

Ile

<210> 243
<211> 129
<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -16..-1

<223> Von Heijne matrix

score 3.61791418904325
seq LLLLLRALRRGPG/PG

<400> 243

Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly Pro Gly
-15 -10 -5

Pro Gly Pro Arg Pro Leu Trp Gly Pro Gly Pro Ala Trp Ser Pro Gly
1 5 10 15

Phe Pro Ala Arg Pro Gly Arg Gly Arg Pro Tyr Met Ala Ser Arg Pro
20 25 30

Pro Gly Asp Leu Ala Glu Ala Gly Gly Arg Ala Leu Gln Ser Leu Gln
35 40 45

Leu Arg Leu Leu Thr Pro Thr Phe Glu Gly Ile Asn Gly Leu Leu Leu
50 55 60

Lys Gln His Leu Val Gln Asn Pro Val Arg Leu Trp Gln Leu Leu Gly
65 70 75 80

Gly Thr Phe Tyr Phe Asn Thr Ser Arg Leu Lys Gln Lys Asn Lys Glu
85 90 95

Lys Asp Lys Ser Lys Gly Lys Ala Pro Glu Glu Asp Glu Gly Ile Phe
100 105 110

Ile



<210> 244 
<211> 109 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -33. .-1 



<220> 

<221> UNSURE 
<222> 43 

<223> Xaa = Asp, Gly, Asn, Ser 



<400> 244 
















Met 


Ala 


Ala Ser Gly 


Ala 


Pro Arg lie Leu 


Val Asp 


Leu 


Leu 


Lys 


Leu 






-30 




-25 






-20 






Asn 


Val 


Ala Pro Leu 


Ala 


Val Phe Gin Met 


Leu Lys 


Ser 


Met 


Cys Ala 






-15 




-10 




-5 








Gly 


Gin 


Arg Leu Ala 


Ser 


Glu Pro Gin Asp 


Pro Ala 


Ala 


Val 


Ser 


Leu 




1 




5 




10 








15 


Pro 


Thr 


Ser Ser Val 


Pro 


Glu Thr Arg Gly 


Arg Asn 


Lys 


Gly 


Ser 


Ala 






20 




25 








30 




Ala 


Leu 


Gly Gly Ala 


Leu 


Ala Leu Ala Glu 


Arg Xaa 


Ser Arg 


Glu 


Gly 






35 




40 






45 






Ser 


Ser 


Gin Arg Met 


Pro Arg Gin Pro Ser 


Ala Thr Arg 


Leu 


Pro 








50 




55 




60 








Gly 


Gly 


Gly Pro Gly 


Lys 


Ser Pro Thr Arg 


Gly Ser 


Thr 










65 






70 


75 











<210> 245 
<211> 103 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -36. .-1 



<400> 245 



Met 


Ala 
-35 


Asn 


Leu 


Phe 


He 


Arg Lys Met 
-30 


Val 


Asn 


Pro 
-25 


Leu 


Leu 


Tyr 


Leu 


Arg 


Arg 


His 


Thr 


Val 


Lys 


Pro Arg Ala 


Leu 


Ser 


Thr 


Phe 




Phe 


Gly 


-20 










-15 






-10 










-5 


Ser 


He 


Arg 


Gly 


Ala 


Ala 


Pro Val Ala 


Val 


Glu 


Pro 


Gly Ala 


Ala 


Val 










1 




5 










10 






Arg 


Ser 


Leu 


Leu 


Ser 


Pro 


Gly Leu Leu 


Pro 


His 


Leu 


Leu 


Pro 


Ala 


Leu 




15 








20 








25 








Gly 


Phe 


Lys 


Asn 




Thr 


Val Leu Lys 


Lys 


Arg Cys 


Lys 


Asp 


Cys 


Tyr 




30 










35 






40 










Leu 


Val 


Lys 


Arg 


Arg 


Gly 


Arg Trp Tyr Val 


Tyr 


Cys 


Lys 


Thr 


His 


Pro 


45 










50 






55 










60 


Arg 


His 




Gin 


Arg 
65 


Gin 


Met

<210> 246
<211> 86 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -36. .-1 

<400> 246 



Met Gly Pro Ala Leu Arg Ser Leu Gln Val Lys Lys Gly Thr Glu His
-35 -30 -25

Ala Asp Pro Leu Pro Phe Pro Ser Val Ser Leu Ser Gly Phe Thr Val
-20 -15 -10 -5

Gly Thr Leu Ser Glu Thr Ser Thr Gly Gly Pro Ala Thr Pro Thr Trp
1 5 10

Lys Glu Cys Pro Ile Cys Lys Glu Arg Phe Pro Ala Glu Ser Asp Lys
15 20 25

Asp Ala Leu Glu Asp His Met Asp Gly His Phe Phe Phe Ser Thr Gln
30 35 40

Asp Pro Phe Thr Phe Glu
45 50

<210> 247
<211> 52
<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -30..-1

<400> 247

Met Ser Ser Cys Gly Ile Val Gly Ser Ser Val Ser Phe Gln Leu Asp
-30 -25 -20 -15

Ala Val Lys Leu Leu Leu Lys Met Val Ser Ser Ala Thr Thr Glu Arg
-10 -5 1

Cys Cys Asn Gly Ser Ala Asn Phe His Lys Asn Leu Cys Ala Thr Gly
5 10 15

Ile Lys Asn Phe
20



<210> 248 
<211> 219 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 

<222> -40. .-1 

<223> Von Heijne matrix 

score 6.4143157541603 
seq ITMSATWDAVNA/AP 

<220> 

<221> unsure 

<222> 13 

<223> Xaa = Glu, Gin 
<220> 

<221> unsure 
<222> -39
<223> Xaa = Ile,Asn



<400> 248 



Pro 


Xaa 






He Thr Arg Leu 


Ala 


-40 








-35 




Gly 


Ala 


Ser Cys 


Gly Gly Ala Gly 


Gly 










-20 




Thr 


Val 


Val 


Asp 


Ala Val Asn Ala 


Ala 








-5 




1 


Met 


Ser 


Leu 


Glu 


Xaa Pro Lys Lys 


Met 




10 






15 




Lys 


Lys 


Glu 


Leu 


Glu Glu Gin Arg 


Lys 


25 








30 




Val 


Asp 


Glu 


Glu 


Gly Lys Asp He 


Asn 










45 




Ser 


Ser 


Val 


Pro 


Trp Tyr He Asp 


Pro 








60 




65 


His 


Gin 


Arg 


Pro 


Gin Pro Glu Lys 


Gin 






75 




80 




Glu 


Trp 


Tyr 


Lys 


Arg Gly Val Lys 


Glu 




90 






95 




Arg 


Lys 


Gly 


Ala 


Cys Glu Asn Cys 


Gly 


105 








110 




Asp 


Cys 


Phe 


Glu 


Arg Pro Arg Arg 


Val 










125 




Asn 


He 


Ala 


Pro 


Asp Glu His Val 


Gin 








140 




145 


Asp 


Gly 


Lys 


Arg 


Asp Arg Trp Asn 


Gly 






155 




160 




Lys 


He 


Val 


Glu 


Glu Tyr Ala Lys 


Val 




170 






175

Trp


He 


Leu 


Ser Asp Gly 


Leu 




-30 








-25 


Gly 


Gin 


He 


Thr 


Met Ser 


Ala 


-15 








-10 




Pro 


Leu 


Ser 


Gly 
5 


Ser Lys 


Glu 


Thr 


Arg Glu 


Asp 


Trp Arg 








20 








Leu 


Gly 


Asn 


Ala 


Pro Ala 


Glu 




35 








40 


Pro 


His 


He 


Pro 


Gin Tyr 


He 


50 








55 




Ser 


Lys 


Arg 


Pro 


Thr Leu Lys 










70 




Lys 


Gin 


Phe 


Ser 


Ser Ser Gly 








85 






Asn 


Ser 


He 


He 


Thr Lys 


Tyr 






100 








Ala 


Met 


Thr 


His 




Lys 




115 








120 


Gly 


Ala 


Lys 


Phe 


Thr Gly 


Thr 


130 








135 




Pro 


Gin 


Leu 


Met 


Phe Asp 


Tyr 










150 




Tyr 


Asn 


Pro 


Glu 


Glu His 


Met 



165 



Asp Leu 



<210> 249 
<211> 215 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -36. .-1 

<223> Von Heijne matrix 

score 5.52542399647426 
seq ITMSATVXDAVNA/AP 



<400> 249 



He 


Thr 


Arg 


Leu 


Ala 


Trp 


He 




Xaa Asp 


Gly Leu 


Gly Ala 


Ser 


Cys 




-35 










-30 








-25 










Gly 


Gly 


Ala 


Gly 


Gly 


Gly 


Gin 


He 


Thr Met 


Ser 


Ala 


Thr 


Val 


Xaa 


Asp 


-20 










-15 








-10 










-5 


Ala 


Val 


Asn 


Ala 


Ala 
1 


Pro 


Leu 


Ser 


Gly Ser 
5 




Glu 


Met 


Ser 
10 




Glu 


Glu 


Pro 


Lys 


Lys 


Met 


Thr 


Arg 


Glu 


Asp Trp 


Arg Lys 


Lys 


Lys 


Glu 


Leu 






15 










20 








25 








Glu 


Glu 


Gin 


Arg 


Lys 


Leu 


Gly 


Asn 


Ala Pro 


Ala 


Glu 


Val Asp Glu 


Glu 




30 










35 








40 










Gly 


Lys 




He 


Asn 


Pro 


His 


He 


Pro Gin 


Tyr 


He 


Ser 


Ser 


Val 


Pro 


45 










50 








55 










60 


Trp 


Tyr 


He 


Asp 


Pro 

65 


Ser 


Lys 


Arg 


Pro Thr 
70 


Leu 


Lys 


His 


Gin 


Arg 
75 


Pro 


Gin 


Pro 


Glu 




Gin 


Lys 


Gin 


Phe 


Ser Ser 


Ser 


Gly 


Glu 


Trp Tyr 


Lys 








80 










85 








90 






Arg 


Gly 


Val 




Glu 


Asn 


Ser 


He 


He Thr 




Tyr 






Gly 


Ala 



95 100 105

Cys Glu


Asn 


Cys 


Gly 


Ala 


Met 


Thr 


His 


110 










115 






Arg Pro 


Arg 


Arg 


Val 


Gly 


Ala 


Lys 


Phe 


125 








130 








Asp Glu 


His 


Val 


Gin 


Pro 


Gin 


Leu 


Met 








145 










Asp Arg 


Trp 


Asn 


Gly 


Tyr 


Asn 


Pro 


Glu 






160 










165 


Glu Tyr 


Ala 


Lys 


Val 


Asp


Leu 








175 














<210> 250
<211> 214
<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -36..-1

<220>

<221> UNSURE
<222> 120

<223> Xaa = Glu, Lys

<400> 250

lie Thr 


Arg 


Leu 


Ala 


Trp 


lie 


Leu 


Ser 


-35 










-30 






Gly Gly 


Ala 


Gly 


Gly 


Gly 


Gin 


lie 


Thr 


-20 








-15 








Ala Val 


Asn 


Ala 


Ala 


Pro 


Leu 


Ser 


Gly 








1 








5 


Glu Pro 


Lys 


Lys 


Met 


Thr 


Arg 


Glu 


Asp 




15 










20 




Glu Glu 


Gin 


Arg 


Lys 


Leu 


Gly 


Asn 


Ala 


30 










35 






Gly Lys 




lie 




Pro 


His 


lie 


Pro 


45 








50 








Trp Tyr 


lie 


Asp 


Pro 


Ser 


Lys 


Arg 


Pro 








65 










Gin Pro 


Glu 


Lys 


Gin 


Lys 


Gin 


Phe 


Ser 






80 










85 


Arg Gly 


Val 


Lys 


Glu 


Asn 


Ser 


He 


He 




95 










100 




Cys Glu 


Asn 


Cys 


Gly 


Ala 


Met 


Thr 


His 


110 










115 






Arg Pro 


Arg 


Arg 


Val 


Gly 


Ala 


Lys 


Phe 


125 








130 








Asp Glu 


His 


Val 


Gin 


Pro 


Gin 


Leu 


Met 








145 










Asp Arg 


Trp 


Asn 


Gly 


Tyr 


Asn 


Pro 


Glu 






160 










165 


Glu Tyr 


Ala 


Lys 


Val 


Asp 










175 















<210> 251 
<211> 147 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -15..-1

Lys




Lys 


Asp 


Cys 


Phe 


Glu 






120 










Thr 


Gly 


Thr 


Asn 


He 


Ala 


Pro 




135 










140 


Phe 




Tyr 




Gly 


Lys 


Arg 


150 










155 




Glu 


His 


Met 




He 


Val 


Glu 



170 





Gly 


Leu Gly 


Ala 


Ser Cys 






-25 








Met 


Ser 


Ala Thr 


Val 


Val 


Asp 




-10 








-5 


Ser 


Lys 


Glu Met 


Ser 


Leu 


Glu 








10 






Trp 


Arg 


Lys Lys 


Lys 


Glu 


Leu 






25 








Pro 


Ala 


Glu Val 


Asp 


Glu 


Glu 






40 








Gin 


Tyr 


He Ser 


Ser 


Val 


Pro 




55 








60 


Thr 


Leu 


Lys His 


Gin Arg 


Pro 


70 








75 




Ser 


Ser 


Gly Glu 


Trp Tyr 


Lys 








90 






Thr 




Tyr Arg 




Gly 


Ala 






105 








Lys 


Lys 


Xaa Asp 


Cys 


Phe 


Glu 






120 








Thr 


Gly 


Thr Asn 


He 


Ala 


Pro 




135 








140 


Phe 


Asp 


Tyr Asp 


Gly Lys 


Arg 


150 








155 




Glu 


His 


Met Lys 


He 


Val 


Glu 



170

<220>

<221> UNSURE
<222> -9

<223> Xaa = Glu,Lys
<400> 251

Met Asp Val Leu Val Ser Xaa Cys Ser Ala Arg Leu Leu Gln Gln Glu
-15 -10 -5 1

Glu Glu Ile Lys Ser Leu Thr Ala Glu Ile Asp Arg Leu Lys Asn Cys
5 10 15

Gly Cys Leu Gly Ala Ser Pro Asn Leu Glu Gln Leu Gln Glu Glu Asn
30

Leu Lys Leu Lys Tyr Arg Leu Asn Ile Leu Arg Lys Ser Leu Gln Ala
35 40 45

Glu Arg Asn Lys Pro Thr Lys Asn Met Ile Asn Ile Ile Ser Arg Leu
50 55 60 65

Gln Glu Val Phe Gly His Ala Ile Lys Ala Ala Tyr Pro Asp Leu Glu
70 75 80

Asn Pro Pro Leu Leu Val Thr Pro Ser Gln Gln Ala Lys Phe Gly Asp
85 90 95

Tyr Gln Cys Asn Ser Ala Met Gly Ile Ser Gln Val Met Tyr Cys His
100 105 110

Asp Ser Trp Leu Phe Asp Phe Phe Lys Tyr Tyr Tyr His His Cys His
115 120 125

Leu Gln Lys
130

<210> 252 
<211> 117 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -43. .-1 

<223> Von Heijne matrix 

score 4.10872570449087 
seq LGQVAAAAAAANA/AS 



<400> 252 

Leu Ser Pro Ala Gln Ala Ser Gly Ile Trp Cys Arg Asp Ser Ala Phe
-40 -35 -30

Ser Gly Trp Ser Glu Arg Gly Ser Ser Ser Arg Arg Gly Cys Leu Gly
-25 -20 -15

Gln Val Ala Ala Ala Ala Ala Ala Ala Asn Ala Ala Ser Pro Pro Pro
-10 -5 1 5

Ser Ala Asn Ser Leu Gly Ser Gly Gly Arg Cys Lys Leu Arg Ala Pro
10 15 20

Thr Ser Arg Pro Ser Gln Ser Arg Pro Arg Ser Leu Pro Gln Ala Arg
25 30 35

Phe Pro Pro Ser Pro Leu Pro Pro Ser Pro Ala Gly Thr Ser Cys Thr
40 45 50

Cys Tyr Cys Phe Leu Leu Ile Asn Ala Asp Leu Ile Lys Trp Ser Pro
55 60 65

Lys Lys Lys Lys Lys
70



<210> 253 
<211> 88 
<212> PRT 

<213> Homo sapiens
<220>

<221> SIGNAL
<222> -36..-1

<400> 253

Met Ala Ser Val Val 


Pro 


Val 


Lys 


Asp 




Lys Leu 


Leu 


Glu 


Val 


Lys 


-35 




-30 








-25 










Leu Gly Glu Leu Pro 


Ser Trp 


He 


Leu 


Met Arg Asp 


Phe 


Ser 


Pro 


Ser 


-20 


-15 










-10 








-5 


Gly He Phe Gly Ala 


Phe 


Gin 


Arg 


Gly 


Tyr 


Tyr Arg Tyr 


Tyr Asn 


Lys 


1 








5 








10 






Tyr He Asn Val Lys 


Lys 
ys 


Gl 


Ser 


He 


Ser 


Gly He 


Thr 


Met 


Val 


Leu 


15 






20 








25 








Ala Cys Tyr Val Leu 


Phe 


Ser 


Tyr 


Ser 


Phe 


Ser Tyr 


Lys 


His 


Leu 


Lys 


30 




35 


















His Glu Arg Leu Arg 


Lys 


Tyr 


His 
















45 


50 




















<210> 254
<211> 62
<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -42..-1

<220>

<221> UNSURE
<222> -11

<223> Xaa = Glu, Lys

<400> 254

Met Asp Pro Asn Cys Ser Cys Ala Ala Ala Gly Val Ser Cys Thr Cys
-40 -35 -30

Ala Ser Ser Cys Lys Cys Lys Glu Cys Lys Cys Thr Ser Cys Lys Xaa
-25 -20 -15

Ser Cys Cys Ser Cys Cys Pro Val Gly Cys Ala Lys Cys Ala Gln Gly
-10 -5 1 5

Cys Ile Cys Lys Gly Ala Ser Glu Lys Cys Ser Cys Cys Ala
10 15 20

<210> 255
<211> 132
<212> PRT

<213> Homo sapiens

<220>

<221> UNSURE
<222> 118

<223> Xaa = Gly, Val

<400> 255

Leu Phe Pro Ala Pro 


Ala 


Pro 


Pro 


Pro 


Ala 


Pro Ala 


Phe 


Ala 


Pro 


Pro 


1 5 










10 








15 




Pro Lys Val Pro Ser 


Pro 


Glu Arg 


Ser 


Ala 


Pro Arg 


Val 


Pro 


Leu 


Pro 


















30 






Ser Pro Gin Pro Ser 


Tyr 


Pro 


Phe 


Arg 


Pro 


Ala Ala 


Ser Gly 


Gly Thr 


35 






40 








45 








Pro Pro Pro Ala Cys 


Leu 


Pro 


Pro 


Ala 


Gin 


Pro Cys 


Gin 


Gly 


Ser 


Pro 


50 




55 








60 










Ala Met Asn Leu Phe 


Arg 


Phe 


Leu 


Gly 




Leu Ser 


His 


Leu 


Leu 


Ala 


65 


70 










75 








80

Ile Ile Leu Leu Leu Leu Lys Ile Trp Lys Ser Arg Ser Cys Ala Ala
85 90 95

His Pro Gln Leu Pro Leu Ser Phe Cys Leu Ser Val Cys Leu Ser Val
100 105 110

Ser Leu Ser Leu Ser Xaa Ser Leu Ser Leu Ser Phe Ser Val Ser Lys
115 120 125

Lys Lys Lys Lys
130



<210> 256 
<211> 163 
<212> PRT 
<213> Homo sapiens 



<400> 256 



Pro His Arg Leu Thr Leu Pro Thr Leu Gln Ser Val Thr Phe Arg Cys
1 5 10 15

Pro Ser Arg Ser Glu Lys Leu Gly Lys Asn Met Val Ser Ser Phe Arg
20 25 30

Val Ser Glu Leu Gln Val Leu Leu Gly Phe Ala Gly Arg Asn Lys Ser
35 40 45

Gly Arg Lys His Asp Leu Leu Met Arg Ala Leu His Leu Leu Lys Ser
50 55 60

Gly Cys Ser Pro Ala Val Gln Ile Lys Ile Arg Glu Leu Tyr Arg Arg
65 70 75 80

Arg Tyr Pro Arg Thr Leu Glu Gly Leu Ser Asp Leu Ser Thr Ile Lys
85 90 95

Ser Ser Val Phe Ser Leu Asp Gly Gly Ser Ser Pro Val Glu Pro Asp
100 105 110

Leu Ala Val Ala Gly Ile His Ser Leu Pro Ser Thr Ser Val Thr Pro
115 120 125

His Ser Pro Ser Ser Pro Val Gly Ser Val Leu Leu Gln Asp Thr Lys
130 135 140

Pro Thr Phe Glu Met Gln Gln Pro Ser Pro Pro Ile Pro Pro Val His
145 150 155 160

Pro Asp Val

<210> 257 
<211> 166 
<212> PRT 

<213> Homo sapiens 



<400> 257 



Pro His Arg Leu Thr Leu Pro Thr Leu Gln Ser Val Thr Phe Arg Cys
1 5 10 15

Pro Ser Arg Ser Glu Lys Leu Gly Lys Asn Met Val Ser Ser Phe Arg
20 25 30

Val Ser Glu Leu Gln Val Leu Leu Gly Phe Ala Gly Arg Asn Lys Ser
35 40 45

Gly Arg Lys His Asp Leu Leu Met Arg Ala Leu His Leu Leu Lys Ser
50 55 60

Gly Cys Ser Pro Ala Val Gln Ile Lys Ile Arg Glu Leu Tyr Arg Arg
65 70 75 80

Arg Tyr Pro Arg Thr Leu Glu Gly Leu Ser Asp Leu Ser Thr Ile Lys
85 90 95

Ser Ser Val Phe Ser Leu Asp Gly Gly Ser Ser Pro Val Glu Pro Asp
100 105 110

Leu Ala Val Ala Gly Ile His Ser Leu Pro Ser Thr Ser Val Thr Pro
115 120 125

His Ser Pro Ser Ser Pro Val Gly Ser Val Leu Leu Gln Asp Thr Lys
130 135 140

Pro Thr Phe Glu Met Gln Gln Pro Ser Pro Pro Ile Pro Pro Val His
145 150 155 160

Pro Asp Val Gln Leu Lys
165

<210> 258 
<211> 118 
<212> PRT 

<213> Homo sapiens 
<400> 258 



Met Ser Tyr 


Glu 


Gin 


Leu Met 


Gin 


Leu 


Tyr 


Ser Ala Arg Gin Arg Arg 


1 




5 








10 




15 




Arg Leu Asn 


Arg 


Gly 


Leu Arg 


Arg 


Lys 


Gin 


His Ser Leu Leu 




Arg 




20 








25 




30 






Leu Arg Lys 


Ala 


Lys 


Lys Glu 


Ala 


Pro 


Pro 


Met Glu Lys Pro 


Glu 


Val 


35 








40 






45 






Val Lys Thr 


His 


Leu 


Arg Asp 


Met 


He 


He 


Leu Pro Glu Met 


Val 


Gly 


50 






55 








60 






Ser Met Val 


Gly 


Val 


Tyr Asn 


Gly 


Lys 


Thr 


Phe Asn Gin Val 


Glu 


He 


65 






70 








75 




80 


Lys Pro Glu 


Met 


He 


Gly His 


Tyr 


Leu 


Gly 


Glu Phe Ser He 


Thr Tyr 






85 








90 




95 




Lys Pro Val 


Lys 


His 


Gly Arg 


Pro 


Gly 


He 


Gly Ala Thr His 


Ser 


Ser 




100 








105 




110 






Arg Phe Ile


Pro 


Leu 


Lys 














115 




















<210> 259 




















<211> 101 




















<212> PRT 




















<213> Homo sapiens 
















<220> 




















<221> unsure




















<222> 98 




















<223> Xaa = Ile, Asn
















<220> 




















<221> unsure 


















<222> 19 




















<223> Xaa = Ser, Thr
















<400> 259 




















Arg Thr Ala 


Gly 


Arg 


Gly Arg 


Pro 


Ala 


Val 


Ala Ser Trp Glu 


Leu 


Arg 


1 




5 








10 




15 




Ala Arg Xaa 


Cys 


Ala 


Glu Asp 


Pro 


His 


Gin 


Gly Ala Gly Cys 


Ser 


Cys 




20 








25 




30 






Gly Ser Arg 


Ala 


Met 


Ala Glu 


Glu 


Gin 


Gly 


Arg Glu Arg Asp 


Ser 


Val 


35 








40 






45 






Pro Lys Pro 


Ser 


Val 


Leu Phe 


Leu 


His 


Pro 


Asp Leu Gly Val 


Gly 


Gly 


50 






55 








60 






Ala Glu Arg 


Leu 


Val 


Leu Leu 


Pro 


Phe 


Pro 


Thr Glu Arg Arg 


Leu 


Leu 


65 






70 








75 




80 


Phe Cys Thr 


Ala 


Pro 


Arg Ser 


Leu 


Asn 


Ser 


Leu Pro Leu Lys 


Lys 


Lys 



85 90 95 



Lys Xaa Xaa Lys Ser 
100 

<210> 260 
<211> 100 
<212> PRT 

<213> Homo sapiens 



<400> 260 






WO 02/083898 PCT/IB01/00914



Val 


Arg 


Gin 


Gly 


Gly 


Val 


Gly 


Pro 


Gin 


Trp 


Pro 


Ala 


Gly 


Ser Tyr 


Glu 


1 








5 










10 










15 




Arg 


Gly 


Ala 


Cys 


Ala 


Glu 


Asp 


Pro 


His 


Gin 


Gly Ala 


Gly 


Cys 


Ser 


Cys 








20 










25 










30 






Gly 


Ser 


Arg 
35 


Ala 


Met 


Ala 


Glu 


Glu 
40 


Gin 


Gly 


Arg 


Glu 


Arg 
45 


Asp 


Ser 


Val 


Pro 


Lys 
50 


Pro 


Ser 


Val 




Phe 
55 


Leu 


His 


Pro 


Asp 


Leu 
60 


Gly 


Val 


Gly 


Gly 


Ala 


Glu 


Arg 


Leu 


Val 


Leu 


Leu 


Pro 


Phe 


Pro 


Thr 


Glu 


Arg Arg 


Leu 


Leu 


65 










70 










75 










80 


Phe 


Cys 


Thr 


Ala 


Pro 


Arg Ser 


Leu 


Asn 


Ser 


Leu 


Pro 


Leu 


Lys 


Lys 


Lys 










85 










90 










95 




Lys 


Asn 


Ala 


Lys 
100 



























<210> 261 
<211> 147 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> 90 

<223> Xaa = Ala, Gly 
<400> 261 



Met 


Glu 


Tyr 


Val 


Leu 


Glu Val Lys Asn 


Ser 


Pro Arg 


His 


Leu 


Leu 


Lys 


1 








5 




10 










15 




Gin 


Phe 


Thr 


Val 


Cys 


Asp Val Pro Leu 


Tyr 


Asp 


He 


Cys 


Asp 


Tyr 


Asn 








20 




25 










30 






Val 


Ser 


Arg 


Asp 


Arg 


Cys Gin Glu Leu 


Gly 


Cys 


Cys 


Xaa 


Tyr 


Glu 


Gly 






35 






40 








45 








Xaa 


Cys 


Tyr 


Lys 


Lys 


Ala Val Pro He 


Tyr 


He 


His 


Val 


Phe 


Ser 


Ala 




50 








55 






60 










Leu 


He 


Val 


He 


Lys 


Ala Gly Ala Phe 


Val 


He 


Thr 


He 


He 


Tyr Arg 


65 










70 




75 










80 


Val 


He 


Gin 


Glu 


Ser 


Arg Lys Glu Lys 


Xaa 


He 


Pro 


Val 


Tyr 


Val 


Ala 










85 




90 










95 




Leu 


Pro 


Gin 


Lys 


Ser 


Ser Glu Lys Ala 


Glu 


Leu 


Ala 


Ser 


Ser 


Ser 


Ser 








100 




105 










110 










Gly 


Leu 


Lys 


Pro Ala Ser Pro 


Gly 


Pro 


Pro 


Ser 


Ala 


Gly 


Pro 






115 






120 








125 








Ser 


Met 


Lys 


Ser 


Asp 


Glu Asp Lys Asp Asp Val 


Thr 


Gly Thr 


He 


Thr 




130 








135 






140 










Glu 


Ala 


Glu 























145 



<210> 262 
<211> 147 
<212> PRT 
<213> Homo sapiens 



<400> 262 



Met 


Glu 


Tyr Val 


Leu 


Glu 


Val 


Lys 


Asn 


Ser Pro Arg 


His 


Leu 


Leu 


Lys 


1 






5 










10 








15 




Gin 


Phe 


Thr Val 


Cys 


Asp Val 


Pro 


Leu Tyr Asp 


He 


Cys 


Asp 


Tyr 


Asn 






20 










25 








30 






Val 


Ser Arg Asp Arg 


Cys 


Gin 


Glu 


Leu 


Gly Cys 


Cys 


Phe 


Tyr 


Glu 


Gly 






35 








40 








45 








Val 


Cys 


Tyr Lys 


Lys 


Ala 


Val 


Pro 


He 


Tyr He 


His 


Val 


Phe 


Ser 


Ala 




50 








55 








60 










Leu 


He 


Val He 


He 


Ala 


Gly Ala 


Phe 


Val He 


Thr 


He 


He 


Tyr Arg 


65 








70 








75 










80 
Val Ile Gln


Glu 


Ser 


Arg 


Lys 


Glu 


Lys 


Ala 


He 


Pro Val 


Tyr 


Val 


Ala 






85 










90 








95 




Leu Pro Gin 




Ser 


Ser 


Glu 


Lys 


Ala 


Glu 


Leu 


Ala Ser 


Ser 


Ser 


Ser 




100 










105 








110 






Lys Leu Gly 


Leu 


Lys 


Pro 


Ala 


Ser 


Pro 


Gly 


Pro 


Pro Ser 


Ala 


Gly 


Pro 


115 










120 








125 








Ser Met Lys 


Ser 


Asp 


Glu 


Asp 


Lys 


Asp 


Asp Val 


Thr Gly 


Thr 


He 


Thr 


130 








135 










140 








Glu Ala Glu 


























145 


























<210> 263 


























<211> 149 


























<212> PRT 


























<213> Homo sapiens 






















<400> 263 


























Met Arg Tyr 


Asn 


Glu 


Lys 


Glu 


Leu 


Gin 


Ala 


Leu 


Ser Arg 


Gin 


Pro 


Ala 
















10 








15 




Glu Met Ala 


Ala 


Glu 


Leu 


Gly 


Met 


Arg 


Gly 


Pro 


Lys Lys 


Gly 


Ser 


Val 




20 










25 








30 






Leu Lys Arg 


Arg 


Leu 


Val 


Lys 


Leu 


Val 


Val 


Asn 


Phe Leu 


Phe 


Tyr 


Phe 


35 










40 








45 








Arg Thr Asp 


Glu 


Ala 


Glu 


Pro 


Val 


Gly 




Leu 


Leu Leu 


Glu Arg 


Cys 


50 








55 










60 








Arg Val Val 


Arg 


Glu 


Glu 


Pro 


Gly 


Thr 


Phe 


Ser 


He Ser 


Phe 


He 


Glu 


65 






70 










75 








80 


Asp Pro Glu 


Arg 




Tyr 


His 


Phe 


Glu 


Cys 


Ser 


Ser Glu 


Glu 


Gin 


Cys 
















90 








95 




Gin Glu Trp 


Met 


u 


Ala 


Leu 


Arg 


Arg 


Ala 


Ser 


Tyr Glu 


Phe 


Met 


Arg 




100 










105 








110 






Arg Ser Leu 


Ile




Tyr Arg 


Asn 


Glu 


He 


Arg Lys Val 


Thr 


Gly 


Lys 


115 










120 








125 








Asp Pro Leu 


Glu 


Gin 


Phe 


Gly 


He 


Ser 


Glu 


Glu Ala Arg 


Phe 


Gin 


Leu 


130 








135 










140 








Ser Gly Leu 


Gin 


Ala 






















145 


























<210> 264 


























<211> 103 


























<212> PRT 


























<213> Homo sapiens
























<400> 264 


























Met Pro Val 


Val 


Pro 


Ala 




Gly 


Arg 


Pro 


Arg Trp Ala 


Asp 


His 


Leu 


1 




5 










10 








15 




Arg Ser Gly 


Val 


Arg 


Asp Gin 


Pro 


Gly 


Gin 


Pro Gly Glu 


Ala 


Pro 


Pro 




20 










25 








30 






Ser Leu Leu 


Lys 


He 


Gin Lys 


Leu 


Ala 


Gly Tyr Gly Gly 


Gly 


Cys 


Leu 


35 










40 








45 








Trp Ser Gin 


Leu 


Leu 


Gly Arg 


Leu 


Arg 


Arg Glu Asn His 


Leu 


Ser 


Pro 


50 








55 










60 








Gly Gly Gly 


Gly 


Cys 


Ser 


Glu 


Pro 


Arg 




Cys 


His Cys 


Thr 


Pro 


Ala 


65 






70 










75 








80 


Trp Val Thr 


Glu 


Gin 


Asp 


Ser 


He 


Ser 


Lys 


He 


Glu Lys 


He 


Ser 


Gin 






85 










90 








95 




Leu Pro Lys 


Lys 


Lys 


Lys 


Lys 




















100 
























<210> 265 


























<211> 78 


























<212> PRT 


























<213> Homo sapiens
<400> 265



Met Asp Pro 


Asn Cys 


Ser 


Cys 


Ala 


Ala 


Gly 


Val 


Ser 


Cys 


Thr 


Cys 


Ala 


1 


5 










10 










15 




Ser Ser Cys 


Lys Cys 




Glu 


Cys 




Cys 


Thr 


Ser 


Cys 




Lys 


Ser 




20 








25 










30 






Cys Cys Ser 


Trp Cys 


Thr 


Gly 


Pro 


Thr 


Leu 


Val 


Asp 


Thr 


Cys 


Pro 


Ala 


35 








40 










45 








Leu Pro Pro 


Val Cys 


Leu 


Ser 


Gin 


Arg 


Ser 


Ser 


Gly Met Asn 


Leu Arg 


50 






55 










60 










Thr His Val 


Gin Trp Glu 


Val Arg Pro 


Pro 


Leu 


Asn 


He 


Gin 






65 




70 










75 












<210> 266


























<211> 92 


























<212> PRT 


























<213> Homo sapiens 
























<400> 266 


























Met Ser Pro 


Gin Thr 


His 


Ser 


Gin 


Thr 


Cys 


He Arg Asn Leu 


Val 


Thr


1 


5 










10 














Cys Ile Asn


Tyr Pro Arg 


Thr 


Ser 


Thr 


Gly Cys 


Lys 


Gly Thr 


Thr 


Thr 




20 








25 










30 






Gln Arg Ile


Met Glu 


Pro 


Val 


Glu 




Glu 


Val 


Glu Gly Thr 


Glu 


Gin 


35 








40 










45 








Asp Asn Ala 


Lys Thr 


Cys 


Gly 


Ser 


Leu 


Gly Arg Gly Asn Glu 


Asn 


Thr 


50 






55 










60 










Met Leu Arg 


Gly Gly 


Phe 


Ser 


Met 


Asn 


Thr 


Thr 


Val 


Gly Gin 


Gly 


He 


65 




70 










75 










80 


Ser Lys Gin 


Thr His 


His 


Thr 


Ser 


Thr 


Thr 


Ser 


Ser 












85 










90 














<210> 267 


























<211> 51 


























<212> PRT 


























<213> Homo sapiens 
























<400> 267 


























Met Leu Ala 


Lys Thr 


Gly 


Val 


His 


His 


Tyr 


Ser 


Gly Asn Asn 


He 


Glu 


1 


5 










10 










15 




Leu Gly Thr 


Ala Cys 


Gly 


Lys 


Tyr 


Tyr Arg Val 


Cys 


Thr 


Leu 


Ala 


He 




20 








25 










30 






Ile Asp Pro


Gly Asp Ser 


Asp 


He 


He 


Arg Ser 


Met 


Pro 


Glu 


Gin 


Thr 


35 








40 










45 








Gly Glu Lys 


























50 


























<210> 268 


























<211> 99 


























<212> PRT 


























<213> Homo sapiens
























<400> 268 


























Met Lys Lys 


Lys Glu 


Glu 


Thr 


Thr 


Leu 


Ser 


Glu 


Met 


Glu 


Pro 


Val 


Glu 


1 


5 










10 










15 




Pro Gin Tyr 


Gin Leu 


Val 


Asn 


Ala 


Glu 


Ser 


Thr 


Ser 


Pro 


Phe 


Leu 


His 


20 








25 










30 






Cys Leu Arg 


Glu Val 


Ile


Gly 


Glu 


Tyr 


Ser 


Val 


His 


Glu 


Phe 


Ser 


Leu 


35 








40 










45 








Leu Gly Lys 


Thr Glu 


Ser 


Gin 


Gly 


He 


Gly 


Leu 


Trp 


He 


Ala 


Leu 


Val 


50 






55 










60 










Val Phe Leu 


Ser Phe 




He 


Phe 


Ser 


Thr 


Ser 


Phe 


Tyr 


He 


Ser 


Asn 



65 70 75 80 
Ala Glu Gln Pro Phe Phe
85 

Leu Ser Leu 

<210> 269 
<211> 128 
<212> PRT 

<213> Homo sapiens 



<400> 269 



Met 
1 


Ala 


Gly 


Asn 


Leu Leu 

5 


Val 


Pro 


Leu 


Ala 


Cys Arg 








20 




Gly 


Ile


Arg 


Leu 


Thr Leu 






35 






Glu 


Lys 


Arg 


Ala 


Met Phe 




50 








Asn 


Phe 


Glu 




His Pro 


65 








70 


Arg 


Gly 


Trp 


Lys 


Gly Asn 










85 


Met 


Val 


Gly 


Ser 


Arg Met 








100 




Arg 


He 


Arg 


Tyr 


Leu Tyr 



115 



<210> 270 
<211> 52 
<212> PRT 

<213> Homo sapiens 
<400> 270 

Met Phe Leu Thr Leu Ala 
1 5 

Leu Leu Asn Val Tyr Glu 
20 

Leu Leu Glu Ser Trp He 
35 

He Ser Ser His 
50 

<210> 271 
<211> 84 
<212> PRT 

<213> Homo sapiens 
<400> 271 

Met Lys Val Lys He Lys 

1 5 

Val Ala Asn Asp Glu Asn 

20 

Cys Cys Pro Asp Cys Lys 
35 

Gly Gin Cys Ser His Cys 
50 

His Ala Gin Gin Val Gin 
65 70 
Lys Phe Lys Glu 




Lys Glu Pro Pro Thr Glu Ala Ala Lys Glu 



Ser Gly Ala Gly Arg Arg Leu Trp Asp Trp 

10 15 
Ser Phe Ser Leu Gly Val Pro Arg Leu He 

25 30 
Pro Pro Pro Lys Val Val Asp Arg Trp Asn 

40 45 
Gly Val Tyr Asp Asn He Gly He Leu Gly 
55 60 

Lys Glu Leu He Arg Gly Pro He Trp Leu 

75 80 
Glu Leu Gin Arg Cys He Arg Lys Arg Lys 

90 95 
Phe Ala Asp Asp Leu His Asn Leu Asn Lys 

105 110 
Lys His Phe Asn Arg His Gly Lys Phe Arg 
120 125 



Asp Thr Cys Lys Leu Arg Gly Met Ser Phe 

10 15 
Gly Glu Ala Thr Val Ser Ser Val Leu Glu 

25 30 
He Val Gly Asn Glu Arg Tyr Phe Asp Gly 
40 45 



Cys Trp Asn Gly Val Ala Thr Trp Leu Trp 

10 15 
Cys Gly He Cys Arg Met Ala Phe Asn Gly 

25 30 
Val Pro Gly Asp Asp Cys Pro Leu Val Trp 

40 45 
Phe His Met His Cys He Leu Lys Trp Leu 
55 60 

Gin His Cys Pro Met Cys Arg Gin Glu Trp 
75 80 



<210> 272 
<211> 84 
<212> PRT 
<213> Homo sapiens
<400> 272 

Met Lys Val Lys Ile Lys Cys Trp Asn Gly Val Ala Thr Trp Leu Trp

1 5 10 15

Val Ala Asn Asp Glu Asn Cys Gly Ile Cys Arg Met Ala Phe Asn Gly

20 25 30

Cys Cys Pro Asp Cys Lys Val Pro Gly Asp Asp Cys Pro Leu Val Trp

35 40 45

Gly Gln Cys Ser His Cys Phe His Met His Cys Ile Leu Lys Trp Leu

50 55 60

His Ala Gln Gln Val Gln Gln His Cys Pro Met Cys Arg Gln Glu Trp
65 70 75 80 

Lys Phe Lys Glu 

<210> 273 

<211> 84 

<212> PRT 

<213> Homo sapiens 

<400> 273 

Met Lys Val Lys Ile Lys Cys Trp Asn Gly Val Ala Thr Trp Leu Trp

1 5 10 15

Val Ala Asn Asp Glu Asn Cys Gly Ile Cys Arg Met Ala Phe Asn Gly

20 25 30

Cys Cys Pro Asp Cys Lys Val Pro Gly Asp Asp Cys Pro Leu Val Trp

35 40 45

Gly Gln Cys Ser His Cys Phe His Met His Cys Ile Leu Lys Trp Leu

50 55 60

His Ala Gln Gln Val Gln Gln His Cys Pro Met Cys Arg Gln Glu Trp
65 70 75 80 

Lys Phe Lys Glu 

<210> 274 
<211> 84 
<212> PRT 

<213> Homo sapiens 
<400> 274 

Met Lys Val Lys Ile Lys Cys Trp Asn Gly Val Ala Thr Trp Leu Trp

1 5 10 15

Val Ala Asn Asp Glu Asn Cys Gly Ile Cys Arg Met Ala Phe Asn Gly

20 25 30

Cys Cys Pro Asp Cys Lys Val Pro Gly Asp Asp Cys Pro Leu Val Trp

35 40 45

Gly Gln Cys Ser His Cys Phe His Met His Cys Ile Leu Lys Trp Leu

50 55 60

His Ala Gln Gln Val Gln Gln His Cys Pro Met Cys Arg Gln Glu Trp
65 70 75 80 

Lys Phe Lys Glu 

<210> 275 
<211> 84 
<212> PRT 

<213> Homo sapiens 
<400> 275 

Met Lys Val Lys Ile Lys Cys Trp Asn Gly Val Ala Thr Trp Leu Trp

1 5 10 15

Val Ala Asn Asp Glu Asn Cys Gly Ile Cys Arg Met Ala Phe Asn Gly

20 25 30

Cys Cys Pro Asp Cys Lys Val Pro Gly Asp Asp Cys Pro Leu Val Trp

35 40 45

Gly Gln Cys Ser His Cys Phe His Met His Cys Ile Leu Lys Trp Leu

50 55 60

His Ala Gln Gln Val Gln Gln His Cys Pro Met Cys Arg Gln Glu Trp
65 70 75 80 

Lys Phe Lys Glu 

<210> 276 
<211> 80 
<212> PRT 

<213> Homo sapiens 



<400> 276 


























Met 


Ala 


Thr 


Pro 


Thr 


Gin 


Thr 


Pro 


Thr 




Ala 


Pro 


Glu Glu 


Pro Asp 


1 








5 










10 








15 




Pro 


Phe 


Tyr 


Tyr Asp 


Tyr Asn 


Thr 


Val 


Gin 


Thr 


Val 


Gly Met 


Thr 


Leu 








20 










25 








30 






Ala 


Thr 


Ile


Leu 


Phe 


Leu 


Leu 


Gly 


He 


Leu 


He 


Val 


He Ser 


Lys 


Lys 






35 










40 










45 






Val 


Lys 


Cys 


Arg 


Lys 


Ala 


Asp 


Ser 


Arg 


Ser 


Glu 


Ser 


Pro Thr 


Cys 






50 










55 










60 








Ser 


Cys 


Lys 


Ser 


Glu 


Leu 


Pro 


Ser 


Ser 


Ala 


Pro 


Gly Gly Gly 


Gly 


Val 


65 










70 










75 








80 



<210> 277 
<211> 127 
<212> PRT 

<213> Homo sapiens 



Met 


Ser 


Phe 


Ser 


Gly Lys 


Tyr 


Gin 


Leu 


Gin 


Ser 


Gin Glu 


Asn 


Phe 


Glu 


1 








5 








10 








15 




Ala 


Phe 


Met 


Lys 
20 


Ala He 


Gly 


Leu 


Pro 
25 


Glu 


Glu 


Leu He 


Gin 
30 




Gly 


Lys 


Asp 


He 

35 


Lys 


Gly Val 


Ser 


Glu 

40 


He 


Val 


Gin 


Asn Gly 
45 


Lys 


His 


Phe 


Lys 


Phe 

50 


Thr 


He 


Thr Ala 


Gly 
55 


Ser 


Lys 


Val 


He 


Gin Asn 
60 


Glu 


Phe 


Thr 


Val 


Gly 


Glu 


Glu 


Cys Glu 




Glu 


Thr 


Met 


Thr 


Gly Glu 


Lys 


Val 




65 








70 










75 








80 


Thr 


Val 


Val 


Gin 


Leu Glu 


Gly 


Asp Asn 


Lys 


Leu 


Val Thr 


Thr 


Phe 


Lys 










85 








90 








95 




Asn 


He 




Ser 


Val Thr 


Glu 


Leu 




Gly Asp 


He He 


Thr 


Asn 


Thr 








100 








105 








110 






Met 


Thr 


Leu 
115 


Gly 


Asp He 


Val 


Phe 
120 


Lys 


Arg 


He 


Ser Lys 
125 


Arg 


He 





<210> 278 
<211> 87 
<212> PRT 

<213> Homo sapiens 



<400> 278 
























Asp 


Ala 


Val 


Ala 


Val 


Gly Lys 


Phe 


Ser 


Gin 


Trp 


Pro 


Pro 


Glu 


Arg Ser 


1 








5 








10 










15 


Pro 


Ala 


Asp 


Ala 
20 


Leu 


Cys Asp 


Gly 


Lys 
25 


Lys 


Ala 


Ala 


Gly 


Pro 
30 


Thr Ser 


Gly 


Ala 


Phe 


Tyr 


Asn 


Glu Leu 


Arg Ala 


Leu 


Glu 


Ser 


Leu 


Pro 


Gly Asn 






35 








40 










45 






Tyr 


Ser 


Ser 


Ala 


Ser 


Thr Val 


Val 


Leu 


Leu 


Trp 


He 


Ala 


Ser 


Arg Gin 


50 








55 










60 








Lys 


Ser 


Gin 


Ser 


Ser 


Arg Asn 


Ser 


Gin 


Cys 


Ser 


Arg 


Lys 


Asp 


Lys Gly 


65 










70 








75 








80 


Gly 


Lys 


Gin 


Glu 


Glu 


His Glu 
85

<210> 279 
<211> 96 
<212> PRT 

<213> Homo sapiens 



<400> 279 



Met Pro Gly Pro Thr 


Pro 


Ser 


Gly Thr Asn 


Val Gly 


Ser Ser Gly Arg 


1 5 










10 




15 




Ser Pro Ser Lys Ala 


Val 


Ala 


Ala 


Arg Ala 


Ala Gly 


Ser Thr Val 


Arg 


20 








25 






30 




Gin Arg Lys Asn Ala 


Ser 


Cys 


Gly 


Thr 


Arg 


Ser Ala 


Gly Arg Thr 


Thr 


35 






40 








45 




Ser Ala Gly Thr Gly 


Gly Met 


Trp Arg 


Phe 


Tyr Thr 


Glu Asp Ser 


Pro 


50 




55 








60 






Gly Leu Lys Val Gly 


Pro 


Val 


Pro 


Val 




Val Met 


Ser Leu Leu 


Phe 


65 


70 










75 




80 


He Ala Ser Val Phe 










Trp 


Gly Lys 


Tyr Thr Arg 


Ser 


85 










90 








<210> 280 


















<211> 56 


















<212> PRT 


















<213> Homo sapiens 


















<400> 280 


















Met Ser Ser Trp Ser 


Leu 


Met 


Cys 


Met 


Gin 


Asp Met 


Gly Asn Thr 


Lys 


1 5 










10 




15 




Ala Arg Leu Leu Tyr 


Glu 


Ala 


Asn 


Leu 


Pro 


Glu Asn 


Phe Arg Arg 


Pro 


20 








25 






30 




Gin Thr Asp Gin Ala 


Val 


Glu 


Phe 


Phe 


He Arg Asp 


Lys Tyr Glu 


Lys 


35 






40 








45 




Lys Lys Tyr Tyr Glu 


Lys 


Met 














50 




55 














<210> 281 


















<211> 56 


















<212> PRT 


















<213> Homo sapiens 


















<400> 281 


















Met Ser Ser Trp Ser 


Leu 


Met 


Cys 


Met 


Gin 


Asp Met 


Gly Asn Thr 


Lys 


1 5 










10 




15 




Ala Arg Leu Leu Tyr 


Glu 


Ala 






Pro 


Glu Asn 


Phe Arg Arg 


Pro 


20 








25 






30 




Gin Thr Asp Gin Ala 


Val 


Glu 


Phe 


Phe 


He 


Arg Asp 


Lys Tyr Glu 










40 








45 




Lys Lys Tyr Tyr Glu 


Lys 


Met 


Pro 












50 




55 














<210> 282 


















<211> 56 


















<212> PRT 


















<213> Homo sapiens 


















<400> 282 


















Met Ser Ser Trp Ser 


Leu 


Met 


Cys 


Met 


Gin 


Asp Met 


Gly Asn Thr 


Lys 


1 5 










10 




15 




Ala Arg Leu Leu Tyr 


Glu 


Ala 


Asn 


Leu 


Pro 


Glu Asn 


Phe Arg Arg 


Pro 


20 








25 






30 




Gin Thr Asp Gin Ala 


Val 


Glu 


Phe 


Phe 


He 


Arg Asp 


Lys Tyr Glu 





35 40 45 
Lys Lys Tyr Tyr Glu Lys Met Pro



<210> 283 
<211> 56 
<212> PRT 
<213> Homo sapiens 

<400> 283 



Met 


Ser Ser Trp Ser 


Leu 


Met 


Cys 


Met 


Gin Asp Met 


Gly Asn Thr 


Lys 


1 


5 










10 


15 




Ala 


Arg Leu Leu Tyr 


Glu 


Ala 


Asn 


Leu 


Pro Glu Asn 


Phe Arg Arg 


Pro 




20 








25 




30 




Gin 


Thr Asp Gin Ala 


Val 


Glu 


Phe 


Phe 


Ile Arg Asp


Lys Tyr Glu 


Lys 




35 






40 






45 




Lys 


Lys Tyr Tyr Glu 


Lys 


Met 


Pro 











50 55 



<210> 284 

<211> 56 

<212> PRT 

<213> Homo sapiens 

<400> 284 



Met 


Ser 


Ser 


Trp Ser 


Leu 


Met 


Cys 


Met 


Gin Asp Met 


Gly Asn Thr Lys 


1 






5 










10 


15 


Ala 


Arg 


Leu 


Leu Tyr 


Glu 


Ala 


Asn 


Leu 


Pro Glu Asn 


Phe Arg Arg Pro 








20 








25 




30 


Gin 


Thr 


Asp 


Gin Ala 


Val 


Glu 


Phe 


Phe 


Ile Arg Asp


Lys Tyr Glu Lys 






35 








40 






45 


Lys 


Lys 


Tyr 


Tyr Glu 


Lys 


Met 


Pro 









50 55 



<210> 285 
<211> 56 
<212> PRT 



Met 


Ser 


Ser 


Trp 


Ser 


Leu 


Met 


Cys 


Met 


Gin Asp Met 


Gly Asn Thr 


Lys 


1 








5 










10 


15 




Ala 


Arg 


Leu 


Leu 


Tyr 


Glu 


Ala 


Asn 


Leu 


Pro Glu Asn 


Phe Arg Arg 


Pro 








20 










25 




30 




Gin 


Thr 


Asp 


Gin 


Ala 


Val 


Glu 


Phe 


Phe 


Ile Arg Asp


Lys Tyr Glu 


Lys 






35 










40 






45 




Lys 




Tyr 


Tyr 


Glu 




Met 


Pro 











50 55 



<210> 286 
<211> 56 
<212> PRT 
<213> Homo sapiens 

<400> 286 



Met 


Ser Ser Trp Ser 




Met 


Cys 


Met 


Gin Asp Met 


Gly Asn Thr 


Lys 


1 


5 










10 


15 




Ala 


Arg Leu Leu Tyr 


Glu 


Ala 


Asn 


Leu 


Pro Glu Asn 


Phe Arg Arg 


Pro 




20 








25 




30 




Gin 


Thr Asp Gin Ala 


Val 


Glu 


Phe 


Phe 


Ile Arg Asp


Lys Tyr Glu 


Lys 




35 






40 






45 




Lys 


Lys Tyr Tyr Glu 


Lys 


Met 


Pro 











50 55 




<210> 287 
<211> 56 
<212> PRT 
<213> Homo sapiens 



<400> 287 

Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met Gly Asn Thr Lys
1 5 10 15

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro
20 25 30

Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp Lys Tyr Glu Lys
35 40 45

Lys Lys Tyr Tyr Glu Lys Met Pro
50 55



<210> 288 
<211> 56 
<212> PRT 
<213> Homo sapiens 

<400> 288 

Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met Gly Asn Thr Lys
1 5 10 15

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro
20 25 30

Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp Lys Tyr Glu Lys
35 40 45

Lys Lys Tyr Tyr Glu Lys Met Pro
50 55



<210> 289 
<211> 56 
<212> PRT 
<213> Homo sapiens 

<400> 289 

Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met Gly Asn Thr Lys
1 5 10 15

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro
20 25 30

Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp Lys Tyr Glu Lys
35 40 45

Lys Lys Tyr Tyr Glu Lys Met Pro
50 55



<210> 290 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 290 

Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met Gly Asn Thr Lys
1 5 10 15

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro
20 25 30

Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp Lys Tyr Glu Lys
35 40 45

Lys Lys Tyr Tyr Glu Lys Met Pro
50 55



<210> 291 
<211> 56 
<212> PRT 
<213> Homo sapiens
<400> 291

Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met Gly Asn Thr Lys

1 5 10 15

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro

20 25 30

Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp Lys Tyr Glu Lys

35 40 45 

Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 

<210> 292 
<211> 56 
<212> PRT 

<213> Homo sapiens 

<400> 292 

Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met Gly Asn Thr Lys

1 5 10 15

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro

20 25 30

Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp Lys Tyr Glu Lys

35 40 45 

Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



<210> 293 
<211> 56 
<212> PRT 
<213> Homo sapiens



<400> 293 

Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met Gly Asn Thr Lys
1 5 10 15

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro
20 25 30

Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp Lys Tyr Glu Lys
35 40 45

Lys Lys Tyr Tyr Glu Lys Met Pro
50 55



<210> 294 
<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 294 

Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met Gly Asn Thr Lys
1 5 10 15

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro
20 25 30

Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp Lys Tyr Glu Lys
35 40 45

Lys Lys Tyr Tyr Glu Lys Met Pro
50 55



<210> 295 
<211> 122 
<212> PRT 
<213> Homo sapiens



<220> 
<221> UNSURE
<222> 113 

<223> Xaa = Ala, Asp 
<400> 295 



Met Glu Ala 


Ser 


Ala 


Leu 


Thr 


Ser 


Ser 


Ala 


Val 


Thr 


Ser 


Val 


Ala Lys 


1 




5 










10 










15 


Val Val Arg 


Val 


Ala 


Ser 


Gly Ser Ala Val 


Val 


Leu 


Pro Leu Ala Arg 




20 










25 










30 




Ile Ala Thr


Val 


Val 


He 


Gly Gly 


Val 


Val 


Ala 


Met 


Ala 


Ala 


Val Pro 


35 










40 










45 






Met Val Leu 


Ser 


Ala 


Met 


Gly 


Phe 


Thr 


Ala 


Ala 


Gly 


He 


Ala 


Ser Ser 


50 








55 










60 








Ser He Ala 


Ala 


Lys 


Met 


Met 


Ser 


Ala 


Ala 


Ala 


He 


Ala 


Asn Gly Gly 


65 






70 










75 








80 


Gly Val Ala 


Ser 


Gly Ser 


Leu 


Val 


Ala 


Thr 


Leu 


Gin 


Ser Leu Gly Ala 






85 










90 










95 


Thr Gly Leu 


Ser 


Gly 


Leu 


Thr 


Lys 


Phe 


He 


Leu Gly 


Ser 


He 


Gly Ser 




100 










105 










110 




Xaa He Ala 


Ala 


Val 


He 


Ala 


Arg 


Phe 


Tyr 












115 










120 
















<210> 296 


























<211> 122 


























<212> PRT 


























<213> Homo sapiens 






















<220> 


























<221> UNSURE 
























<222> 79 


























<223> Xaa = Gly, Arg






















<400> 296 


























Met Glu Ala 


Ser 


Ala 


Leu 


Thr 


Ser 


Ser 


Ala 


Val 


Thr 


Ser 


Val 


Ala Lys 


1 




5 




















15 


Val Val Arg 


Val 


Ala 


Ser 


Gly 


Ser 


Ala 


Val 


Val 


Leu 


Pro 


Leu 


Ala Arg 




20 










25 










30 




He Ala Thr 


Val 


Val 


He 


Gly 


Gly 


Val 


Val 


Ala 


Met 


Ala 


Ala 


Val Pro 


35 










40 










45 






Met Val Leu 


Ser 


Ala 


Met 


Gly 


Phe 


Thr 


Ala 


Ala 


Gly 


He 


Ala 


Ser Ser 


50 








55 










60 








Ser He Ala 


Ala 




Met 


Met 


Ser 


Ala 


Ala 


Ala 


He 


Ala 


Asn 


Xaa Gly 


65 






70 










75 








80 


Gly Val Ala 


Ser 


Gly Ser 


Leu 


Val 


Ala 


Thr 


Leu 


Gin 


Ser 


Leu 


Gly Ala 






85 










90 










95 


Thr Gly Leu 


Ser 


Gly Leu 


Thr 


Lys 


Phe 


He 


Leu Gly 


Ser 


He 


Gly Ser 




100 










105 










110 




Ala He Ala 


Ala 


Val 


He 


Ala 


Arg 


Phe 


Tyr 












115 










120 
















<210> 297 


























<211> 74 


























<212> PRT 


























<213> Homo sapiens 






















<400> 297 


























Met Ala Ser 


Ser 


Thr 


Val 


Pro 


Val 


Ser 


Ala 


Ala 


Gly 


Ser 


Ala 


Asn Glu 


1 




5 










10 










15 


Thr Pro Glu 


He 


Pro Asp 


Asn 


Val 


Gly 




Trp 


Leu 




Gly 


Val Tyr 




20 










25 










30 




Arg Phe Ala 


Thr 


Asp 


Arg 


Asn 


Asp 


Phe 


Arg 


Arg Asn 


Leu 


He 


Leu Asn 


35 










40 










45 






Leu Gly Leu 


Phe 


Ala 


Ala 


Gly 


Val 


Trp 


Leu 


Ala 




Asn 


Leu 
Ile Asp Leu Met Ala Pro Gln Pro Gly Val



<210> 298 
<211> 74 
<212> PRT 
<213> Homo sapiens



<400> 298 


















Met 


Ala Ser 


Ser Thr 


Val Pro 


Val 


Ser Ala 


Ala Gly Ser 


Ala 


Asn 


Glu 


1 




5 






10 






15 




Thr 


Pro Glu 


He Pro 


Asp Asn 


Val 


Gly Asp 


Trp Leu Arg 


Gly 


Val 


Tyr 






20 






25 




30 






Arg 


Phe Ala 


Thr Asp 


Arg Asn 


Asp 


Phe Arg 


Arg Asn Leu 


He 


Leu 


Asn 




35 






40 




45 








Leu 


Gly Leu 


Phe Ala 


Ala Gly 


Val 


Trp Leu 


Ala Arg Asn 


Leu 


Ser 


Asp 




50 




55 






60 








He 


Asp Leu 


Met Ala 


Pro Gin 


Pro 


Gly Val 










65 






70 















<210> 299 
<211> 147 

<212> PRT 

<213> Homo sapiens 



<400> 299 


























Met 


Asp 


Val 


Leu 


Val 


Ser 


Glu 


Cys 


Ser 


Ala 


Arg Leu 


Leu 


Gin 


Gin 


Glu 


1 








5 










10 








15 




Glu 


Glu 


He 


Lys 


Ser 


Leu 


Thr 


Ala 


Glu 


He 


Asp Arg 


Leu 


Lys 


Asn 


Cys 








20 










25 








30 






Gly 


Cys 


Leu 


Gly 


Ala 


Ser 


Pro 


Asn 


Leu 


Glu 


Gin Leu 


Gin 


Glu 


Glu 


Asn 






35 










40 








45 








Leu 


Lys 


Leu 


Lys 


Tyr 


Arg 


Leu 


Asn 


He 


Leu 


Arg Lys 


Ser 


Leu 


Gin 


Ala 




50 










55 








60 










Glu 


Arg 


Asn 


Lys 


Pro 


Thr 




Asn 


Met 


He 


Asn He 


He 


Ser 


Arg 




55 










70 










75 








80 


Gin 


Glu 


Val 


Phe 


Gly 


His 


Ala 


He 


Lys Ala 


Ala Tyr 


Pro 


Asp Leu 


Glu 










85 










90 








95 




Asn 


Pro 


Pro 


Leu 


Leu 


Val 


Thr 


Pro 


Ser 


Gin 


Gin Ala 


Lys 


Phe 


Gly Asp 








100 










105 








110 






Tyr 


Gin 


Cys 


Asn 


Ser 


Ala 


Met 


Gly 


He 


Ser 


Gin Val 


Met 


Tyr 


Cys 


His 






115 










120 








125 








Asp 


Ser 


Trp 


Leu 


Phe 


Asp 


Phe 


Phe 


Lys 


Tyr 


Tyr Tyr 


His 


His 


Cys 


His 




130 










135 








140 










Leu 


Gin 




























145 































<210> 300 
<211> 97 
<212> PRT 

<213> Homo sapiens 



<400> 300 
























Met 


Ser Ser Ala 


Pro 


Glu 


Pro 


Pro 


Thr 


Phe 


Lys Lys 


Glu 


Pro 


Pro 


Lys 


1 




5 










10 








15 




Glu 


Lys Glu Phe 


Gin 


Ser 


Pro 


Gly 


Leu 


Arg 


Gly Val 


Arg 


Thr 


Thr 


Thr 




20 










25 








30 






Leu 


Phe Arg Ala 


Val 


Asn 


Pro 


Glu 


Leu 


Phe 


He Lys 


Pro 


Asn 


Lys 


Pro 




35 








40 








45 








Val 


Met Ala Phe 


Gly 


Leu 


Val 


Thr 


Leu 


Ser 


Leu Cys 


Val 


Ala 


Tyr 


He 




50 






55 








60 










Gly 


Tyr Leu His 


Ala 


He 


Gin 


Glu 


Asn 


Lys 


Lys Asp Leu 


Tyr 


Glu 


Ala 
65 70 75 80

Ile Asp Ser Glu Gly His Ser Tyr Met Arg Arg Lys Thr Ser Lys Trp
85 90 95 

Asp 

<210> 301 
<211> 98 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 30 

<223> Xaa = Lys, Asn



<400> 301 



Met 

1 


Ala 


Ala 


Glu 


Pro 

5 


Leu 


Thr 


Glu 


Leu 


Glu 
10 


Glu 


Ser 


He 


Glu Asn Val 

15 


Val 


Thr 


Thr 


Phe 
20 


Phe 


Thr 


Phe 


Ala 


Arg 
25 


Gin 


Glu 


Gly 


Arg 


Xaa Asp Ser 
30 


Leu 


Ser 


Val 
35 


Asn 


Glu 


Phe 


Lys 


Glu 
40 . 


Leu 


Val 


Thr 


Gin 


Gin 
45 


Leu Pro His 


Leu 


Leu 


Lys 


Asp 


Val 


Gly 


Ser 


Leu Asp Glu Lys 


Met 


Lys 


Ser Leu Asp 




50 










55 










60 






Val 


Asn 


Gin Asp 


Ser 


Glu 


Leu 


Lys 


Phe 


Asn 


Glu 


Tyr 


Trp Arg Leu He 


65 










70 










75 






80 


Gly 


Glu 


Leu 


Ala 


Lys 
85 


Glu 


He 


Arg 


Lys 


Lys 
90 


Lys 


Asp 


Leu 


Lys He Arg 
95 


Lys 


Lys 



























<210> 302 
<211> 115 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 103 

<223> Xaa = Asp, Tyr 
<220> 

<221> UNSURE 
<222> 100 

<223> Xaa = Gly, Ser 
<400> 302 



Met 


Val 


Ala 


Ala 


Lys 


Lys 


Thr 


Lys 


Lys Ser Leu 


Glu 


Ser 


He 


Lys 


Ser 


1 








5 








10 








15 




Arg 


Leu 


Gin 


Leu 
20 


Val 


Met 




Ser 


Gly Lys Tyr 
25 


Val 




Gly 
30 


Tyr 


Lys 


Gin 


Thr 


35 


Lys 


Met 


He 


Arg 


Gin 
40 


Gly Lys Ala 


Lys 


Leu 
45 


Val 


He 


Leu 


Ala 


Asn 
50 


Asn 


Cys 


Pro 


Ala 


Leu 
55 


Arg 


Lys Ser Glu 


He 
60 


Glu 


Tyr 


Tyr 


Ala 


Met 


Leu 


Ala 


Lys 


Thr 


Gly 


Val 


His 


His Tyr Ser 


Gly Asn Asn 


He 


Glu 


65 










70 






75 










80 


Leu 


Gly 


Thr 


Ala 


Cys 


Gly 


Lys 


Tyr Tyr Arg Val 


Cys 


Thr 


Leu 


Ala 


He 










85 








90 








95 




He 


Asp 


Pro 


Xaa 
100 


Asp 


Ser 


Xaa 


He 


He Arg Ser 
105 


Met 


Pro 


Glu 
110 


Gin 


Thr 



Gly Glu Lys 
115 
<210> 303
<211> 102 
<212> PRT 

<213> Homo sapiens 



Met 


Ala 


Gly 


Gin 


Ala Phe 






Phe 


Leu 


Pro 


Leu 


Phe Asp Arg 


Val 


1 








5 








10 






15 




Leu 


Val 


Glu 


Arg 
20 


Ser Pro 


Ala 


Glu 


Thr 
25 


Val 


Thr 




Gly Gly He 
30 


Met 


Leu 


Pro 


Glu 


Lys 


Ser Gin 


Gly Lys 


Val 


Leu 


Gin 


Ala 


Thr Val Val 


Ala 






35 








40 










45 




Val 


Gly 
50 


Ser 


Gly 


Ser Lys 


Gly 
55 


Lys 


Gly 


Gly 


Glu 


He 
60 


Gin Pro Val 


Ser 


Val 


Lys 


Val 


Gly 


Asp Lys 


Val 


Leu 


Leu 


Pro 


Glu 


Tyr 


Gly Gly Thr Lys 


65 








70 










75 






80 


Val 


Val 


Leu 


Asp 


Asn Lys 


Asp 


Tyr 


Phe 


Leu 


Phe 


Arg Asp Gly Asp 


He 










85 








90 






95 




Leu 


Gly 


Lys 


Tyr 
100 


Val Asp 



















<210> 304 
<211> 102 
<212> PRT 

<213> Homo sapiens 



Met 


Ala 


Gly 


Gin 


Ala 


Phe Arg Lys 


Phe 


Leu 


Pro 


Leu 


Phe Asp Arg Val 


1 








5 






10 






15 




Leu 


Val 


Glu 


Arg 


Ser 


Ala Ala Glu 


Thr 


Val 


Thr 


Lys 


Gly Gly He 


Met 








20 






25 








30 




Leu 


Pro 


Glu 


Lys 


Ser 


Gin Gly Lys 


Val 


Leu 


Gin 


Ala 


Thr Val Val 


Ala 






35 






40 










45 




Val 


Gly 


Ser 


Gly 


Ser 


Lys Gly Lys 


Gly 


Gly Glu 


He 


Gin Pro Val 


Ser 




50 








55 








60 






Val 


Lys 


Val 


Gly 


Asp 


Lys Val Leu 


Leu 


Pro 


Glu 


Tyr 


Gly Gly Thr 




65 










70 






75 






80 


Val 


Val 


Leu 


Asp 


Asp 


Lys Asp Tyr 


Phe 


Leu 


Phe 


Arg Asp Gly Asp 


He 










85 






90 






95 




Leu 


Gly 


Lys 


Tyr 


Val Asp 




















100 



















<210> 305 

<211> 116 

<212> PRT 

<213> Homo sapiens 



<400> 305 



Met 


Ser 


Ala 


Thr 


Ala 


Ala 


Thr 


Ala 


Pro 


Pro 


Ala 


Ala 


Pro 


Ala 


Gly Glu 


1 








5 










10 










15 




Gly 


Gly 


Pro 


Pro 


Ala 


Pro 


Pro 


Pro 


Asn 


Leu 


Thr 


Ser 


Asn Arg Arg 


Leu 








20 










25 










30 






Gin 


Gin 


Thr 
35 


Gin 


Ala 


Gin 


Val 


Asp 
40 


Glu 


Val 


Val 


Asp 


He 
45 


Met 


Arg 


Val 


Asn 


Val 


Asp 




Val 




Glu 


Arg Asp 


Gin 




Leu 


Ser 


Glu 


Leu 






50 










55 










60 










Asp 


Arg 


Ala 


Asp 


Ala 


Leu 


Gin 


Ala 


Gly 


Pro 


Ser 


Gin 


Phe 


Glu 


Thr 


Ser 


65 










70 










75 










80 


Ala 


Ala 




Leu 


Lys 
85 


Arg 


Lys 


Tyr 


Trp 


Trp 
90 


Lys 


Asn 


Leu 


Lys 


Met 
95 


Met 


He 


He 


Leu 


Gly 


Val 


He 


Cys 


Ala 


He 


He 


Leu 


He 


He 


He 


He 


Val 



100 105 110 



Tyr Phe Ser Thr 
<210> 306
<211> 97 
<212> PRT 

<213> Homo sapiens 



<400> 306 



Met Ser Leu 


Ser 


Leu 


Val 


Phe Arg Ala 


Ala 


Ser Tyr 


Phe 


Lys 


Leu 


Val 


1 




5 










10 










15 




Pro Phe His 


Ser 


Ser 


Ser 


Ser 






Phe 




Gin 


Pro 


Pro 


Gly 


Trp 




20 










25 










30 






Val Val Leu 


Thr 


Gin 


Thr 


Leu 


Val 


Leu 


Leu 


His 


Phe 


Glu 


Arg 


Phe 


Ser 


35 










40 










45 








Tyr Gin Asn 


Val 


Pro 


Lys 


Ser 


Ala 


Gin 


Gly 


Lys 


Gly Asn Leu Gin 


Pro 


50 








55 










60 










Glu Thr Asn 


He 


His 


Leu 


Phe 


His 


Phe 


Leu 


Thr 


Phe 


Pro 


Lys 


Gin 


He 


65 






70 










75 










80 


Ser Arg Asn 


Leu 


Phe 


Asn 


Ser 


Leu 


Leu 


Cys 




Met 


Cys 


Leu 


Thr 


Tyr 






85 










90 










95 




Phe 




























<210> 307
<211> 110
<212> PRT
<213> Homo sapiens

<400> 307




























Met Ser Met 


Thr 


Asp 


Leu 


Leu 


Asn 


Ala 


Glu 


Asp 


He 


Lys 


Lys 


Ala 


Val 


1 




5 










10 










15 




Gly Ala Phe 


Ser 


Ala 


Thr 


Asp Ser 


Phe 


Asp 


His 


Lys 


Lys 


Phe 


Phe 


Gin 




20 










25 










30 






Met Val Gly 


Leu 




Lys 


Lys 


Ser 


Ala 


Asp 


Asp Val 


Lys 


Lys 


Val 


Phe 


35 










40 










45 








His Met Leu 


Asp 


Lys 


Asp 


Lys 


Ser Gly 


Phe 


He 


Glu Glu Asp 


Glu 


Leu 


50 








55 










60 










Gly Phe lie 


Leu 


Lys 


Gly 


Phe 


Ser 


Pro Asp 


Ala 


Arg Asp 


Leu 


Ser 


Ala 


65 






70 










75 










80 


Lys Glu Thr 


Lys 


Met 


Leu 


Met 


Ala 


Ala 


Gly 


Asp 


Lys 


Asp Gly Asp 


Gly 






85 










90 










95 




Lys He Gly 


Val 


Asp 


Glu 


Phe 


Ser 


Thr 


Leu 


Val 


Ala 


Glu 


Ser 








100 










105 










110 






<210> 308
<211> 110
<212> PRT
<213> Homo sapiens

<400> 308




























Met Ser Met 


Thr 


Asp Leu 


Leu 


Asn 


Ala 


Glu 


Asp 


He 


Lys 


Lys 


Ala 


Val 


1 




5 










10 










15 




Gly Ala Phe 


Ser 


Ala 


Thr 




Ser 


Phe 


Asp 


His 


Lys 


Lys 


Phe 


Phe 


Gin 




20 










25 










30 






Met Val Gly 


Leu 


Lys 






Ser 


Ala 


Asp 


Asp 


Val 


Lys 


Lys 


Val 


Phe 


35 










40 










45 








His Met Leu 


Asp 


Lys 


Asp 


Lys 


Ser 


Gly 


Phe 


He 


Glu 


Glu 


Asp 


Glu 


Leu 


50 








55 










60 










Gly Phe He 


Leu 




Gly 


Phe 


Ser 


Pro Asp 


Ala 


Arg 




Leu 


Ser 


Ala 


65 






70 










75 










80 


Lys Glu Thr 


Lys 


Met 


Leu 


Met 


Ala 


Ala 


Gly 


Asp 


Lys 


Asp 


Gly Asp 


Gly 




85 










90 










95 




Lys He Gly 


Val 


Asp Glu 


Phe 


Ser 


Thr 


Leu 


Val 


Ala 


Glu 


Ser 







100 105 110 
<210> 309
<211> 110 
<212> PRT 

<213> Homo sapiens 
<400> 309 



Met Ser Met 




Asp Leu 






Ala 


Glu 


Asp He 


Lys Lys 


Ala 




1 




5 










10 






15 




Gly Ala Phe 


Ser 


Ala 


Thr 


Asp 


Ser 


Phe Asp 


His Lys 


Lys Phe 


Phe 


Gin 




20 










25 






30 






Met Val Gly 






Lys 


Lys 


Ser 


Ala 


Asp 


Asp Val 


Lys Lys 


Val 


Phe 


35 










40 








45 






His Met Leu 


Asp 


Lys 


Asp 


Lys 


Ser 


Gly 


Phe 


He Glu 


Glu Asp 


Glu 


Leu 


50 








55 








60 








Gly Phe He 


Leu 




Gly 


Phe 


Ser 


Pro Asp 


Ala Arg Asp Leu 


Ser 


Ala 


65 
















75 






80 


Lys Glu Thr 


Lys 


Met 


Leu 


Met 


Ala 


Ala 


Gly 


Asp Lys 


Asp Gly Asp 








85 










90 






95 




Lys He Gly 


Val 


Asp 


Glu 


Phe 


Ser 


Thr 


Leu 


Val Ala 


Glu Ser 








100 










105 






110 






<210> 310
<211> 112
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 8,33
<223> Xaa = Asp, Glu

<400> 310

Met Asp Pro Arg Lys Val Asn Xaa Leu Arg Ala Phe Val Lys Met Cys
1 5 10 15

Lys Gln Asp Pro Ser Val Leu His Thr Glu Glu Met Arg Phe Leu Arg
20 25 30

Xaa Trp Val Glu Ser Met Gly Gly Lys Val Pro Pro Ala Thr Gln Lys
35 40 45

Ala Lys Ser Glu Glu Asn Thr Lys Glu Glu Lys Pro Asp Ser Lys Lys
50 55 60

Val Glu Glu Asp Leu Lys Ala Asp Glu Pro Ser Ser Glu Glu Ser Asp
65 70 75 80

Leu Glu Ile Asp Lys Glu Gly Val Ile Glu Pro Asp Thr Asp Ala Pro
85 90 95

Gln Glu Met Gly Asp Glu Asn Ala Glu Ile Thr Ala Gln His Phe Leu
100 105 110






<210> 311
<211> 118
<212> PRT
<213> Homo sapiens

<400> 311
























Met Val Leu 




Glu 


Ser 


Glu 


Gin 


Phe 


Leu 


Thr Glu 


Leu Thr 


Arg 


Leu 


1 




5 










10 






15 




Phe Gin Lys 


Cys 


Arg 


Thr 


Ser 


Gly 


Ser 


Val 


Tyr He 


Thr Leu 


Lys 


Lys 




20 










25 






30 






Tyr Asp Gly Arg 


Thr 


Lys 


Pro 


He 


Pro 


Lys 


Lys Gly Thr Val 


Glu 


Gly 


35 










40 








45 






Phe Glu Pro 


Ala 


Asp Asn 


Lys 


Cys 


Leu 


Leu 


Arg Ala 


Thr Asp 


Gly 


Lys 


50 








55 








60 








Lys Lys He 


Ser 


Thr 


Val 


Val 


Ser 


Ser 


Lys 


Glu Val 


Asn Lys 


Phe 


Gin 
Met Ala Tyr Ser Asn
85 

Arg Asp Lys Lys Asn 
100 

Ala Ala Thr Ala Ala 
115 

<210> 312 
<211> 109 

<212> PRT 

<213> Homo sapiens 
<400> 312 



Met 


Asp 


Ser 


Leu 


Pro 


1 








5 


Leu 


Gly 


Arg 


Asn 

20 


Arg 


Lys 


Arg 


Thr 
35 


Glu 


Phe 


Pro 


Val 
50 


Gin 


Ser 


Leu 


Glu 


Phe 


Leu 


Gly 


Phe 


65 










Val 


Leu 


Asp 


He 


Trp 
85 


Phe 


Phe 


Ser 


Ser 


Pro 



100 

<210> 313 
<211> 103 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> 99 

<223> Xaa = Arg, Thr 
<220> 

<221> unsure 
<222> 103 

<223> Xaa = His, Asn 



<400> 313 






Met 
1 


Ser 


Gly 


Asp 


Gly 
5 


Lys 


Val 


Lys 


Lys 


Ala 








20 




Asp 


Ala 


Trp 


Ser 


He 






35 






Lys 


Ala 


Arg 


Lys 


Thr 




50 








Gly 


Arg 


Phe 


Trp 


Lys 


65 










Tyr 


Phe 


Phe 


Leu 


Tyr 










85 


Gin 


Val 


Xaa 


Asn 


He 



100 






Leu Leu Arg Ala Asn Met Asp Gly Leu Lys Lys 

90 95 
Glu Thr Lys Lys Thr Lys Ala Ala Ala Ala Ala 
105 110 

Gin 



Ser Gly Lys 


He 


His 


Arg Lys 


Val 


Lys 


He 


He 






10 








15 




Lys Glu Asn 


Leu 


Glu 


Pro Asn 


Ala 


Glu 


Phe 


Asp 




25 








30 






He Thr Gin 


Glu 


Glu 


Asn Arg 


He 


Cys 


Ser 


Ser 


40 








45 








Leu Asp Leu 


Phe 


Gin 


Thr Ser 


Glu 


Glu 


Lys 


Ser 


55 






60 










Thr Ser Tyr 


Thr 


Glu 


Lys Ser Gly He 


Cys 


Asn 


70 






75 








80 


Glu Glu Glu 


Asn 


Ser 


Asp Asn 


Leu 


Leu 


Thr 


Ala 






90 








95 




Ser Thr Ser 


Thr 


Phe 


Thr Gly 


Phe 









105 



Ala 


Thr Glu 


Gin 


Ala 


Ala Glu Tyr 


Val 


Pro 


Glu 








10 










15 




Glu 


Lys Lys 


Leu 


Glu 


Glu 




Pro 


Tyr 


Asp 








25 










30 






Leu 


He Arg Glu 


Ala 


Gin 


Asn 


Gin 


Pro 


He Asp 




40 










45 








Tyr 


Glu Arg 


Leu 


Val 


Ala 


Gin 


Phe 


Pro 


Ser 


Ser 




55 








60 










Leu Tyr He 


Glu 


Ala 


Glu 


Val 


Thr 


He 


Leu 


Phe 


70 








75 










80 


Gin 


Tyr Cys 


Ser 


He 


His 


Cys 


Ser Asp Arg 


Lys 








90 










95 




Ala 


Xaa 



















<210> 314 
<211> 72 
<212> PRT 
<213> Homo sapiens
<400> 314 



Met Ser Gly Asp Gly Ala 


Thr 


Glu 


Gin 


Ala 


Ala Glu 




Val 


Ser 


Glu 


1 5 










10 








15 




Lys Val Lys Lys Ala 


Glu 


Lys 


Lys 


Leu 


Glu 


Glu Asn 


Pro 


Tyr 


Asp 


Leu 


20 








25 








30 








Leu 


He 


Arg 


Glu 


Ala 


Gin Asn 


Gin 


Pro 


He Asp 


35 






40 








45 








Lys Ala Arg Lys Thr 


Tyr 


Glu Arg 


Leu 


Val 


Ala Gin 


Phe 


Pro 


Ser 


Ser 


50 




55 








60 










Gly Arg Phe Trp Lys 


Leu 


Tyr 


He 
















65 


70 




















<210> 315
<211> 106
<212> PRT
<213> Homo sapiens

<400> 315






















Met Leu Lys Ser Asn 


Gly 


Glu Arg Arg Ser 


Arg Asn Ala 


Leu 


Pro 


Ala 


1 5 










10 








15 




Val Tyr Ala Arg Lys 


Met 


Ala 


Ala 


Ser 


Gin 


Gin Gin 


Ala 


Ser 


Ala 


Ala 


20 








25 








30 






Ser Ser Ala Ala Gly Val 


Ser 


Gly 


Pro 


Ser 


Ser Ala 


Gly 


Gly 




Gly 


35 






40 








45 








Pro Gin Gin Gin Pro 


Gin 


Pro 


Pro 


Ala 


Gin 


Leu Val 


Gly 


Pro 


Ala 


Gin 


50 




55 








60 










Ser Gly Leu Leu Gin 


Gin 


Gin 


Gin 


Gin 




Phe Asp 


Pro 


Val 


Gin 


Arg 


65 


70 










75 








80 


Tyr Lys Met Leu lie 


Pro 


Gin 


Leu 


Lys 


Glu 


Ser Leu 


Gin 


Val 


He 


Gly 


85 










90 








95 




Leu Lys Gin Arg Glu 


Ala 


Asn Trp 


He 


Trp 












100 








105 














<210> 316
<211> 86
<212> PRT
<213> Homo sapiens

<400> 316






















Met Ala Glu Asp Met 


Glu 


Thr 


Lys 


He 


Lys 


Asn Tyr 


Lys 


Thr 


Ala 


Pro 


1 5 










10 








15 




Phe Asp Ser Arg Phe 


Pro 


Asn 


Gin 


Asn 


Gin 


Thr Arg Asn Cys 




Gin 


20 








25 








30 






Asn Tyr Leu Asp Phe 


His 


Arg 


Cys 


Gin 


Lys 


Ala Met 


Thr 


Ala 


Lys 


Gly 


35 






40 








45 








Gly Asp He Ser Val 


Cys 


Glu 


Trp Tyr 


Gin 


Arg Val 


Tyr 


Gin 


Ser 


Leu 


50 




55 








60 










Cys Pro Thr Ser Trp Val 


Thr 


Asp Trp Asp 


Glu Gin Arg Ala 


Glu 


Gly 


65 


70 










75 








80 


Thr Phe Pro Gly Lys 


He 




















85 






















<210> 317
<211> 105
<212> PRT
<213> Homo sapiens

<400> 317






















Met Cys Val Gly Leu 


Leu 


Met 


Val 


Ser 


He 


Leu Thr 


Asp 


He 


Asn 


Leu 


1 5 










10 








15 




Ser Asn Leu Val Ala 


Val 


Gin 


Tyr Ser 


Pro 


Asp Tyr 


Cys 


Asn 


Phe 








20 








25 










30 






Lys Arg Ser Asp Lys 


Asn 


Gin 


Asp 


Ala 


Ser 


Thr 


Phe 


Cys 


His 


Asn 


Cys 


35 






40 










45 








Asn Gin Phe His Leu 


Cys 


Leu 


Gin 


Tyr His 


His 




He 


Val 


Leu 


Pro 


50 














60 










Trp Ser Val Leu Ala 


lie 


Leu 


Ser 


Gin 


Leu 


Phe 


Leu 


His 


He 


Thr 


Ser 


65 


70 










75 










80 


Arg lie Arg Pro Thr 


lie 


Tyr 


Lys 


Ser 


Asn 


Lys 


Pro 


Lys 


Ser 


He 


Glu 


85 










90 










95 




lie Leu lie Gly Phe 


Val 


Leu 


Asn 


Leu 
















100 








105 
















<210> 318
<211> 101
<212> PRT
<213> Homo sapiens

<400> 318
























Met Glu Arg Pro Asp 




Ala 


Ala 


Leu 


Asn 


Ala 


Leu 


Gin 


Pro 


Pro 


Glu 


1 5 










10 










15 




Phe Arg Asn Glu Ser 


Ser 


Leu 


Ala 


Ser 


Thr 


Leu 


Lys 


Thr 


Leu 


Leu 


Phe 


20 








25 










30 






Phe Thr Ala Leu Met 


He 


Thr 


Val 


Pro 


He 


Gly Leu 


Tyr 


Phe 


Thr 


Thr 


35 






40 










45 








Lys Ser Tyr lie Phe 


Glu Gly Ala 


Leu Gly 


Met 


Ser Asn Arg 


Asp 


Ser 






55 










60 










Tyr Phe Tyr Ala Ala 


He 


Val 


Ala 


Val 


Val 


Ala 


Val 


His 


Val 


Val 


Leu 


65 


70 










75 










80 


Ala Leu Phe Val Tyr 


Val 


Ala 


Trp 


Asn 


Glu 


Gly Ser Arg Gin 


Trp Arg 


85 










90 










95 




























100
























<210> 319
<211> 92
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 29
<223> Xaa = Gly, Arg

<220>
<221> UNSURE
<222> 28
<223> Xaa = Lys, Asn, Arg, Ser

<400> 319
























Met Phe Arg Asp Phe 


Gly Arg Arg 


Leu 


Gin 


Arg Asp 


Leu 


Lys 


Arg 


Val 


1 5 










10 










15 




Val Asp Ala Arg Leu 




Leu 


Ser 


Glu 


Glu 




Xaa 


Xaa 


Gly 


Arg 


He 


20 








25 










30 






Lys Pro Lys Pro Val 


Glu 


Val 


Gin 


Val 


Val 


Thr 


His 


His 


Met 


Gin Arg 


35 






40 










45 








Tyr Ala Val Trp Phe 


Gly 


Gly Ser 


Met 


Leu 


Ala 


Ser 


Thr 


Pro 


Glu 


Phe 


50 




55 










60 










Phe Gin Val Cys His 


Thr 


Lys 


Lys 


Asp Tyr 


Glu 


Glu 


Tyr 


Gly 


Pro 


Ser 


65 


70 










75 










80 


lie Cys Arg His Asn 


Pro 


Val 


Phe 


Gly Val 


Met 


Ser 










85 










90 















<210> 320 
<211> 92

<212> PRT 

<213> Homo sapiens 

<400> 320 



Met Phe Arg Asp 


Phe 


Gly Arg 


Arg Leu 


Gin 




Leu Lys 


Arg Val 


1 


5 










10 








15 


Val Asp Ala Arg 


Leu Arg 


Leu 


Ser 


Glu 


Glu 


Leu 


Ser 


Gly Gly Arg He 


20 










25 








30 




Lys Pro Lys Pro 


Val 


Glu 


Val 


Gin 




Val 


Thr 


His 


His Met 


Gin Arg 


35 








40 










45 




Tyr Ala Val Trp 


Phe 


Gly 


Gly 


Ser 


Met 


Leu 


Ala 


Ser 


Thr Pro 


Glu Phe 


50 






55 










60 






Phe Gin Val Cys 


His 


Thr 


Lys 


Lys 


Asp 


Tyr 


Glu 


Glu 


Tyr Gly 


Pro Ser 


65 




70 










75 






80 


lie Cys Arg His 


Asn 


Pro 


Val 


Phe 


Gly 


Val 


Met 


Ser 








85 










90 










<210> 321
<211> 98
<212> PRT
<213> Homo sapiens

<400> 321






















Met Ser Val Ser 


Phe 


His 








Lys 


Glu 


Leu 


Trp Thr 


Trp Met 


1 


5 










10 








15 


Glu Asp Leu Gin 




Glu 


Met 


Leu 


Glu 


Asp 


Val 


Cys 


Ala Asp 


Ser Val 


20 










25 












Asp Ala Val Gin 


Glu 


Leu 


He 


Lys 


Gin 


Phe 


Gin 


Gin 


Gin Gin 


Thr Ala 


35 


















45 




Thr Leu Asp Ala 


Thr 


Leu 


Asn 


Val 


He 




Glu 


Gly 


Glu Asp 


Leu He 


50 






55 










60 






Gin Gin Leu Arg 


Ser 


Ala 


Pro 


Pro 


Ser 


Leu 


Gly Glu 


Pro Ser 


Glu Ala 


65 




70 










75 






80 


Arg Ser Ala Trp 


Ala 


Glu 


Leu 


Ser 


Ser 


Gly Lys 


Cys 


Leu Gly 


Leu Asp 




85 










90 








95 


Val Arg 






















<210> 322
<211> 89
<212> PRT
<213> Homo sapiens

<400> 322

Met Val Ser Gly Asp Gly Phe Leu Val Ser Arg Pro Glu Ala Ile His
1 5 10 15

Leu Gly Pro Arg Gln Ala Val Arg Pro Ser Val Arg Ala Glu Ser Arg
20 25 30

Arg Val Asp Gly Gly Gly Arg Ser Pro Arg Glu Pro Asp Gly Arg Gly
35 40 45

Arg Ser Arg Gln Ala Arg Phe Ser Pro Tyr Pro Ile Pro Ala Val Glu
50 55 60

Pro Asp Leu Leu Arg Ser Val Leu Gln Gln Arg Leu Ile Ala Leu Gly
65 70 75 80

Gly Val Ile Ala Ala Arg Ile Ser Val
85



<210> 323
<211> 123
<212> PRT
<213> Homo sapiens

<400> 323

Met Ala Leu Asn Asn Val Ser Leu Ser Ser Gly Asp Gln Arg Ser Arg
1 5 10 15

Val Ala Tyr Arg Ser Ser His Gly Asp Leu Arg Pro Arg Ala Ser Ala
20 25 30

Leu Ala Met Val Ser Gly Asp Gly Phe Leu Val Ser Arg Pro Glu Ala
35 40 45

Ile His Leu Gly Pro Arg Gln Ala Val Arg Pro Ser Val Arg Ala Glu
50 55 60

Ser Arg Arg Val Asp Gly Gly Gly Arg Ser Pro Arg Glu Pro Asp Gly
65 70 75 80

Arg Gly Arg Ser Arg Gln Ala Arg Phe Ser Pro Tyr Pro Ile Pro Ala
85 90 95

Val Glu Pro Asp Leu Leu Arg Ser Val Leu Gln Gln Arg Leu Ile Ala
100 105 110

Leu Gly Gly Val Ile Ala Ala Arg Ile Ser Val
115 120







<210> 324
<211> 172
<212> PRT
<213> Homo sapiens

<400> 324

Met Ala Gly Ala Ala Thr Gln Ala Ser Leu Glu Ser Ala Pro Arg Ile
1 5 10 15

Met Arg Leu Val Ala Glu Cys Ser Arg Ser Arg Ala Arg Ala Gly Glu
20 25 30

Leu Trp Leu Pro His Gly Thr Val Ala Thr Pro Val Phe Met Pro Val
35 40 45

Gly Thr Gln Ala Thr Met Lys Gly Ile Thr Thr Glu Gln Leu Asp Ala
50 55 60

Leu Gly Cys Arg Ile Cys Leu Gly Asn Thr Tyr His Leu Gly Leu Arg
65 70 75 80

Pro Gly Pro Glu Leu Ile Gln Lys Ala Asn Gly Leu His Gly Phe Met
85 90 95

Asn Trp Pro His Asn Leu Leu Thr Asp Ser Gly Gly Phe Gln Met Val
100 105 110

Ser Leu Val Ser Leu Ser Glu Val Thr Glu Glu Gly Val Arg Phe Arg
115 120 125

Ser Pro Tyr Asp Gly Asn Glu Thr Leu Leu Ser Pro Glu Lys Ser Val
130 135 140

Gln Ile Gln Asn Ala Leu Gly Ser Asp Ile Ile Met Gln Leu Asp Asp
145 150 155 160

Val Val Ser Ser Thr Val Thr Gly Pro Arg Val Glu
165 170



<210> 325 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 325 

Met Ala Ile Lys Phe Leu Glu Val Ile Lys Pro Phe Cys Val Ile Leu
1 5 10 15

Gln Leu Thr Ser
20 



<210> 326 
<211> 23 
<212> PRT 

<213> Homo sapiens 



<400> 326 
Phe Leu Asn Asn Ser Ser Pro Gln Glu Pro Leu Lys Leu Pro Pro Gly
1 5 10 15

Gln Leu Ser Ser Thr His Arg



<210> 327 
<211> 45 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = Phe, Ser 



<400> 327 

Met Glu Ser Asp Asp Ala Arg Ala Ala Arg Val Lys Glu Leu Leu Val
1 5 10 15

Asp Cys Tyr Lys Pro Thr Glu Ala Phe Ile Xaa Gly Leu Leu Asp Lys
20 25 30

Ile Arg Gly Met Gln Lys Leu Ser Thr Pro Gln Lys Lys



<210> 328
<211> 109
<212> PRT
<213> Homo sapiens

<400> 328

Met Ala Ala Ser Ala Ala Arg Gly Ala Ala Ala Leu Arg Arg Ser Ile
1 5 10 15

Asn Gln Pro Val Ala Phe Val Arg Arg Ile Pro Trp Thr Ala Ala Ser
20 25 30

Ser Gln Leu Lys Glu His Phe Ala Gln Phe Gly His Val Arg Arg Cys
35 40 45

Ile Leu Pro Phe Asp Lys Glu Thr Gly Phe His Arg Gly Leu Gly Trp
50 55 60

Val Gln Phe Ser Ser Glu Glu Gly Leu Arg Asn Ala Leu Gln Gln Glu
65 70 75 80

Asn His Ile Ile Asp Gly Val Lys Val Gln Val His Thr Arg Arg Pro
85 90 95

Lys Leu Pro Gln Thr Ser Asp Asp Glu Lys Lys Asp Phe
100 105





<210> 329 
<211> 51 
<212> PRT 

<213> Homo sapiens 



<400> 329 

Met Ala Ala Ser Ala Ala Arg Gly Ala Ala Ala Leu His Lys Tyr Gln
1 5 10 15

Ser Ala Gly Cys Phe Cys Glu Lys Asn Ser Leu Asp Cys Gly Val Glu
20 25 30

Ser Ala Glu Arg Thr Leu Cys Thr Val Arg Pro Cys Gln Lys Val His
35 40 45

Phe Thr Phe
50



<210> 330 
<211> 99 
<212> PRT 

<213> Homo sapiens 
<400> 330

Pro Val Arg Gly Leu Asn Met Ser Glu Phe Val Cys Asn Leu Ser Ala
1 5 10 15

Arg Pro Tyr Val Tyr Asp Leu Ile Ala Val Ser Asn His Tyr Gly Ala

20 25 30 

Met Gly Val Gly His Tyr 
35 

<210> 331 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 331 

Met Leu Gly Phe Ala Lys Asn Trp Ile Phe Ser Ile Thr
1 5 10

<210> 332 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 332 

Met Leu Gly Phe Ala Lys Asn Trp Ile Phe Ser Ile Thr
1 5 10 

<210> 335 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 335 

Met Leu Gly Phe Ala Lys Asn Trp Ile Phe Ser Ile Thr
1 5 10

<210> 336 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 336 

Met Leu Gly Phe Ala Lys Asn Trp Ile Phe Ser Ile Thr
1 5 10

<210> 337 
<211> 109 
<212> PRT 
<213> Homo sapiens 



<400> 337 



Met 


Ala 


Ala 


Ser 


Ala 


Ala 


Arg 


Gly 


Ala 


Ala 


Ala 


Leu 


Arg Arg Ser He 


1 








5 










10 








15 


Asn 


Gin 


Pro 


Val 


Ala 


Phe 


Val Arg Arg Xaa 


Pro 


Trp 


Thr 


Ala Ala Ser 








20 










25 










30 


Ser 


Gin 


Leu 


Lys 


Glu 


His 


Phe 


Ala 


Gin 


Phe 


Gly 


His 


Val 


Arg Arg Cys 






35 










40 










45 




lie 


Leu 


Pro 


Phe 






Glu 


Thr 


Gly 


Phe 


His 


Arg 


Gly 


Leu Gly Trp 




50 










55 










60 






Val 


Gin 


Phe 


Ser 


Ser 


Glu 


Glu 


Gly 






Asn 


Ala 


Leu 


Gin Gin Glu 


65 










70 










75 






80 


Asn 


His 


He 


He 


Asp 


Gly Val 


Lys 


Val 


Gin 


Val 


His 


Thr 


Arg Arg Pro 










85 










90 








95 
<210> 338

<211> 59 

<212> PRT 

<213> Homo sapiens 



<400> 338 
















Met Lys Phe Gly Cys Leu Ser 


Phe 


Arg 


Gin 


Pro Tyr Ala 


Gly 


Phe 


Val 


1 5 






10 






15 




Leu Asn Gly He Lys Thr Val 


Glu 


Thr 


Arg 


Trp Arg Pro 




Leu 


Ser 


20 




25 






30 






Ser Gin Arg Asn Cys Thr lie 


Ala 


Val 


His 


He Ala His 


Arg 


Asp 


Trp 


35 


40 






45 








Glu Gly Asp Ala Cys Arg Glu 


Leu 


Leu 


Val 


Glu 









50 55 



<210> 339 

<211> 506 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 88..471

<220>
<221> sig_peptide
<222> 88..189
<223> Von Heijne matrix
score 13.6000003814697
seq VLLLLSLLLTALA/SS

<400> 339

acttatctgc agacttgtag gcagcaactc accctcactc agaggtcttc tggttctgga 60
aacaactcta gctcagcctt ctccacc atg agc ctc aga ctt gat acc acc cct 114
Met Ser Leu Arg Leu Asp Thr Thr Pro
-30



tcc tgt aac agt gcg aga cca ctt cat gcc ttg cag gtg ctg ctg ctt 162
Ser Cys Asn Ser Ala Arg Pro Leu His Ala Leu Gln Val Leu Leu Leu
-25 -20 -15 -10

ctg tca ttg ctg ctg act gct ctg gct tcc tcc acc aaa gga caa act 210
Leu Ser Leu Leu Leu Thr Ala Leu Ala Ser Ser Thr Lys Gly Gln Thr
-5 1 5

aag aga aac ttg gcg aaa ggc aaa gag gaa agt cta gac agt gac ttg 258
Lys Arg Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu
10 15 20

tat gct gaa ctc cgc tgc atg tgt ata aag aca acc tct gga att cat 306
Tyr Ala Glu Leu Arg Cys Met Cys Ile Lys Thr Thr Ser Gly Ile His
25 30 35












ccc 


aaa 


aac 


ate 


caa 


agt 


ttg 


gaa 


gtg 


ate 


ggg 




gga 


acc 


cat 


tge 


354 


Pro 




Asn 


He 


Gin 


Ser 




Glu 


Val 


He 


Gly 




Gly 


Thr 


His 


Cys 




40 










45 










50 










55 




aac caa gtc gaa gtg ata gcc aca ctg aag gat ggg agg aaa atc tgc 402
Asn Gln Val Glu Val Ile Ala Thr Leu Lys Asp Gly Arg Lys Ile Cys
60 65 70

ctg gac cca gat gct ccc aga atc aag aaa att gta cag aaa aaa ttg 450
Leu Asp Pro Asp Ala Pro Arg Ile Lys Lys Ile Val Gln Lys Lys Leu
75 80 85

gca ggt gat gaa tct gct gat taatttgttc tgtttctgcc aaacttcttt 501
Ala Gly Asp Glu Ser Ala Asp
90

aactc 506



<210> 340 
<211> 523 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 41. .235 

<220> 

<221> sig_peptide 
<222> 41. .94 
<223> Von Heijne matrix 



jcore 12.1999998092651 
seq LTLAVLFLTGSQA/EUl 



<400> 340 

ttagagactg cgagaaggag gtcccccacg gcccttcagg atg aaa get gcg gtg 

Met Lys Ala Ala Val 



ctg acc ttg gcc gtg ctc ttc ctg acg ggg agc cag gct cgg cat ttc 103
Leu Thr Leu Ala Val Leu Phe Leu Thr Gly Ser Gln Ala Arg His Phe
-10 -5 1




tgg cag caa gat gaa ccc ccc cag agc ccc tgg gat cga gtg aag gac 151
Trp Gln Gln Asp Glu Pro Pro Gln Ser Pro Trp Asp Arg Val Lys Asp
5 10 15




ctg gcc act gtg tac gtg gat gtg ctc aaa gac agc ggc aga gac tat 199
Leu Ala Thr Val Tyr Val Asp Val Leu Lys Asp Ser Gly Arg Asp Tyr
20 25 30 35




gtg tcc cag ttt gaa ggc tcc gct tgg gaa aac agc taaacctaaa 245
Val Ser Gln Phe Glu Gly Ser Ala Trp Glu Asn Ser
40 45







gctccttgac aactgggaca gcgtracctc crccttcagc aakctgcgcg aacagctcgg 305 

ccctgtgacc caggagttct gggataacct ggaaaaggag acagagggcc tgaggcagga 365 

gatgagcaag gatctggagg aggtgaaggc caasgtgcag ccctacctgg acgacttcca 425 

gaagaagtgg caggaggaga tggagctcta ccgccagaag gcagctttct taactatcct 485 

aacaagcctt ggaccaaatg gaaataaagc tttttgat 523

<210> 341 
<211> 580 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 134..373

<220>
<221> sig_peptide
<222> 134..211
<223> Von Heijne matrix
score 12
seq LLLALLLPTQIYS/SE
<400> 341 

atttgtggct ttcctggtat ataaggtctc gccggctcgc cgcgctcccc accttgcctg 60
cgcccgcccg gagccagcgg ttctccaagc acccagcatc ctgctagacg cgccgcgcac 120
cgacggaggg gac atg ggc aga gca atg gtg gcc agg ctc ggg ctg ggg 169
Met Gly Arg Ala Met Val Ala Arg Leu Gly Leu Gly 
-25 -20 -15 

ctg ctg ctg ctg gca ctg ctc cta ccc acg cag att tat tcc agt gaa 217
Leu Leu Leu Leu Ala Leu Leu Leu Pro Thr Gln Ile Tyr Ser Ser Glu
-10 -5 1 

aca aca act gga act tca agt aac tcc tcc cag agt act tcc aac tct 265
Thr Thr Thr Gly Thr Ser Ser Asn Ser Ser Gln Ser Thr Ser Asn Ser
5 10 15

ggg ttg gcc cca aat cca act aat gcc acc acc aag gyg gct ggt ggt 313

Gly Leu Ala Pro Asn Pro Thr Asn Ala Thr Thr Lys Xaa Ala Gly Gly 



35 40 45 50 

cat ctc tac tct taagagactc aggccaagaa acgtcttcta aatttcccca 413
His Leu Tyr Ser

tcttctaaac ccaatccaaa tggcgtctgg aagtccaatg tggcaaggaa aaacaggtct 473
tcatcgaatc tactaattcc acacctttta ttgacacaga aaatgttgag aatcccaaat 533
ttgattgatt tgaagaacat gtgagaggkt tgactagatg atggatg 580



<210> 342 
<211> 508 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 76..378

<220>
<221> sig_peptide
<222> 76..156
<223> Von Heijne matrix
score 11.8000001907349
seq LFLGLLLLPLVVA/FA



<400> 342 

aattggccac agagacccag cccgagtttc ccatcgcact gagcactgag atcctgctgg 60 

aagctctgcc gcagc atg agc tcc gca gcc ggg ttc tgc gcc tca cgc ccc 111
Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro
-25 -20



ggg ctg ctg ttc ctg ggg ttg ctg ctc ctg cca ctt gtg gtc gcc ttc 159
Gly Leu Leu Phe Leu Gly Leu Leu Leu Leu Pro Leu Val Val Ala Phe
-15 -10 -5 1




gcc agc gct gaa gct gaa gaa gat ggg gac ctg cag tgc ctg tgt gtg 207
Ala Ser Ala Glu Ala Glu Glu Asp Gly Asp Leu Gln Cys Leu Cys Val
5 10 15




aag acc acc tcc cag gtc cgt ccc agg cac atc acc agc ctg gag gtg 255
Lys Thr Thr Ser Gln Val Arg Pro Arg His Ile Thr Ser Leu Glu Val
20 25 30




atc aag gcc gga ccc cac tgc ccc act gcc caa ctg ata gcc acg ctg 303
Ile Lys Ala Gly Pro His Cys Pro Thr Ala Gln Leu Ile Ala Thr Leu
35 40 45




aag aat gga agg aaa att tgc ttg gac ctg caa gcc ccg ctg tac aag 351
Lys Asn Gly Arg Lys Ile Cys Leu Asp Leu Gln Ala Pro Leu Tyr Lys
50 55 60 65






ata 


att 


aag 






ttg 


gag 


agt 


tagctactag ctgcctacgt




398 


Lys 


He 


He 


Lys 




Leu 


Leu 


Glu 


Ser 



















70 



gtgtgcattt gctatatagc atacttcttt tttccagttt caatctaact gtgaaagaac 458
ttctgatatt tgtgttatcc ttatgatttt aaataaacaa aataaatcta 508



<210> 343 
<211> 459 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS
<222> 75..332

<220>
<221> sig_peptide
<222> 75..128
<223> Von Heijne matrix
score 11.6999998092651
seq VLLLLLLVEQAAA/LG

<400> 343 

gactctttgg tcagggaact gctcgctgag cacagctgca cagtgctgtc agaacggccg 
atctccagcc caag atg att cca gca gtg gtc ttg ctc tta ctc ctt ttg
Met Ile Pro Ala Val Val Leu Leu Leu Leu Leu Leu
-15 -10

gtt gaa caa gca gcg gcc ctg gga gag cct cag ctc tgc tat atc ctg
Val Glu Gln Ala Ala Ala Leu Gly Glu Pro Gln Leu Cys Tyr Ile Leu
-5 1 5 10





gee 


ate 


ctg 


ttt 


ctg 


tat 


gga 


att 


gtc 




ace 






tac 


tgt 


206 


Asp 


Ala 


He 


Leu 


Phe 
15 


Leu 


Tyr 


Gly 


He 


Val 
20 


Leu 


Thr 






Tyr 
25 


Cys 






ctg 


aag 


ate 




gtg 




aag 


gca 


get 


ata 


ace 


age 


tat 


gag 


aaa 


254 


Arg 




Lys 


He 

30 


Gin 


Val 


Arg 


Lys 


Ala 

35 


Ala 


He 


Thr 


Ser 


Tyr 
40 


Glu 


Lys 




tea 




ggt 


gtt 


tac 


acg 


ggc 


ctg 


age 


ace 


agg 


aac 


cag 


gag 


act 


tac 


302 


Ser 


Asp 


Gly 
45 


Val 


Tyr 


Thr 


Gly 


Leu 
50 


Ser 


Thr 


Arg 


Asn 


Gin 
55 


Glu 


Thr 


Tyr 




gag act ctg aag cat gag aaa cca cca cag tagccttaga atagatgcgg 352
Glu Thr Leu Lys His Glu Lys Pro Pro Gln

















tcatattctt ctttggcttc tggttcttcc agccctcatg tttggcatca catatgcctg 412 
catgccatta acaccagctg gccctacccc tataatgatc ctgtgtc 459 

<210> 344 
<211> 553 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 160..435

<220>
<221> sig_peptide
<222> 160..231
<223> Von Heijne matrix
score 11.6999998092651
seq FVLGLGLTPPTLA/QD

<400> 344 

agatgcaaag caggattcaa aagaacatct ttgcgttttc taccggctcc ccatcatcgt 60
actagggagg aagaagcggg tgagaaacaa aacttctttc cattgtcctg cccgtttctg 120
cggacttgtt ctgaggccga ggagcctgtg ttggaagag atg gtg atg ggc ctg 174
Met Val Met Gly Leu
-20

ggc gtt ttg ttg ttg gtc ttc gtg ctg ggt ctg ggt ctg acc cca ccg 222 
Gly Val Leu Leu Leu Val Phe Val Leu Gly Leu Gly Leu Thr Pro Pro 

-15 -10 -5 

acc ctg gct cag gat aac tcc agg tac aca cac ttc ctg acc cag cac 270
Thr Leu Ala Gln Asp Asn Ser Arg Tyr Thr His Phe Leu Thr Gln His
tat gat gcc aaa cca cag ggc cgg gat gac aga tac tgt gaa agc atc 318
Tyr Asp Ala Lys Pro Gln Gly Arg Asp Asp Arg Tyr Cys Glu Ser Ile
15 20 25










atg agg aga cgg ggc ctg acc tca ccc tgc aaa gac atc aac aca ttt 366
Met Arg Arg Arg Gly Leu Thr Ser Pro Cys Lys Asp Ile Asn Thr Phe
30 35 40 45




att 


cat 


ggc 




aag 


cgc 


aga 


tea agg cea tet 


gtg 


aaa 


aca 


aga atg 


414 


He 


His 


Gly 




Lys 


Arg Arg 


Ser Arg Pro Ser 


Val 


Lys 


Thr 


Arg Met 












50 






55 








60 




gaa acc ctc aca gag aaa acc taagaataag caagtcttct ttccaggtca 465
Glu Thr Leu Thr Glu Lys Thr
65















ccacttgcaa gctacatgga ggttcccctg gcctccatgc cagtaccgag ccacagcggg 525
gttcagaaac gttgttgttg cttgtgaa 553



<210> 345 
<211> 499 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 39..371

<220>
<221> sig_peptide
<222> 39..86
<223> Von Heijne matrix
score 11.5
seq ILLSVALLAFSSA/QD
<400> 345 

agcataaagt tgggagtgac accagagcct tctgcaag atg ctt ctg att ctg ctg 56 

Met Leu Leu Ile Leu Leu
-15 



tca gtg gcc ctg ctg gcc ttc agc tca gct cag gac tta gat gaa gat 104
Ser Val Ala Leu Leu Ala Phe Ser Ser Ala Gln Asp Leu Asp Glu Asp
-10 -5 1 5






gtc agc caa gaa gac gtt ccc ttg gta ata tca gat gga gga gac tct 152
Val Ser Gln Glu Asp Val Pro Leu Val Ile Ser Asp Gly Gly Asp Ser
10 15 20








gag cag ttc ata gat gag gag cgt cag gga cca cct ttg gga gga cag 200
Glu Gln Phe Ile Asp Glu Glu Arg Gln Gly Pro Pro Leu Gly Gly Gln
25 30 35










caa 


tct 


caa 


ccc 


tct 


get 


ggt 


gat 


ggg 




cag 


rat 




ggc 


cct 


cag 


248 


Gin 


Ser 


Gin 


Pro 


Ser 


Ala 


Gly 


Asp 


Gly 


Asn 


Gin 


Xaa 


Asp 


Gly 


Pro 


Gin 






40 










45 










50 












cag 


gga 


oca 


ccc 


caa 




gga 


ggc 


cag 


cag 


caa 


caa 


ggt 


cea 


cea 


cct 


296 


Gin 


Gly 


Pro 


Pro 


Gin 


Gin 


Gly 


Gly 


Gin 


Gin 


Gin 


Gin 


Gly 


Pro 


Pro 


Pro 




55 










60 










65 










70 




cct cag gga aag cca car gga cca cct ccc caa ggg ggc cgc cca caa 344
Pro Gln Gly Lys Pro Gln Gly Pro Pro Pro Gln Gly Gly Arg Pro Gln
75




gga cct cca cag ggg cag tct cct cag taatctagga ttcaatgaca 391
Gly Pro Pro Gln Gly Gln Ser Pro Gln
90 95



ggaagtgaat aagaagatga cagtgtttca aatgccttga aacataatgt gatcatgctc 451
taacttcaat ataccaataa aataatcagc ttgcaaaaaa aaaaaaaa 499

<210> 346 
<211> 398 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS
<222> 36..353

<220>
<221> sig_peptide
<222> 36..83
<223> Von Heijne matrix
score 11.5
seq ILLSVALLAFSSA/QD
<400> 346 

ataaagttgg gagtgacacc agagccttct gcaag atg ctt ctg att ctg ctg 53 

Met Leu Leu Ile Leu Leu
-15












tca gtg gcc ctg ctg gcc ttc agc tca gct cag gac tta gat gaa gat 101
Ser Val Ala Leu Leu Ala Phe Ser Ser Ala Gln Asp Leu Asp Glu Asp
-10 -5 1 5






gtc 


age 


caa 


gaa 


gac 


gtt 


ccc 


ttg 


gta 






gat 


gga 


gga 


gac 


tct 


149 


Val 


Ser 


Gin 


Glu 
10 


Asp 


Val 


Pro 


Leu 


Val 
15 


He 


Ser 


Asp 


Gly 


Gly 
20 


Asp 


Ser 




gag cag ttc ata gat gag gag cgt cag gga cca cct ttg gga gga cag 197
Glu Gln Phe Ile Asp Glu Glu Arg Gln Gly Pro Pro Leu Gly Gly Gln
25 30 35




caa tct caa ccc tct gct ggt gat ggg aac cag gat gat ggc cct cag 245
Gln Ser Gln Pro Ser Ala Gly Asp Gly Asn Gln Asp Asp Gly Pro Gln
40 45 50




cag gga cca ccc caa caa gga ggc cag cag caa caa ggt cca cca cct 293
Gln Gly Pro Pro Gln Gln Gly Gly Gln Gln Gln Gln Gly Pro Pro Pro
55 60 65 70




cct 


cag 


gga 


aag 




caa 


gga 


cca 


ccc 


caa 


cag 


gga 


ggc 


cag 


tct 


tgt 


341 


Pro 


Gin 


Gly 


Lys 


Pro 
75 


Gin 


Gly 


Pro 


Pro 


Gin 
80 


Gin 


Gly 


Gly 


Gin 


Ser 
85 


Cys 




tgc tgt gac aag taacagcctt ttttttaaat tgttttttat acaaaaaaaa aaaaa 398
Cys Cys Asp Lys
90





























<210> 347 
<211> 525 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 48..470

<220>
<221> sig_peptide
<222> 48..119
<223> Von Heijne matrix
score 10.8999996185303
seq LAGLCCLVPVSLA/ED

<400> 347 

ctcctcagct tcaggcacca ccactgacct gggacagtga atcgaca atg ccg tct 56
Met Pro Ser

tct gtc tcg tgg ggc atc ctc ctg ctg gca ggc ctg tgc tgc ctg gtc 104
Ser Val Ser Trp Gly Ile Leu Leu Leu Ala Gly Leu Cys Cys Leu Val
-20 -15 -10

cct gtc tcc ctg gct gag gat ccc cag gga gat gct gcc cag aag aca 152
Pro Val Ser Leu Ala Glu Asp Pro Gln Gly Asp Ala Ala Gln Lys Thr
-5 1 5 10
gat


aea 


tee 




cat 


gat 


cag 










ttc 


aae 


aag 


ate 


acc 


200 


Asp 


Thr 


Ser 


His 
15 


His 


Asp 


Gin 


Asp 


His 
20 


Pro 


Thr 


Phe 


Asn 


Lys 
25 


He 


Thr 




ccc 


aae 


ctg 


get 


gag 


ttc 


gee 


ttc 


age 


eta 


tac 


cge 




ctg 


gca 


cac 


248 


Pro 


Asn 


Leu 
30 


Ala 


Glu 


Phe 


Ala 


Phe 
35 


Ser 


Leu 


Tyr 


Arg 


Gin 
40 


Leu 


Ala 


His 




cag 


tee 


aae 


age 


aee 


aat 


ate 


ttc 


ttc 


tec 


eca 


gtg 


age 




get 


aea 


296 


Gin 


Ser 
45 


Asn 


Ser 


Thr 


Asn 


He 
50 


Phe 


Phe 


Ser 


Pro 


Val 
55 


Ser 


He 


Ala 


Thr 




gcc ttt gca atg ctc tcc ctg ggg acc aag gct gac act cac gat gaa 344
Ala Phe Ala Met Leu Ser Leu Gly Thr Lys Ala Asp Thr His Asp Glu
60 65 70 75




atc ctg gag ggc ctg aat ttc aac ctc acg gag att ccg gag gct cag 392
Ile Leu Glu Gly Leu Asn Phe Asn Leu Thr Glu Ile Pro Glu Ala Gln
80 85 90




atc cat gaa ggc ttc cag gaa ctc ctc cgt acc ctc aac cag cca gac 440
Ile His Glu Gly Phe Gln Glu Leu Leu Arg Thr Leu Asn Gln Pro Asp
95 100 105








age 




etc 


cag 


ctg 




ace 


ggc 


aag 


aat 


taatgctttc ttaactaagt 


490 


Ser 


Gin 


Leu 


Gin 


Leu 


Thr 


Thr 


Gly 


Lys 


Asn 

















110 115 
ttactttgag tattaacaag tagcttatac tgagg 525 



<210> 348 
<211> 500 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 91..462

<220>
<221> sig_peptide
<222> 91..180
<223> Von Heijne matrix
score 9.89999961853027
seq LLFGFTLVSGTGA/EK

<400> 348 

acccctaccc cacgccccct cccgcgcgcg cggttaaatc cccgcacctg agcatcggct 60
cacacctgca ccccgcccgg gcatagcacc atg cct gct tgt cgc cta ggc ccg 114
Met Pro Ala Cys Arg Leu Gly Pro 
-30 -25 



eta 


gcc 


gcc 


gcc 


etc 


etc 


etc 


age 


ctg 


ctg 




ttc 


ggc 


ttc 




eta 


162 


Leu 


Ala 


Ala 
-20 


Ala 


Leu 


Leu 


Leu 


Ser 
-15 


Leu 


Leu 


Leu 


Phe 


Gly 
-10 


Phe 


Thr 


Leu 




gtc 


tea 


ggc 




gga 


gca 


gag 


aag 


act 


ggc 


gtg 


tgc 


ccc 


gag 


etc 


cag 


210 


Val 


Ser 
-5 


Gly 


Thr 


Gly 


Ala 


Glu 
1 


Lys 


Thr 


Gly 


Val 
5 


Cys 


Pro 


Glu 




Gin 
10 




gct gac cag aac tgc acg caa gag tgc gtc tcg gac agc gaa tgc gcc 258
Ala Asp Gln Asn Cys Thr Gln Glu Cys Val Ser Asp Ser Glu Cys Ala
15 20 25




gac aac ctc aag tgc tgc agc gcg ggc tgt gcc acc ttc tgc tct ctg 306
Asp Asn Leu Lys Cys Cys Ser Ala Gly Cys Ala Thr Phe Cys Ser Leu
30 35 40




ccc aat gat aag gag ggt tcc tgc ccc cag gtg aac att aac ttt ccc 354
Pro Asn Asp Lys Glu Gly Ser Cys Pro Gln Val Asn Ile Asn Phe Pro
45 50 55




cag ctc ggc ctc tgt cgg gac cag tgc cag gtg gac agc cag tgt cct 402
Gln Leu Gly Leu Cys Arg Asp Gln Cys Gln Val Asp Ser Gln Cys Pro
60 65 70




ggc 


cag 






tgc 


tgc 


cgc 


aat 


ggc 


tgt 


ggg 


aag 


gtg 


tec 


tgt 


gtc 


450 
Gly Gln Met Lys Cys Cys Arg Asn Gly Cys Gly Lys Val Ser Cys Val
75 80 85 90 

act ccc aat ttc tgagctccag ccaccaccag gctgagcagt gaggagag 500 

Thr Pro Asn Phe 

<210> 349 
<211> 519 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 28..399

<220>
<221> sig_peptide
<222> 28..117
<223> Von Heijne matrix
score 9.89999961853027
seq LLFGFTLVSGTGA/EK

<400> 349 

acctgcaccc cgcccgggca tagcacc atg cct gct tgt cgc cta ggc ccg cta 54
Met Pro Ala Cys Arg Leu Gly Pro Leu 
-30 -25 



gcc gcc gcc ctc ctc ctc agc ctg ctg ctg ttc ggc ttc acc cta gtc 102
Ala Ala Ala Leu Leu Leu Ser Leu Leu Leu Phe Gly Phe Thr Leu Val
-20 -15 -10




tea 


ggc 




gga 


gea 


gag 


aag 


act 


ggc 


gtg 


tgc 


ccc 


gag 


etc 


cag 


get 


150 


Ser 


Gly 


Thr 


Gly 


Ala 


Glu 


Lys 


Thr 


Gly 


Val 


Cys 


Pro 


Glu 


Leu 


Gin 


Ala 




-5 










1 








5 










10 






gac cag aac tgc acg caa gag tgc gtc tcg gac agc gaa tgc gcc gac 198
Asp Gln Asn Cys Thr Gln Glu Cys Val Ser Asp Ser Glu Cys Ala Asp
15 20 25




aac 


ate 


aag 


tgc 


tgc 


age 


gcg 


ggc 


tgt 


gcc 


acc 


ttc 


tgc 


tct 


etg 




246 


Asn 


He 


Lys 
30 


Cys 


Cys 


Ser 


Ala 


Gly 
35 


Cys 


Ala 


Thr 


Phe 


Cys 
40 


Ser 


Leu 


Pro 




aat gat aag gag ggt tcc tgc ccc cag gtg aac att aac ttt ccc cag 294
Asn Asp Lys Glu Gly Ser Cys Pro Gln Val Asn Ile Asn Phe Pro Gln
45 50 55




ctc ggc ctc tgt cgg gac cag tgc cag gtg gac agc cag tgt cct ggc 342
Leu Gly Leu Cys Arg Asp Gln Cys Gln Val Asp Ser Gln Cys Pro Gly
60 65 70 75




cag atg aaa tgc tgc cgc aat ggc tgt ggg aag gtg tcc tgt gtc act 390
Gln Met Lys Cys Cys Arg Asn Gly Cys Gly Lys Val Ser Cys Val Thr
80 85 90





ccc aat ttc tgagctccag ccacaggctg agcagtgagg agagaaagtt 439
Pro Asn Phe

tctgcctggc cctgcatctg gttccagccc acctgccctc ccctttttcg gggactctgt 499
attccctctt gggctgacca 519



<210> 350 
<211> 482 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 40..246

<220>
<221> sig_peptide
<222> 40..90



WO 02/083898 PCT/IB01/00914

<223> Von Heijne matrix 
score 9.5 

seq FVFLLFVISLAAA/AH 
<400> 350 

aaaaagtagg acaccggagc tagcagtggc aacccactt atg ttt cct ggc ttt 54
Met Phe Pro Gly Phe
-15

gtt ttt ttg ctc ttt gta ata agt ctt gct gct gct gct cat ctt tgg 102
Val Phe Leu Leu Phe Val Ile Ser Leu Ala Ala Ala Ala His Leu Trp
-10 -5 1




gtc ctg gcc gcc ttt atg ggc cgt atc acc gtg aag gtc tgc agc ttc 150
Val Leu Ala Ala Phe Met Gly Arg Ile Thr Val Lys Val Cys Ser Phe
5 10 15 20

act cct gag gcc agc aag acc gtg agc cca cca gaa gga gcg aac aac 198
Thr Pro Glu Ala Ser Lys Thr Val Ser Pro Pro Glu Gly Ala Asn Asn
25 30 35

tcg aga cgc act gct ttt aag agc tgt aac act cac cac aaa ggt ctg 246
Ser Arg Arg Thr Ala Phe Lys Ser Cys Asn Thr His His Lys Gly Leu
50

tagcatcact cctgaagtca gcgggaccac gaacctacca gaaagaagaa actccataca 306
gatctgaaaa tctgaaggaa caaactccgg acacaccatc tttaagaact gtaacactca 366
ctgcgagggt ccacggcttc atttttgaag tcagccagac caagaaccca ccaattcgcc 426
acacacttct gctgtttcgt tccattctca aggaaaactg tgatgggtcc attttt 482


<210> 351
<211> 462
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 114..344

<220>
<221> sig_peptide
<222> 114..212
<223> Von Heijne matrix
score 9.30000019073486
seq MVLLSFALTLCSA/FW



<400> 351 

wctgctcact gctgggtact gttctgctgt gggtgcccag gaagggacta cacctcttcg 60 
cagtgtttta tacctttggt aatatcgcat caattgggag taccatcttc ctc atg 116

Met

gtg aaa cag ctg aag cga atg ttt gag cct act cgt ttg att 164
Val Lys Gln Leu Lys Arg Met Phe Glu Pro Thr Arg Leu Ile
-30 -25 -20

atc atg gtg ctg ttg agt ttt gca ctt acc ctg tgt tct gcc 212
Ile Met Val Leu Leu Ser Phe Ala Leu Thr Leu Cys Ser Ala
-10 -5

tgg cat aac atg gga ctt gca ctt atc ttc tgc att ttg cag 260
Trp His Asn Met Gly Leu Ala Leu Ile Phe Cys Ile Leu Gln

5 10 15

gca ttg acg tgg tac agc ctt tcc ttc ata cca ttt gca agg 308
Ala Leu Thr Trp Tyr Ser Leu Ser Phe Ile Pro Phe Ala Arg

20 25 30

gtg aag aag tgt ttt gcc gtg tgt ctt gca taattcatgg 354
Val Lys Lys Cys Phe Ala Val Cys Leu Ala
35 40
ccagttttat gaagctttgg aaggcactat ggacagaagc tggtggacag ttttgtaact 414
atcttcgaaa cctctgtctt acagacatgt gccttttatc ttgcagca 462
<210> 352
<211> 496
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 29..466

<220>
<221> sig_peptide
<222> 29..106
<223> Von Heijne matrix
score 8.89999961853027
seq GLLWAFCAPGARA/EE

<400> 352 

cgtctgcttc ggagaccgta agatattg atg acc atg aga tcc ctg ctc aga 52

Met Thr Met Arg Ser Leu Leu Arg 
-25 -20 



acc 


ccc 


ttc 


ctg 


tgt 


ggc 


ctg 


etc 


tgg gee ttt 


tgt 


gee 


cca 


ggc 


gee 


100 


Thr 


Pro 


Phe 


-15 
Met
Ile
Gly


Val 


He 


Asn 
35 
40
45
ttg
292


Gin 


Leu 


His 


Tyr 

50 


Phe 


Lys 


Met 


His 


Asp Tyr Asp 
55 
60
Leu




gat 
tta


gaa 


etc 
gtc


cat 


aag 


gag 
340


Asp 


Gly 


Leu 
65 
Leu
Thr


Ala 
70 


He Thr His 


Val 


His 
75 




Glu 


Glu 




ggg agt gaa cag gca cca cta atg agt gaa gat gaa ctg att aac ata 388
Gly Ser Glu Gln Ala Pro Leu Met Ser Glu Asp Glu Leu Ile Asn Ile
80 85 90




ata gat ggt gtt ttg aga gat gat gac aag aac aat gat gga tac att 436
Ile Asp Gly Val Leu Arg Asp Asp Asp Lys Asn Asn Asp Gly Tyr Ile
100
gac


tat 
ttt
tca


ctg cag tagatgttat ttggcatctc
Glu


Phe 


Ala 


Lys 


Ser 


Leu Gin 















115 120 



ctggttatat 496 

<210> 353 
<211> 499 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 155..340

<220>
<221> sig_peptide
<222> 155..292
<223> Von Heijne matrix
score 8.60000038146973
seq AVLLLILFAIVFG/LL
<400> 353 

cttttcctct caacagttgc ttctttgagt cagggtgcag ctctggtcac ctggcggcct 60 
cttcagctca gccctccaca aagtgtgagc ctgaaggacc accctgaatt gcccttgtag 120 
gacccagaac agcwaccagc agaatcagat tctc atg gac caa ytg gta ttc aaa 175
Met Asp Gln Leu Val Phe Lys
-45 -40

gag aca atc tgg aat gat gcg ttc tgg cag aac ccc tgg gac cag ggg 223
Glu Thr Ile Trp Asn Asp Ala Phe Trp Gln Asn Pro Trp Asp Gln Gly
-35 -30 -25

ggc ctg gca gtg att atc tta ttc atc acc gct gtc ctg ctt ctc atc 271
Gly Leu Ala Val Ile Ile Leu Phe Ile Thr Ala Val Leu Leu Leu Ile
-20 -15 -10

tta ttt gcc atc gtg ttt ggt tta ctc act tcc aca gaa aac act cag 319
Leu Phe Ala Ile Val Phe Gly Leu Leu Thr Ser Thr Glu Asn Thr Gln
-5 1 5

tgt gaa gcg ggt gaa gag gag tgacctgact tgctggggac tgagatggca 370
Cys Glu Ala Gly Glu Glu Glu
10 15

gcaggggagg cgagctgacc tgcccccatt ccagtggtgg gcccttcgcg gttccctctg 430
gctcaggggc caagcctggt gtcttccttt cccaccagga aaaagtctag taaaatactg 490
tatctggct 499

<210> 354 
<211> 537 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 13..498

<220>
<221> sig_peptide
<222> 13..153
<223> Von Heijne matrix
score 8.30000019073486
seq LALSSLLSLLLFA/GM

<400> 354 

tagcgctgac gc atg cgc ata gct aac cgc acc cgg ttc agc tcg cct ttc 51
Met Arg Ile Ala Asn Arg Thr Arg Phe Ser Ser Pro Phe
Phe


Ala 


Gly 
Met


Gin 


Met 
10
243
Leu
Ile


Leu 
Phe Ala Ser Gly Leu Ile His Arg Val Cys Val Thr Thr Cys Phe Ile
65 70 75

ttc tcc atg gtt ggt ctg tac tac atc aac aag atc tcc tcc acc ctg 435
Phe Ser Met Val Gly Leu Tyr Tyr Ile Asn Lys Ile Ser Ser Thr Leu
80 85 90

tac cag gca gca gct cca gtc ctc aca cca gcc aag gtc aca ggc aag 483
Tyr Gln Ala Ala Ala Pro Val Leu Thr Pro Ala Lys Val Thr Gly Lys
95 100 105 110

agc aag aag aga aac tgaccctgaa tgttcaataa agttgattct ttgcaaaaa 537
Ser Lys Lys Arg Asn
115

<210> 355 
<211> 514 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 132..401

<220>
<221> sig_peptide
<222> 132..236
<223> Von Heijne matrix
score 8.30000019073486
seq LGLGLVTLTASWA/AL

<400> 355 

ccttcagckc ccggccgaga gcccggtggc tgcgactgak agcctggtta ctgccggcac 60 
ctttggatcc tcggccaatt ctctgcttcg cctacacttg gcgtgcggtg taccagcctc 120 
cccgcgtcac c atg gaa aca gga gcc tct gca tcc atc cca gag ctg atc 170
Met Glu Thr Gly Ala Ser Ala Ser Ile Pro Glu Leu Ile
-35 -30 -25 
45
50
Pro

55 











tgggcatcaa gatctaaccc ttgaaccttc cacgaaccct gttgtcaata cctctgggtt 471 
tccgccggtt acgcttaatt ttgacatatc ggtcagactg gtg 514 

<210> 356 
<211> 589 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 41..223

<220>
<221> sig_peptide
<222> 41..115
<223> Von Heijne matrix
score 8.10000038146973
seq ATLLLVLCLQLGA/NK



<400> 356 

accacttctc tgggacacat tgccttctgt i 



tctccagc atg cgc ttg ctc cag
Met Arg Leu Leu Gln

ctc ctg ttc agg gcc agc cct gcc acc ctg ctc ctg gtt ctc tgc ctg
Leu Leu Phe Arg Ala Ser Pro Ala Thr Leu Leu Leu Val Leu Cys Leu

cag ttg ggg gcc aac aaa gct cag gac aac act cgg aag atc ata ata
Gln Leu Gly Ala Asn Lys Ala Gln Asp Asn Thr Arg Lys Ile Ile Ile
1 5 10

aag aat ttt gac att ccc aag tca gta cgt cca aat gac gaa gya ctg
Lys Asn Phe Asp Ile Pro Lys Ser Val Arg Pro Asn Asp Glu Xaa Leu
15 20 25

cag tgc ttg cag ttc aaa cag aat tgaaagaatg catggtggtt aaaacttacc
Gln Cys Leu Gln Phe Lys Gln Asn
30 35



tcattagcag catccctcta caaggtgcat ttaactataa gtatactgcc tgcctatgtg
acgacaatcc aaaaaccttc tactgggact tttacaccaa cagaactgtg caaattgcag
ccgtcgttga tgttattcgg gaattaggca tctgcctgat gatgctgctg taatccccat
caaaaacaac cggttttata ctattgaaat cctaaaggta gaataatgga agccctgtct 493
gtttgccaca cccaggtgat ttcctctaaa gaaacttggc tggaatttct gctgtggtct 553
ataaaataaa cttcttaaca tacaaaaaaa aaaaaa 589



<210> 357 
<211> 465 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 19..378

<220>
<221> sig_peptide
<222> 19..114
<223> Von Heijne matrix
score 7.59999990463257
seq VTLGIGFFALASA/LW
cta aag atg aac ccc aac ttg caa caa aaa ata tca ttt gtg atc cct 339
Leu Lys Met Asn Pro Asn Leu Gln Gln Lys Ile Ser Phe Val Ile Pro
60 65 70 75

cag aga cca gct cca caa caa atc gca gca gtg tta cat taagcttatc 388
Gln Arg Pro Ala Pro Gln Gln Ile Ala Ala Val Leu His
80 85
aacattacca tctgattctt attacagcca aagtatagaa gcagctgatg actggttttc 448
tgatgattct ctagtga 465



<210> 358 
<211> 438 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 84..317

<220>
<221> sig_peptide
<222> 84..140
<223> Von Heijne matrix
score 7.59999990463257
seq ALVLALMISMISA/DS



<400> 358 

atagaaaagg acatctcttg agacttcact tcagcttcac tgacttcttg actctcctct
tgagtaaaag gactcagcca act atg aag ttt ttt gtc ttt gct tta gtc ttg
Met Lys Phe Phe Val Phe Ala Leu Val Leu 
-15 -10 
tca att gtc act tgatgatata attgcaattt aaactgttaa gctgtgttca 357
Ser Ile Val Thr

gtactgtttc tgaataatag aaatcacttc tctaaaagca ataaatttca agcacatttt 417

taaataaaaa aaatmtmaaa a 438 



<210> 359 
<211> 537 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 57..317

<220>
<221> sig_peptide
<222> 57..116
<223> Von Heijne matrix
score 7.40000009536743
seq LLVFLAGFPVLDA/ND

<400> 359

gatttctccc ggaacctctg ctcagcctgg tgaaccacac aggccagcgc tctgac atg 59 

Met 
-20 



cag aag gtg acc ctg ggc ctg ctt gtg ttc ctg gca ggc ttt cct gtc 107
Gln Lys Val Thr Leu Gly Leu Leu Val Phe Leu Ala Gly Phe Pro Val
-15 -10 -5




ctg 


gac 


gcc 


aat 




eta 


gaa 


gat 


aaa 


aac 


agt 


cct 


ttc 


tac 


tat 


gac 


155 


Leu 


Asp 


Ala 


Asn 
1 


Asp 


Leu 


Glu 


Asp 
5 


Lys 




Ser 


Pro 


Phe 
10 


Tyr 


Tyr 


Asp 




tgg cac agc ctc cag gtt ggc ggg ctc atc tgc gct ggg gtt ctg tgc 203
Trp His Ser Leu Gln Val Gly Gly Leu Ile Cys Ala Gly Val Leu Cys
15 20 25




gcc 


atg 


ggc 


ate 


ate 


ate 


gte 


atg 


agt 


gea 




tge 


aaa 


tge 


aag 


ttt 


251 


Ala 


Met 


Gly 


He 


He 


He 


Val 


Met 


Ser 


Ala 


Lys 


Cys 


Lys 


Cys 


Lys 


Phe 




30 










35 










40 










45 




ggc cag aag tcc ggt cac cat cca ggg gag act cca cct ctc atc acc 299
Gly Gln Lys Ser Gly His His Pro Gly Glu Thr Pro Pro Leu Ile Thr
50 55 60




cca ggc tca gcc caa agc tgatgaggac agaccagctg aaattgggtg 347
Pro Gly Ser Ala Gln Ser
65
gaggaccgtt ctctgtcccc aggtcctgtc tctgcacaga aacttgaact ccaggatgga 407

attcttcctc ctctgctggg actcctttgc atggcagggc tcatctcacc tctcgcaaga 467

gggtctcttt gttcaatttt ttttaatcta aaatgattgt gcctctgccc aaaaaaaaaa 527

aaaaaaagct 537 



<210> 360 
<211> 499 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 69..383

<220>
<221> sig_peptide
<222> 69..125
<223> Von Heijne matrix
score 7.19999980926514
seq VSIMLLLVTVSDC/AV

<400> 360 

acaaggctga gcgggaggaa gcgagaggca tctaagcagg cagtgttttg ccttcacccc 60
aagtgacc atg aga ggt gcc acg cga gtc tca atc atg ctc ctc cta gta 110
Met Arg Gly Ala Thr Arg Val Ser Ile Met Leu Leu Leu Val
-15 -10 
Gly


agg 


tac 
Tyr 
75 


350 
cgc tgc tcc atg gac ttg aag aac atc aat ttt taggcgcttg cctggtctca 403
Arg Cys Ser Met Asp Leu Lys Asn Ile Asn Phe

80 85
ggatacccac catccttttc ctgagcacag cctggatttt tatttctgcc atgaaaccca 463
gctcccatga ctctcccagt ccctasactg actacc 499

<210> 361 
<211> 479 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 129..434

<220>
<221> sig_peptide
<222> 129..236
<223> Von Heijne matrix 
score 7 

seq PFFLITLVGVVVA/VV 
<400> 361 

tccggggccc gggagccaac cgagggcgtt cctgtcgggg ctgcagcggc gggagggagc 60 
ccagtggagg cgccctcccg aagcgccact gcycatgctg accacccagc cctccggctg 120 
ctgatgtc atg agt aac acc act gtg ccc aat gcc ccc cag gcc aac agc 170
Met Ser Asn Thr Thr Val Pro Asn Ala Pro Gln Ala Asn Ser
-35 -30 -25 



gac tcc atg gtg ggc tat gtg ttg ggg ccc ttc ttc ctc atc acc ctg 218
Asp Ser Met Val Gly Tyr Val Leu Gly Pro Phe Phe Leu Ile Thr Leu
-20 -15 -10










gtc ggg gtg gtg gtg gct gtg gta atg tat gta cag aag aaa aag cgg 266
Val Gly Val Val Val Ala Val Val Met Tyr Val Gln Lys Lys Lys Arg
-5 1 5 10




gtg gac cgg ctg cgc cat cac ctg ctc ccc atg tac agc tat gac cca 314
Val Asp Arg Leu Arg His His Leu Leu Pro Met Tyr Ser Tyr Asp Pro
15 20 25






gct gag gaa ctg crt gag gct gag cag gag ctg ctc tct gac atg gga 362
Ala Glu Glu Leu Xaa Glu Ala Glu Gln Glu Leu Leu Ser Asp Met Gly
30 35 40








gac ccc aag gtg gta cat ggc tgg cag agt ggc tac cag cac aag cgg 410
Asp Pro Lys Val Val His Gly Trp Gln Ser Gly Tyr Gln His Lys Arg
45 50 55










atg cca ctg ctg gat gtc aag acg tgacctgacc cccttgcccc acccttcaga 464
Met Pro Leu Leu Asp Val Lys Thr
60 65
gcctggggtc ctgga 479



<210> 362 
<211> 453 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 30..440

<220>
<221> sig_peptide
<222> 30..83
<223> Von Heijne matrix 
score 7 

seq VLGLVLLSVTVQG/KV 
<400> 362

cagcctagca ctctgaccta gcagtcaac atg aag gct ctc att gtt ctg ggg
Met Lys Ala Leu Ile Val Leu Gly



ctt gtc ctc ctt tct gtt acg gtc cag ggc aag gtc ttt gaa agg tgt 101
Leu Val Leu Leu Ser Val Thr Val Gln Gly Lys Val Phe Glu Arg Cys
-10 -5 1 5






gag ttg gcc aga act ctg aaa aga ttg gga atg gat ggc tac agg gga 149
Glu Leu Ala Arg Thr Leu Lys Arg Leu Gly Met Asp Gly Tyr Arg Gly
15 20








atc agc cta gca aac tgg atg tgt ttg gcc aaa tgg gag agt ggt tac 197
Ile Ser Leu Ala Asn Trp Met Cys Leu Ala Lys Trp Glu Ser Gly Tyr
25 30 35










aca 


cga 


get 


aca 


aac 


tac 


aat 


get 


gga 


gac 


aga 


age act 


gat 


tat 


245 


Asn 


Thr 


Arg 


Ala 


Thr 


Asn 


Tyr Asn Ala Gly Asp Arg 


Ser Thr 


Asp Tyr 






40 










45 










50 










ggg ata ttt cag atc aat agc cgc tac tgg tgt aat gat ggc aaa acc 293
Gly Ile Phe Gln Ile Asn Ser Arg Tyr Trp Cys Asn Asp Gly Lys Thr
55 60 65 70




cca 


gga 


gea 


gtt 






tgt 


cat 


tta 


tec 


tgc 


agt 


ggg tgg 


cat 


gga 


341 


Pro 


Gly 


Ala 


Val 


Asn 


Ala 


Cys 


His 


Leu 


Ser 


Cys 


Ser 


Gly Trp 


His 


Gly 





75 80 85 

gaa atc gtt gtc aaa aca gag atg tcc gtc agt atg ttc aag gtt gtg
Glu Ile Val Val Lys Thr Glu Met Ser Val Ser Met Phe Lys Val Val

90 95 100

gag tgt aac tcc aga att ttc ctt ctt cag ctc att ttg tct ctc tca
Glu Cys Asn Ser Arg Ile Phe Leu Leu Gln Leu Ile Leu Ser Leu Ser

105 110 115 

cat taaggggagt agg 
His 

<210> 363 
<211> 417 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 82..384

<220>
<221> sig_peptide
<222> 82..168
<223> Von Heijne matrix
score 6.90000009536743
seq VCMCVFSTQGALG/EM

<220>
<221> misc_feature
<222> 404 

<223> n=a, g, c or t 
<400> 363 

gctacctcct ggtgtctgtt gcctggtgtg tgggaaaccc ccacacgctg gaccacagaa
gtcttctgtg ttgactgttg a atg agg gaa gag aaa aag ccc ttt gag aga
Met Arg Glu Glu Lys Lys Pro Phe Glu Arg
-25 -20
gag aga gag agt gtg tgt gtg tgt atg tgt gtg ttt tcc act caa gga
Glu Arg Glu Ser Val Cys Val Cys Met Cys Val Phe Ser Thr Gln Gly

-15 -10 -5

gct ttg ggg gaa atg gct gca cac ttc ata gat gaa aag ctg agg ccc
Ala Leu Gly Glu Met Ala Ala His Phe Ile Asp Glu Lys Leu Arg Pro
1 5 10



agt gag ggg aat ggt cac aga ggg aca ctg gat agc ctg agt tcg gac 255
Ser Glu Gly Asn Gly His Arg Gly Thr Leu Asp Ser Leu Ser Ser Asp
15 20 25














gag 


tec tac 


ate 


ccc 


tec 


ace 


gcc 


gac 


ece ace 


eag 


get 


ggc 


cot 




Gin 


Glu 


Ser Tyr 


He 


Pro 


Ser 


Thr Ala Asp 


Pro Thr 


Gin 


Ala 


Gly 


Pro 




30 








35 










40 








45 




gag ctg ctg cac aag aac ttg ccc gtg acc tcc cga tcc cag ccg ctg 351
Glu Leu Leu His Lys Asn Leu Pro Val Thr Ser Arg Ser Gln Pro Leu
50 55 60




ccc tct gac ctc gcg atc cca gcc gct gcc ctc tgacctcccg atcccagccn 404
Pro Ser Asp Leu Ala Ile Pro Ala Ala Ala Leu
65 70
gctgccctcg ctg 417



<210> 364 
<211> 529 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114..497

<220>
<221> sig_peptide
<222> 114..191

<223> Von Heijne matrix 

score 6.80000019073486 
seq VFLSLFLIQLLIS/FS 



<400> 364 

ctcagcaccc agggcggtgg taggtcacag tctctgggcg ggtctcagtg tccaacactg 60
tagctggtgc ctgccaggtt cccagtggct ggggtcacca ggtctgaaga gag atg 116

Met 



tgc tgg ctg cgg gca tgg ggc cag atc ctc ctg cca gtt ttc ctc tcc 164
Cys Trp Leu Arg Ala Trp Gly Gln Ile Leu Leu Pro Val Phe Leu Ser
-25 -20 -15 -10




ctc ttt ctc atc caa ttg ctt atc agc ttc tca gag aat ggt ttt atc 212
Leu Phe Leu Ile Gln Leu Leu Ile Ser Phe Ser Glu Asn Gly Phe Ile
-5 1 5




cac 


age 


ccc 


agg 




aat 


eag 


aaa 


cca 


aga 


gat 


ggg 


aat 


gaa 


gag 


gaa 


260 


His 


Ser 


Pro 




Asn 


Asn 


Gin 


Lys 


Pro 


Arg 


Asp Gly 


Asn 


Glu 


Glu 


Glu 








10 










15 










20 










tgt 


get 


gta 




aag 


agt 


tgt 


caa 


ttg 


tgc 


aca 


gaa 


gat 


aag 


aaa 


tgt 


308 


Cys 


Ala 


Val 


Lys 


Lys 


Ser 


Cys 


Gin 


Leu 


Cys 


Thr 


Glu 


Asp 


Lys 


Lys 


Cys 






25 










30 










35 












gtt 


tgg 


tgt 








aaa 


gca 


tgc 


aaa 


aaa 


tac 


tgt 


ttt 


ccc 


tat 


356 


Val 


Trp 


Cys 


Ser 


Glu 


Glu 


Lys 


Ala 


Cys 


Lys 


Lys 


Tyr 


Cys 


Phe 


Pro 


Tyr 




40 










45 










50 










55 




ttc ggt tgt cga ttc agt tct ata tat tgg tta aac tgt aaa gtt gac 404
Phe Gly Cys Arg Phe Ser Ser Ile Tyr Trp Leu Asn Cys Lys Val Asp
60 65 70






atg ttt gga atc atg atg ctt cta ctc att gca gta tta att aca gga 452
Met Phe Gly Ile Met Met Leu Leu Leu Ile Ala Val Leu Ile Thr Gly
75 80 85




ttc gtt tgg tac tgc tgc gcc tat cac ttt tac ctg cag gat ata 497
Phe Val Trp Tyr Cys Cys Ala Tyr His Phe Tyr Leu Gln Asp Ile
90 95 100
tgatgaatag ataattgraa agagatcctc ca 529



<210> 365 
<211> 505 
<212> DNA

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 71..496

<220>
<221> sig_peptide
<222> 71..142
<223> Von Heijne matrix 
score 6.5 

seq LSICLSAVATATG/AE 



<400> 365 

gacgaggacs cgcctgcgca gaggcggcag caccaccggg gttgactccg ggggcgcggc 60 
gaggagagac atg agg ctg agc tgg ttc cgg gtc ctg aca gta ctg tcc 109

Met Arg Leu Ser Trp Phe Arg Val Leu Thr Val Leu Ser 
-20 -15 



ate 


tgc 


ctg 


age 


gcc gtg 


gcc 


acg 


gcc 


acg 


ggg 




gag 


gge 




agg 


157 


He 


Cys 
-10 


Leu 


Ser 


Ala Val 


Ala 
-5 


Thr 


Ala 


Thr 


Gly 


Ala 
1 


Glu 


Gly 


Lys 


Arg 
5 




aag 


ctg 


cag 


ate 


ggg gtc 


aag 


aag 


egg 


gtg 




cac 


tgt 


ccc 


ate 


aaa 


205 


Lys 


Leu 


Gin 


He 


Gly Val 
10 


Lys 


Lys 




Val 
15 


Asp 


His 


Cys 


Pro 


He 
20 


Lys 




tcg cgc aaa ggg gat gtc ctg cac atg cac tac acg ggg aag ctg gaa 253
Ser Arg Lys Gly Asp Val Leu His Met His Tyr Thr Gly Lys Leu Glu
25 30 35




gat ggg aca gag ttt gac agc agc ctg ccc cag aac cag ccc ttt gtc 301
Asp Gly Thr Glu Phe Asp Ser Ser Leu Pro Gln Asn Gln Pro Phe Val
40 45 50










ttc tcc ctt ggc aca ggc cag gtc atc aag ggc tgg gac cag ggg ctg 349
Phe Ser Leu Gly Thr Gly Gln Val Ile Lys Gly Trp Asp Gln Gly Leu
55 60 65












ctg ggg atg tgt gag ggg gaa aag cgc aas ykg gtg atc cca tcc gag 397
Leu Gly Met Cys Glu Gly Glu Lys Arg Xaa Xaa Val Ile Pro Ser Glu
70 75 80 85




cta ggg tat gga gag cgg gga gct ccc cca aag att cca ggc ggt gca 445
Leu Gly Tyr Gly Glu Arg Gly Ala Pro Pro Lys Ile Pro Gly Gly Ala
90 95 100




acc ctg gtg ttc gag gtg gag ctg ctc aaa ata gag cga cga act gag 493
Thr Leu Val Phe Glu Val Glu Leu Leu Lys Ile Glu Arg Arg Thr Glu
105 110 115
ctg taaccagac 505
Leu



<210> 366 
<211> 539 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 65..274

<220>
<221> sig_peptide
<222> 65..136
<223> Von Heijne matrix 
score 6.5 

seq ILLTRPACLGSWA/EI 



<400> 366 
cactgattct ttggccatcg ctactacaat tttagaattg ctggcaattt tttctgcaca 60
ctgg atg gct tct tcc acc agt gtc tca aca gga caa atc ttg ctg aca 109
Met Ala Ser Ser Thr Ser Val Ser Thr Gly Gln Ile Leu Leu Thr
-20 -15 -10 



aga cct gct tgc ttg ggg tcc tgg gct gag atc cgg tca ccg gtg agg 157
Arg Pro Ala Cys Leu Gly Ser Trp Ala Glu Ile Arg Ser Pro Val Arg
-5 1 5




acc atc tcc atc gcc agc gac ttc cca aca gca cgg gtg agt ctc tgg 205
Thr Ile Ser Ile Ala Ser Asp Phe Pro Thr Ala Arg Val Ser Leu Trp
10 15 20










gtg ccg ccc gca cct ggg atg gtt cct att aag atc tcc ggc tgt gca 253
Val Pro Pro Ala Pro Gly Met Val Pro Ile Lys Ile Ser Gly Cys Ala
25 30 35












aac tgg gcc ttc tca ccg gca tagatgatat cacacatcat ggcaagctca 304
Asn Trp Ala Phe Ser Pro Ala
40 45



















cagcccccgc caaaggcata gccattgaca gcagcgatga ctggattctt gacctgggtg 364 

aggtggtccc agtgcttcaa gaacttgctg gagtaacagt cctggaaact caggttctgc 424

atttccttga tatcagctct agtgaaaagg gcggtagttg gtggtggaac cagaaacgga 484

cgccggtgct tkgagcggtt cttaaattat atttaaaaag taactttttg tataa 539



<210> 367 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 14..178

<220>
<221> sig_peptide
<222> 14..82

<223> Von Heijne matrix 

score 6.40000009536743 
seq LGALSGAAALGFA/SY 



<400> 367 

ctgttctaca gct atg gcc ggg cca gct gca gct ttc cgc cgc ttg ggc 49
Met Ala Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly
-20 -15

gcc ttg tcc gga gct gcg gcc tta ggc ttc gct tcc tac ggg gcg cac 97
Ala Leu Ser Gly Ala Ala Ala Leu Gly Phe Ala Ser Tyr Gly Ala His
-10 -5 1 5

ggc gcm aat tcc cag atg cct acg gga agg agc tgt ttg aca agg cca 145
Gly Ala Asn Ser Gln Met Pro Thr Gly Arg Ser Cys Leu Thr Arg Pro
10 15 20

aca aac acc act tct tac aca gcc tgg ccc tgt taggggtgcc ccattgcaga 198
Thr Asn Thr Thr Ser Tyr Thr Ala Trp Pro Cys
25 30

aagccactct gggctgggtt attgctagct tccggaacga ccttattctg caccagcttt 258
tactaccagg ctctgagtgg agaccccagc atccagactt tggcccctgc gggagggacc 318
ctgctactct tgggctggct tgccttggct ctttgagctc ccttttgctt aattactggg 378
ttttctgggc agtttttttt ttaaagagtt ggagtaagaa gaggattaaa aaggaaaggc 438
aaataaaaaa aaaaaaaaaa gc 460



<210> 368 
<211> 482 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 14..178

<220>
<221> sig_peptide
<222> 14..82

<223> Von Heijne matrix 

score 5.40000009536743 

seq LGALSGAAALGFA/SY 

<400> 368 

ctgttctaca gct atg gcc ggg cca gct gca gct ttc cgc cgc ttg ggc 49
Met Ala Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly
-20 -15

gcc ttg tcc gga gct gcg gcc tta ggc ttc gct tcc tac ggg gcg cac 97
Ala Leu Ser Gly Ala Ala Ala Leu Gly Phe Ala Ser Tyr Gly Ala His
-10 -5 1 5

ggc gcm aat tcc cag atg cct acg gga agg agc tgt ttg aca agg cca 145
Gly Ala Asn Ser Gln Met Pro Thr Gly Arg Ser Cys Leu Thr Arg Pro
10 15 20

aca aac acc act tct tac aca gcc tgg ccc tgt taggggtgcc ccattgcaga 198
Thr Asn Thr Thr Ser Tyr Thr Ala Trp Pro Cys
25 30

aagccactct gggctgggtt attgctagct tccggaacga ccttattctg caccagcttt 258
tactaccagg ctctgagtgg agaccccagc atccagactt tggcccctgc gggagggacc 318
ctgctactct tgggctggct tgccttggct ctttgagctc ccttttgctt aattactggg 378
ttttctgggc agtttttttt ttaaagagtt ggagtaagaa gaggattaaa aaggaaaggc 438
aaataaactt tggagtcttt gttcatcaaa aaaaaaaaaa aaaa 482



<210> 369 
<211> 497 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 161..322

<220>
<221> sig_peptide
<222> 161..211

<223> Von Heijne matrix 

score 6.40000009536743 
seq LTFAQLLFATVLG/IA 



<400> 369 

agatgggggt aggagctctc cagacagtgt cccctcactc acccccgcgg cgcctggcgt 60
cttctcctgg gtgttgaagt acaagatttg aagagatttc ctgggtgtgg agtgtgactt 120
tccaaaacca gcttttcctt gagctgtatt tgttgcagca atg ttt agg aga ttg 175
Met Phe Arg Arg Leu
-15

act ttt gca caa ctg ctt ttt gcc act gtc ctt gga att gct gga gga 223
Thr Phe Ala Gln Leu Leu Phe Ala Thr Val Leu Gly Ile Ala Gly Gly
-10 -5 1

gta tat att ttt caa cca gta ttt gaa cag tat gcc aaa gat cag aag 271
Val Tyr Ile Phe Gln Pro Val Phe Glu Gln Tyr Ala Lys Asp Gln Lys
5 10 15 20

gaa tta aaa gaa aag atg cag ttg gta caa gaa tca gaa gag aag aaa 319
Glu Leu Lys Glu Lys Met Gln Leu Val Gln Glu Ser Glu Glu Lys Lys
25 30 35

agt taatactaca tggagttagg cctggcgcag tggctcacgc ctgtaatccc 372
Ser

agcactttgg gaggccgagg cgggtggatc aggtggtcag gagttcaaga ccagcctgac 432
caacatggtg aaacctgtct ctactgaaaa tacaaaaatc agccaggctt ggtggcatgc 492
gctgt 497



<210> 370 
<211> 422 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 47..295

<220>
<221> sig_peptide
<222> 47..91

<223> Von Heijne matrix 

score 6.30000019073486 
seq LFLSLPVLWVLS/IV 



<400> 370 

aaccaagccc tccagcaagg attcagagtg cccctccggc ctcgcc atg agg ctc 55

Met Arg Leu 
-15 



ttc ctg tcg ctt ccg gtc ctg gtg gtg gtt ctg tcg atc gtc ttg gaa 103
Phe Leu Ser Leu Pro Val Leu Val Val Val Leu Ser Ile Val Leu Glu
-10 -5 1




ggc 


cca 


gcc 


cca 


gee 


cag 


ggg 




cca 


gae 


gte 


tee 


agt 


gee 


ttg 


gat 


151 


Gly 


Pro 


Ala 


Pro 


Ala 


Gin 


Gly 


Thr 


Pro 


Asp 


Val 


Ser 


Ser 


Ala 


Leu 


Asp 




5 










10 










15 










20 




aag ctg aag gag ttt gga aac aca ctg gag gac aag gct cgg gaa ctc 199
Lys Leu Lys Glu Phe Gly Asn Thr Leu Glu Asp Lys Ala Arg Glu Leu
25 30 35




atc agc cgc atc aaa cag agt gaa ctt tct gcc aag atg cgg gag tgg 247
Ile Ser Arg Ile Lys Gln Ser Glu Leu Ser Ala Lys Met Arg Glu Trp
40 45 50




ttt tca gag aca ttt cag aaa gtg aag gat aaa ctc aag att gac tca 295
Phe Ser Glu Thr Phe Gln Lys Val Lys Asp Lys Leu Lys Ile Asp Ser
55 60 65





tgaggacctg aagggtgaca tcccaggagg ggcctctgaa atttcccaca ccccagcgcc 355 
tgtgctgagg actccctcca tgtggcccca ggtgccacca ataaaaatcc tacagaaaac 415
taatata 422 



<210> 371 
<211> 495 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 66..359

<220>
<221> sig_peptide
<222> 66..194
<223> Von Heijne matrix 
score 6 

seq FLIYVALLRVSEC/LP 



<400> 371 

gggcgcgagg cggccaccgt ggagagcaga gcgcggcggc tggaagctgc taagtcagag 60 
ccgcg atg ttc cgg att gag ggc ctc gcg ccg aag ctg gac ccg gag gag 110
Met Phe Arg Ile Glu Gly Leu Ala Pro Lys Leu Asp Pro Glu Glu
-40 -35 -30

atg aaa cgg aag atg cgc gag gat gtg atc tcc tcc ata cgg aac ttt 158
Met Lys Arg Lys Met Arg Glu Asp Val Ile Ser Ser Ile Arg Asn Phe
-25 -20 -15




ctc atc tac gtg gcc ctc ctg cga gtc agt gag tgt ctc ccg ggc tgt 206
Leu Ile Tyr Val Ala Leu Leu Arg Val Ser Glu Cys Leu Pro Gly Cys
-10 -5 1




gac 


tgt 


gat 


aec 


age 


ggg 


gag 


etc 


aec 


gac ggg 


cac 




tta 




254 




















Asp Gly His 


Pro 


Leu 


Thr Leu 




5 










10 








15 








20 




agg ggt cat cgg ggc ctt cga act gag ctg aac ggt agc ggg gag caa 302
Arg Gly His Arg Gly Leu Arg Thr Glu Leu Asn Gly Ser Gly Glu Gln
25 30 35




gga ggc tcc att tat ctt aaa gaa att gga cag cat atg aag aca gga 350
Gly Gly Ser Ile Tyr Leu Lys Glu Ile Gly Gln His Met Lys Thr Gly
40 45 50



cat cac ata tgaatgcacg atatgaagag cctggttaca gtttcgactc 399 
His His Ile
55

ctctctgcaa gtgaataggc ccagaaaggt gtaagagact ctttgaatgg acataaaatt 459

ctgcttgtta agaacaagtt tggctctggt aactga 495

<210> 372 
<211> 515 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 115..360

<220>
<221> sig_peptide
<222> 115..243

<223> Von Heijne matrix 

score 5.59999990463257 

seq VVSFALIATLVYA/LF 

<400> 372 

ataaggggac gtctagtggg ttgcccggga gggtggcggg agggtcctgg aaataatctg 60 
tcctctgtcg ccgggaactg gcgaggtagt tccttcgcgg tggagagacc tgga atg 117 

Met 



gcc aaa tat caa ggt gaa gtt caa agt ttg aaa ctg gat gat gat tca 165
Ala Lys Tyr Gln Gly Glu Val Gln Ser Leu Lys Leu Asp Asp Asp Ser
-40 -35 -30










gtt 


ata 


gaa 


gga 


gta 


age 


gac 


caa 




ett 


gtg 




gtt 


gtg 




agt 


213 


Val 


He 


Glu 


Gly 


Val 


Ser 


Asp 


Gin 


Val 


Leu 


Val 


Ala 


Val 


Val 


Val 


Ser 






-25 










-20 










-15 












ttc gct ttg att gct acc ctg gta tat gca ctt ttc aga aat gta cat 261
Phe Ala Leu Ile Ala Thr Leu Val Tyr Ala Leu Phe Arg Asn Val His
-10 -5 1 5






caa aac att cac cca gaa aac cag gag cta gta agg gta ctt cga gaa 309
Gln Asn Ile His Pro Glu Asn Gln Glu Leu Val Arg Val Leu Arg Glu
10 15 20








cag ctt caa aca gaa cag gat gca cct gct gac tcg aca gca gtt cta 357
Gln Leu Gln Thr Glu Gln Asp Ala Pro Ala Asp Ser Thr Ala Val Leu
25 30 35

cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 410
His

ccaactgtgg acatcttttt tgtggtgcct gcattattgc ttactggcga tatggttcat 470
ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatg 515



<210> 373 
<211> 513 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 56..352

<220>
<221> sig_peptide
<222> 56..139
<223> Von Heijne matrix 
score 5.5 

seq LGYLVLSEGAVLA/SS 

<400> 373 

ctgaagccgg aagctaccta tctggtaggg agctccccca gcaccgaaga ctgcg atg 58 

Met 

act tct gca ctg acc cag ggg ctg gag cga atc cca gac cag ctc ggc 106
Thr Ser Ala Leu Thr Gln Gly Leu Glu Arg Ile Pro Asp Gln Leu Gly
-25 -20 -15

tac ctg gta ctg agt gaa ggt gca gtg ctg gcg tca tct ggg gac ctg 154
Tyr Leu Val Leu Ser Glu Gly Ala Val Leu Ala Ser Ser Gly Asp Leu
-10 -5 1 5

gag aat gat gag cag gca gcc agt gcc atc tct gag ctg gtc agc aca 202
Glu Asn Asp Glu Gln Ala Ala Ser Ala Ile Ser Glu Leu Val Ser Thr
10 15 20

gcc tgc ggt ttc cgg ctg cac cgc ggc atg aat gtg ccc ttc aag cgc 250
Ala Cys Gly Phe Arg Leu His Arg Gly Met Asn Val Pro Phe Lys Arg
25 30 35

ctg tct gtg gtc ttt gga gaa cac aca ctg ctg gtg acg gtg tca gga 298
Leu Ser Val Val Phe Gly Glu His Thr Leu Leu Val Thr Val Ser Gly
40 45 50

cag agg gtg ttt gtg gtg aag agg cag aac cga ggt cgg gag ccc att 346
Gln Arg Val Phe Val Val Lys Arg Gln Asn Arg Gly Arg Glu Pro Ile
55 60 65

gat gtc tgagcctgcc ggaggcgagg gtcggagaag cggattgggt cctgggcctc 402
Asp Val
70

tgtgatgagg caggcacacc tgtcggtctt ggcttgctgc tagaaactag ggccttctgc 462
tcgcccacct cccaccccct acctggacgg gcccaggctt ggggactctg a 513

<210> 374 
<211> 632 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 2..559

<220>
<221> sig_peptide
<222> 2..46

<223> Von Heijne matrix 

score 5.30000019073486 
seq VMAASMARGGVSA/RV 

<400> 374 

g atg cat gtc atg gcc gcc tcc atg gcc cgg gga ggc gtg agt gcc agg 49
Met His Val Met Ala Ala Ser Met Ala Arg Gly Gly Val Ser Ala Arg
-15 -10 -5 1

gtt cta ctg cag gct gcc agg ggc acc tgg tgg aac aga cct ggg ggc 97
Val Leu Leu Gln Ala Ala Arg Gly Thr Trp Trp Asn Arg Pro Gly Gly
5 10 15 






act tcc ggg tcg ggg gag ggg gtg gcg ctg ggg aca acc aga aag ttt 145
Thr Ser Gly Ser Gly Glu Gly Val Ala Leu Gly Thr Thr Arg Lys Phe
20 25 30

caa gcg aca ggc tcg cgc ccg gct gga gag gag gac gcg ggc ggc ccg 193
Gln Ala Thr Gly Ser Arg Pro Ala Gly Glu Glu Asp Ala Gly Gly Pro
35 40 45

gag cgg ccc ggg gac gtg gtg aac gtg gtg ttc gta gac cgc tca ggc 241
Glu Arg Pro Gly Asp Val Val Asn Val Val Phe Val Asp Arg Ser Gly
50 55 60 65

cag cgg atc cca gtg agt ggc aga gtc ggg gac aat gtt ctt cac ctg 289
Gln Arg Ile Pro Val Ser Gly Arg Val Gly Asp Asn Val Leu His Leu
70 75 80

gcc cag cgc cac ggg gtg gac ctg gaa ggg gcc tgt gaa gcc tcc ctg 337
Ala Gln Arg His Gly Val Asp Leu Glu Gly Ala Cys Glu Ala Ser Leu
85 90 95

gcc tgc tcc acc tgc cat gtg tat gtg agt gaa gac cac ctg gat ctc 385
Ala Cys Ser Thr Cys His Val Tyr Val Ser Glu Asp His Leu Asp Leu
100 105 110

ctg cct cct ccc gag gag agg gaa gac gac atg cta gac atg gcc ccc 433
Leu Pro Pro Pro Glu Glu Arg Glu Asp Asp Met Leu Asp Met Ala Pro
115 120 125

ctc ctc cag gag aac tcg cgg ctg ggc tgc cag att gtg ctg aca ccg 481
Leu Leu Gln Glu Asn Ser Arg Leu Gly Cys Gln Ile Val Leu Thr Pro
130 135 140 145

gag ctg saa gga gcg gaa ttc acc ctg ccc aag atc acc agg aac ttc 529
Glu Leu Xaa Gly Ala Glu Phe Thr Leu Pro Lys Ile Thr Arg Asn Phe
150 155 160

tac gtg gat ggc cat gtc ccc aag ccc cac tgacatgaac acctggacca 579
Tyr Val Asp Gly His Val Pro Lys Pro His
165 170

ttccacattg ccatggcccc agggccagat tcigggaata gcaggtgcag cct 632



<210> 375 
<211> 503 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 59..427

<220>
<221> sig_peptide
<222> 59..202

<223> Von Heijne matrix 

score 5.19999980926514 
seq LLMLLLFLSELQY/YL 



<400> 375 

aggccggaag agggagtctg taggggcggg ccgggcgtcc cctttccggc cggtcccc 58 



atg gag gcg ctg ggg aag ctg aag cag ttc gat gcc tac ccc aag act 106
Met Glu Ala Leu Gly Lys Leu Lys Gln Phe Asp Ala Tyr Pro Lys Thr
-45 -40 -35




ttg gag gac ttc cgg gtc aag acc tgc ggg ggc gcc acc gtg acc att 154
Leu Glu Asp Phe Arg Val Lys Thr Cys Gly Gly Ala Thr Val Thr Ile
-30 -25 -20




gtc agt ggc ctt ctc atg ctg cta ctg ttc ctg tcc gag ctg cag tat 202
Val Ser Gly Leu Leu Met Leu Leu Leu Phe Leu Ser Glu Leu Gln Tyr
-15 -10 -5




tac ctc acc acg gag gtg cat cct gag ctc tac gtg gac aag tcg cgg 250
Tyr Leu Thr Thr Glu Val His Pro Glu Leu Tyr Val Asp Lys Ser Arg
1 5 10 15






gga gat aaa ctg aag atc aac atc gat gta ctk ttt ccg cac atg cct 298
Gly Asp Lys Leu Lys Ile Asn Ile Asp Val Leu Phe Pro His Met Pro
20 25 30

tgt gcc tat ctg agt att gat gcc atg gat gtg gcc gga gaa cag cag 346
Cys Ala Tyr Leu Ser Ile Asp Ala Met Asp Val Ala Gly Glu Gln Gln
35 40 45

ctg gat gtg gaa cac aac ctg ttc aag caa cga cta gat aaa gat ggc 394
Leu Asp Val Glu His Asn Leu Phe Lys Gln Arg Leu Asp Lys Asp Gly
50 55 60

atc ccc gtg agc tca gag gct gag cgg cat gat taatgcagga caccagtgga 447
Ile Pro Val Ser Ser Glu Ala Glu Arg His Asp
65 70 75

accaacataa tttgctatag gactggatgt ggagtggagg agagcaaagt catggt 503



<210> 376 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 145..375

<220>
<221> sig_peptide
<222> 145..231
<223> Von Heijne matrix
score 5.09999990463257
seq LLLKLQRLPQAEP/VE 



<400> 376 

gacaaacttg ttctgcgggc tgcggatggg tgcgagggtg gaatctcggt gctgcgacga 60 
gtgtggggcc arccgtggag gctccaggtg ttctctctgc cccagcagag cccggcagga 120 
gccccaacag gaagccagcg cggc atg gct gcc acc gac ttc gtg cag gag 171
Met Ala Ala Thr Asp Phe Val Gln Glu
-25 



atg cgc gcc gtg ggc gag agg ctg ctg ctc aag ctg cag aga ctg ccc 219
Met Arg Ala Val Gly Glu Arg Leu Leu Leu Lys Leu Gln Arg Leu Pro
-20 -15 -10 -5




cag gct gag ccc gtg gag atc gtg gcc ttc tca gtc atc atc ctt ttc 267
Gln Ala Glu Pro Val Glu Ile Val Ala Phe Ser Val Ile Ile Leu Phe
1 5 10






get 


act 


gtt 


ctg 


ctg 


ttg 


ctg 


ctg 


ata 


gcc 


tge 


age 


tge 


tge 


tge 


315 


Thr 


Ala 


Thr 
15 


Val 


Leu 


Leu 


Leu 


Leu 
20 


Leu 


He 


Ala 


Cys 


Ser 
25 


Cys 


Cys 


Cys 




act cac tgc tgc tgc cct gag crg aga ggc agg aag gtc cag gtg cag 363
Thr His Cys Cys Cys Pro Glu Xaa Arg Gly Arg Lys Val Gln Val Gln
30 35 40




ccg 




cca


cca 


tgacggackg gcgatggctg aggagaagct ggagaggaga 




415 


Pro 
45 


Thr 


Pro 


Pro 





























tggccaatgc catgacacag gccatcagcc tggccctgca gcccttaccc ctcaagacsa 475
ggctccctgg cccagctctg gcccagccag gtacctggac actgac 521



<210> 377 
<211> 461 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 17..337

<220>
<221> sig_peptide
<222> 17..97
<223> Von Heijne matrix 

score 4.90000009536743 

seq VLSTLSLVQFSPS/GR 

<400> 377 

actttagctg ctgcag atg ctg ttt gaa gaa gct ctg ccc ctc agc tgc tct 52
Met Leu Phe Glu Glu Ala Leu Pro Leu Ser Cys Ser 
-25 -20 





cct 


gtg 














gtg 






age 






gga 




Asp 


Pro 


Val 


Leu 


Ser 


Thr 


Leu 


Ser 




Val 


Gin 


Phe 


Ser 


Pro 


Ser 


Gly 




-15 










-10 










-5 










1 




agg acc cag gac ctg ctc tct cca ggg gtg gag aac ctg tcg gtg ctg 148
Arg Thr Gln Asp Leu Leu Ser Pro Gly Val Glu Asn Leu Ser Val Leu
5 10 15




gac gtg tcc cct ctg ggc ttg gcc tgc tgt ctg ctc act ctc acc atg 196
Asp Val Ser Pro Leu Gly Leu Ala Cys Cys Leu Leu Thr Leu Thr Met
20 25 30




tcc tgc cca ggg cct gac cct cct gag ggg ccc ggg acc cag cgt gtg 244
Ser Cys Pro Gly Pro Asp Pro Pro Glu Gly Pro Gly Thr Gln Arg Val
35 40 45




tgg 


caa 


ggg 


get 


eta 


egg 


ate 


eta 


eag 


ete 


eea 


gga 


gee 


eea 




ggg 


292 


Trp 


Gin 


Gly 


Ala 


Leu 


Arg 


He 


Leu 


Gin 


Leu 


Pro 


Gly 


Ala 


Pro 


Asp 


Gly 




50 










55 










60 










65 




gtt tca cca tat cag cca gtc tgg tct cga act cct gac ctc aag 337
Val Ser Pro Tyr Gln Pro Val Trp Ser Arg Thr Pro Asp Leu Lys
70 75 80



tgattcgcca gcctcgtcct cccaaagtgc tgggattaca gatgtgagcc accatgccca 397
tggagaatgt attacttttg taatttaaaa aataatagta ataaaataaa gatctttaca 457
atgt 461 

<210> 378 
<211> 495 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 148..483

<220>
<221> sig_peptide
<222> 148..264
<223> Von Heijne matrix
score 4.80000019073486
seq IILFSAIVGFIYG/YV

<220>
<221> misc_feature
<222> 43,112,122,154
<223> n=a, g, c or t

<400> 378 

cctaatcgaa aattcgctct cccgggctta gaaggcccgg ctnctgacgc gcagtgccag 60 
accttacccc tcacggtcct taagtctcgg tcgccctcgc ctcgcagcct gncacccgcg 120
mncagctgcc cgcctcctca gccagcc atg ctg nag cat ctg agc tcg ctg ccc 174
Met Leu Xaa His Leu Ser Ser Leu Pro
-35

acg cag atg gat tac aag ggc cag aag cta gct gaa cag atg ttt cag 222
Thr Gln Met Asp Tyr Lys Gly Gln Lys Leu Ala Glu Gln Met Phe Gln
-30 -25 -20 -15

gga att att ctt ttt tct gca ata gtt gga ttt atc tac ggg tac gtg 270



Ala Ile Val Gly Phe Ile Tyr Gly Tyr Val

-5 1 

gtc tat ata gtt atg gcc gga ttt gct 318

Val Tyr Ile Val Met Ala Gly Phe Ala



Gly 


He 


He 


Leu 


Phe 


Ser 










-10 




get 


gaa 


eag 


tte 


ggg 


tgg 


Ala 


Glu 


Gin 
5 


Phe 


Gly 


Trp 


ttt 


tea 


tgt 


ttg 


etg 


aea 


Phe 


Ser 


Cys 


Leu 


Leu 


Thr 




20 










cct 


etc 


aag 


tgg 


tta 


cct 


Pro 


Leu 


Lys 


Trp 


Leu 


Pro 


35 










40 


aaa 


gaa 


aaa 


tta 


aga 


ggc 


Lys 


Glu 


Lys 


Leu 


Arg 


Gly 










55 




cag 


cac 


ctg 


ctt 


ttg 


ttt 


Gin 


His 


Leu 


Leu 


Leu 


Phe 








70 







<210> 379 
<211> 541 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 60..368

<220>
<221> sig_peptide
<222> 60..122
<223> Von Heijne matrix
score 4.69999980926514
seq ALRSLNLAPPTVA/AP



<400> 379 

acagtcaacg tcatctagga gcaccgagca gcttggctaa aagtaagggt gtcgtgctg 59
atg gcc ctg tgc gca ctg acc cgc gct ctg cgc tct ctg aac ctg gcg 107
Met Ala Leu Cys Ala Leu Thr Arg Ala Leu Arg Ser Leu Asn Leu Ala

-20 -15 -10

ccc ccg acc gtc gcc gcc cct gcc ccg agt ctg ttc ccc gcc gcc cag 155
Pro Pro Thr Val Ala Ala Pro Ala Pro Ser Leu Phe Pro Ala Ala Gln
-5 1 5 10

atg atg aac aat ggc ctc ctc caa cag ccc tct gcc ttg atg ttg ctc 203
Met Met Asn Asn Gly Leu Leu Gln Gln Pro Ser Ala Leu Met Leu Leu

15 20 25

ccc tgc cgc cca gtt ctt act tct gtg gcc ctt aat gcc aac ttt gtg 251
Pro Cys Arg Pro Val Leu Thr Ser Val Ala Leu Asn Ala Asn Phe Val

30 35 40

tcc tgg aag agt cgt acc aag tac acc att aca cca gtg aag atg agg 299
Ser Trp Lys Ser Arg Thr Lys Tyr Thr Ile Thr Pro Val Lys Met Arg

45 50 55 



aag 


tct 


ggg 


ggc 


cga 






aea ggt get 


gga aac 


gtg cgt age aac 


Lys 


Ser 


Gly 


Gly 


Arg 


Asp 


His 


Thr Gly Ala 


Gly Asn 


Val Arg Ser Asn 


60 










65 






70 


75 


agt 


agg 


ccg 


agt 


ate 




cgt 


tgatcataac i 


laacgggtc 


:a ttggcaaggc 


Ser 


Arg 


Pro 


Ser 


He 
80 


Gin 


Arg 









aggtcgcaac cgctggctgg gcaagaggcc taacagtggg cggtggcacc gcaagggggg 458
ctgggctggc cgaaagattc ggccactacc ccccatgaag agttacgtga agctgccttc 518
tgcttctgcc aaagctgata tcc 541



<210> 380 
<211> 545 
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 80..355

<220>
<221> sig_peptide
<222> 80..124
<223> Von Heijne matrix
score 4.59999990463257
seq VLAGSLLGPTSRS/AA

<400> 380 

cttttccttt gttccagggc tggagcggct ctgggctccg gaatcgcccg cagccggtac 60 
tgcgggaccc actgcggat atg gct gtc ttg gct gga tcc ctg ttg ggc ccc 112
Met Ala Val Leu Ala Gly Ser Leu Leu Gly Pro
-15 -10 -5

acg agt agg tcg gca gcg ttg ctg ggt ggc agg tgg ctc cag ccc cgg 160
Thr Ser Arg Ser Ala Ala Leu Leu Gly Gly Arg Trp Leu Gln Pro Arg

1 5 10

gcc tgg ctg ggg ttc cca gac gcc tgg ggc ctc ccc acc ccg cag cag 208
Ala Trp Leu Gly Phe Pro Asp Ala Trp Gly Leu Pro Thr Pro Gln Gln

15 20 25

gcc cgg ggc aag gct cgc ggg aat gag tat cag ccg agc aac atc aaa 256
Ala Arg Gly Lys Ala Arg Gly Asn Glu Tyr Gln Pro Ser Asn Ile Lys

30 35 40

cgc aag aac aag cac ggc tgg gtc cgg cgc ctg agc acg ccg gcc ggc 304
Arg Lys Asn Lys His Gly Trp Val Arg Arg Leu Ser Thr Pro Ala Gly
45 50 55 60

gtg cag gtc atc ctt cgc cga atg ctc aag ggc cgc aag tcg ctg agc 352
Val Gln Val Ile Leu Arg Arg Met Leu Lys Gly Arg Lys Ser Leu Ser

65 70 75

cat tgaggatcgc gacgcagtcg gcgggaccct catggaagca tcgccctcgc 405
His

ctcggacctt gcctggcgct atttttgcag ggagctgggg agcaggaacg cctcggacct 465
gagtgctctc catattgtgg ggttgaagtc tggatgggag cttgccaagt ccctttttag 525
gctttttaat taggaagcat 545

<210> 381 
<211> 498 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 137..397

<220>
<221> sig_peptide
<222> 137..253
<223> Von Heijne matrix
score 4.59999990463257
seq LQGARVLLTGANA/GV

<400> 381 

ggggtgcggt cggggacccg gcaggaggcg gccgagaaga gaggaccgtg ggggcgttcg 60
cgtggctccc agcccgggac cccacccccg ctggacagtg ggggaaactg aggcctgagc 120
gggcccacac aggacc atg aag gtg ctt ctc ctc aca ggg ctg ggg gcc ctg 172
Met Lys Val Leu Leu Leu Thr Gly Leu Gly Ala Leu
-35 -30
ttc ttc gcc tat tat tgg gat gac aac ttc gac cca gcc agc ctc cag 220
Phe Phe Ala Tyr Tyr Trp Asp Asp Asn Phe Asp Pro Ala Ser Leu Gln
-25 -20 -15




gga gcg cga gtg ctg ctg aca ggg gcc aac gct ggt gtt ggt gag gag 268
Gly Ala Arg Val Leu Leu Thr Gly Ala Asn Ala Gly Val Gly Glu Glu
-10 -5 1 5




ctg 


gcc 


tat 


cac 


tac 


gcg 


cgt 


ctg 


ggc 


tec 




ctg 


gtg 


etc 


act 




316 










10 










15 


His 


Leu 


Val 


Leu 


Thr 
20 


Ala 




cac act gag gct ctc ctg cag aag gca cgg tgg ctc acg ctt gta gtc 364
His Thr Glu Ala Leu Leu Gln Lys Ala Arg Trp Leu Thr Leu Val Val
25 30 35




age 




ttg 


gga 


ggc 




gga 


aag 


tgg 


ate 


ace tgaggteagg agttcaagae 


417 


Ser 


Thr 


Leu 


Gly 


Gly 


Arg 


Gly 


Lys 


Trp 


lie 


Thr 















40 45 



cagcctggcc aacatggcga aacctggtct ctactaagaa gacaaaaatt aaggccaggc 477 
atggtggctc atgcctgtaa t 498

<210> 382 
<211> 459 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 43..417

<220>
<221> sig_peptide
<222> 43..87
<223> Von Heijne matrix
score 4.59999990463257
seq LLTHNLLSSHVRG/VG

<400> 382 

acctttttcc ggttccggcc tggcgagagt ttgtgcggcg ac atg aaa ctg ctt 54
Met Lys Leu Leu
-15










acc cac aat ctg ctg agc tcg cat gtg cgg ggg gtg ggg tcc cgt ggc 102
Thr His Asn Leu Leu Ser Ser His Val Arg Gly Val Gly Ser Arg Gly
-10 -5 1 5




ttc 


cec 


ctg 


cgc 


etc 


cag 


gcc 




gag 


gtc 


cgt 


ate 


tgc 


cct 


gtg 


gaa 


150 


Phe 


Pro 


Leu 


Arg 


Leu 
10 


Gin 


Ala 


Thr 


Glu 


Val 
15 


Arg 


He 


Cys 


Pro 


Val 
20 


Glu 




ttc 


aac 






ttc 


gtg 


gcg 


cgt 


atg 


ata 


cet 


aaa 


gtg 


gag 


tgg 


teg 


198 


Phe 


Asn 


Pro 


Asn 
25 


Phe 


Val 


Ala 


Arg 


Met 
30 


lie 


Pro 


Lys 


Val 


Glu 
35 


Trp 


Ser 




gcg ttc ctg gag gcg gcc gat aac ttg cgt ctg atc cag gtg ccg aaa 246
Ala Phe Leu Glu Ala Ala Asp Asn Leu Arg Leu Ile Gln Val Pro Lys
40 45 50




ggg ccg gtt gag gga tat gag gag aat gag gag ttt ctg agg acc atg 294
Gly Pro Val Glu Gly Tyr Glu Glu Asn Glu Glu Phe Leu Arg Thr Met
55 60 65




cac cac ctg ctg ctg gag gtg gaa gtg ata gag ggc acc ctg cag tgc 342
His His Leu Leu Leu Glu Val Glu Val Ile Glu Gly Thr Leu Gln Cys
70 75 80 85




ccg 


gaa 


tct 


gga 


cgt 


atg 


ttc 


ccc 


ate 


age 


cgc 


ggg 


ate 


cem 




atg 


390 


Pro 


Glu 


Ser 


Gly 


Arg 

90 


Met 


Phe 


Pro 


He 


Ser 
95 


Arg 


Gly 


He 


Pro 


Xaa 
100 


Met 




ctg 


ctg 


agt 




gag 


gaa 


act 


gag 


agt 


tgattgtgse aggegceagt 




437 


Leu 


Leu 


Ser 


Glu 


Glu 


Glu 


Thr 


Glu 


Ser 



















105 110 
ttttcttgty atgactgtgt at 459 
<210> 383
<211> 533
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 82..237

<220>
<221> sig_peptide
<222> 82..147
<223> Von Heijne matrix
score 4.5
seq FFLLITIALGTKT/ES
<400> 383 

aawtttctca acaattcctc accgcagagc ctctgaagct cccaccaggc cagctctcct 60 
cccacaacag cttcccacag c atg aag atc tcc gtg gct gcc att ccc ttc 111
Met Lys Ile Ser Val Ala Ala Ile Pro Phe
-20 -15
ttc ctc ctc atc acc atc gcc cta ggg acc aag act gaa tcc tcc tca 159
Phe Leu Leu Ile Thr Ile Ala Leu Gly Thr Lys Thr Glu Ser Ser Ser

-10 -5 1

cgg gga cct tac cac ccc tca gag tgc tgc ttc acc tac act acc tac 207
Arg Gly Pro Tyr His Pro Ser Glu Cys Cys Phe Thr Tyr Thr Thr Tyr
5 10 15 20

aag atc ccg cgt cag awt tat gga tta cta tgagaccaac agccagtgct 257
Lys Ile Pro Arg Gln Xaa Tyr Gly Leu Leu

25 30
ccaagcccgg aattgtcttc atcaccaaaa ggggccattc cgtctgtacc aaccccagtg 317
acaagtgggt ccaggactat atcaaggaca tgaaggagaa ctgagtgacc cagaaggggt 377
ggcgaaggca cagctcagag acataaagag aagatgccaa ggccccctcc tccacccacc 437
gctaactctc agccccagtc accctcttgg agcttccctg ctttgaatta aagaccactc 497
atgctctcca aaaaaaaaaa aaaaaggtca aaagct 533

<210> 384 
<211> 472 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 92..307

<220>
<221> sig_peptide
<222> 92..217
<223> Von Heijne matrix
score 4.40000009536743
seq LRLEVAAAPGTPA/QP

<400> 384 

aaaagaaatt cccgggcccg gccagcctgt gccgctgtta tgaggagaag caaagcccct 60 
tgcagaagca cttgcggtct gcagcctacg c atg aat agg ttg gca ggt gtg 112 
Met Asn Arg Leu Ala Gly Val 
-40 

ggc tgg cgg gtg gac tac acc ctg agc tcc agc ctg ctg caa tcc gtg 160
Gly Trp Arg Val Asp Tyr Thr Leu Ser Ser Ser Leu Leu Gln Ser Val
-35 -30 -25 -20

gaa gag ccc atg gtg cac ctg cgg ctg gag gtg gca gct gcc cca ggg 208
Glu Glu Pro Met Val His Leu Arg Leu Glu Val Ala Ala Ala Pro Gly
-15 -10 -5
acc cca gcc cag cct gtt gcc atg tcc ctc tca gca gac aag ttc cag 256
Thr Pro Ala Gln Pro Val Ala Met Ser Leu Ser Ala Asp Lys Phe Gln

1 5 10

gtc ctc ctg gca gaa ctg aag cag gcc cag acc ctg atg agc tcc ctg 304
Val Leu Leu Ala Glu Leu Lys Gln Ala Gln Thr Leu Met Ser Ser Leu

15 20 25

ggc tgaggagaag ggtgttccag gcctgtgtgg agccgccctg cccgtatgga 357
Gly
30

gtcacgccct ctgaactgct cttcgggagg cagccctggt tctaggatgc tgaggccctg 417
gcccggactc tggcctccca gatccccagc tgcctcactt ctctcttgag aactt 472



<210> 385 
<211> 514 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 113..364

<220>
<221> sig_peptide
<222> 113..172
<223> Von Heijne matrix
score 4.30000019073486
seq SLLLSLPPHQGLT/FS



<400> 385 

ttttttacat ggtgttccca cagctgggag gacacccaca tggtcggcgt gcaggatatt 60
tcgctggacc ctagaaaagc caccacgacc tgtgggccat gatgctaccc ca atg gct 118
Met Ala
-20

gct gct gct gtt cct tct ctt ctt ctt tct ctt cct cct cac cag ggg 166
Ala Ala Ala Val Pro Ser Leu Leu Leu Ser Leu Pro Pro His Gln Gly
-15 -10 -5




ete 


act 


ttc 


tec 




aaa 


ata 


caa 


cct 


ttt 


gga 






gga 


gtc 


ttg 


214 


Leu 


Thr 


Phe 
1 


Ser 


Asn 


Lys 


He 
5 


Gin 


Pro 


Phe 


Gly 


Ala 
10 


Gin 


Gly 


Val 


Leu 




cat ccg gaa cca gga ctg cga gac tgg ctg ctg cca acg tgc tcc aga 262
His Pro Glu Pro Gly Leu Arg Asp Trp Leu Leu Pro Thr Cys Ser Arg
15 20 25 30




caa ttg cga gtc gca ctg ccg gag aag ggg tcc gag ggc agt ctg tgt 310
Gln Leu Arg Val Ala Leu Pro Glu Lys Gly Ser Glu Gly Ser Leu Cys
35 40 45




caa acg cag ctg cca gct act cca tgc ttc ctg cct tcg aat acg gtg 358
Gln Thr Gln Leu Pro Ala Thr Pro Cys Phe Leu Pro Ser Asn Thr Val
50 55 60





aga acg tgaagtcatg agctgctgct aaggcatgtg gcaaccttga agagaaggtc 414
Arg Thr

aagagctacc agccaccaaa agaatgccag cacttcctgt gtctttgctt tggattcatg 474

agaaatatac gttcctattt gcttcaaaaa aaaaaaaaaw 514



<210> 386 
<211> 508 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 83..244

<220>
<221> sig_peptide
<222> 83..139
<223> Von Heijne matrix
score 4.19999980926514
seq LLILSKVLDDTFQ/SV

<400> 386 

actgctatag cgatataact cagatgttct caggacagca agtttacatg tgaatagaga 60 
gggagggacg tgttaggaag ac atg aaa ttc aat ttt gta ctt ttg ata ctt 112 
Met Lys Phe Asn Phe Val Leu Leu Ile Leu
-15 -10
tcc aaa gtt cta gat gac act ttc caa agt gtc aag aaa tgg tta aat 160
Ser Lys Val Leu Asp Asp Thr Phe Gln Ser Val Lys Lys Trp Leu Asn

-5 1 5
tat ttt cag ttt act tta aga aat ggt tta atg tgg cca ggt gcg gtg 208
Tyr Phe Gln Phe Thr Leu Arg Asn Gly Leu Met Trp Pro Gly Ala Val

10 15 20

gct cat gcc tgt aat ccc agc act ggc tca cgc ctg taatcccagc 254
Ala His Ala Cys Asn Pro Ser Thr Gly Ser Arg Leu

25 30 35

actttgggag tccgaggtgg gcagatcatg aggtcaggag atcgagacca tcctggctaa 314
catggtgaaa ccccgtctct actaaaaata caaaaaaatt agccaggcat ggtggcgggc 374
gcctgtagtc ccagctactc gggaggctga ggcaggagaa tggcgtgaac ctgggaggcg 434
gagcttacag tgagccgaga tcgcgccact gcactccagc ctgggcgaca gagcgagact 494
ctgttcaaaa aaaa 508 

<210> 387 
<211> 505 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 55..327

<220>
<221> sig_peptide
<222> 55..111
<223> Von Heijne matrix
score 4.19999980926514
seq FVLLLSEIVSISA/LS

<400> 387 

ttgtctttgg tagttttttt gcactaactt caggaaccag ctcatgatct cagg atg 57

Met

tat gga aaa ata atc ttt gta tta cta ttg tca gaa att gtg agc ata 105
Tyr Gly Lys Ile Ile Phe Val Leu Leu Leu Ser Glu Ile Val Ser Ile

-15 -10 -5

tca gca tta agt acc act gag gtg gca atg cac act tca acc tct tct 153
Ser Ala Leu Ser Thr Thr Glu Val Ala Met His Thr Ser Thr Ser Ser
1 5 10



tca gtc aca aag agt tac atc tca tca cag aca aat gga gaa ayg gga
Ser Val Thr Lys Ser Tyr Ile Ser Ser Gln Thr Asn Gly Glu Xaa Gly
15 20 25 30


eaa 


ctt 


gtc 


cat 




ttc 


act 


gta 


cca 


get 


cct 


gta 


gtg 


ata 


ata 




Gin 


Leu 


Val 


His 


Arg 


Phe 


Thr 


Val 


Pro 


Ala 


Pro 


Val 


Val 


He 


He 


Leu 










35 










40 










45 




att att ttg tgt gtg atg gct ggt att att gga acg atc ctc tta att
Ile Ile Leu Cys Val Met Ala Gly Ile Ile Gly Thr Ile Leu Leu Ile
50 55 60






tot 


tac 


agt 


att 


cgc 


cga 


etg 


ata 


aag 


gea 


tgaggatgtg geetgeatge 


Ser 


Tyr 


Ser 


He 


Arg 




Leu 


He 




Ala 
tgcctgatct tgcctagaac cggctgcacc tgctgttctc ttgtttatgc aaactggctg 407

cacctgctat tcctttgctt atgcccctac ccctggctat cctaattccc tgttctcctg 467
cctcactatt actgtattct ctacttctaa ataaaaat 505 

<210> 388 
<211> 775 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 395..736

<220>
<221> sig_peptide
<222> 395..490
<223> Von Heijne matrix
score 4.19999980926514
seq LLGSGLFVFSLTA/FN

<400> 388 

atgatgggta acaggaccgg tgggstcccc aggaagtcct agagggggtc ggggtttggg 60 

tggacaagct ttcctcgtcc tctcccgaca gagctgacgt gtcctgggtt ccaccgggag 120 

cgggcatttc caccggacgg gagggttcgg ggtgtccggg gctggggaat acgtaggggt 180 

tgccgcgcgg tgtggggagt tggggcgtgt ggctgcagtc ccgggagttc ttggaggggg 240 

tcggcccacc gagcttccgg accggctgat ctgcccgtag cttgccggag gagggcggas 300 

tgactctccg tcccttctcc catcccctcc agtggtgggt acgggcacct cgctggcgct 360 

ctcctccctc ctgtccctgc tgctctttgc tggg atg cag atg tac agc cgt cag 415
Met Gln Met Tyr Ser Arg Gln
-30 



ctg 


gcc 


tec 




gag 


tgg 


etc 


acc 


ate cag ggc ggc 


ctg 


ctt 


ggt 


teg 


463 


Leu 


Ala 


Ser 


Thr 


Glu 


Trp 


Leu 


Thr 


He Gin Gly Gly 


Leu 


Leu 


Gly 


Ser 




-25 










-20 






-15 








-10 




ggt ctc ttc gtg ttc tcg ctc act gcc ttc aat aat ctg gag aat ctt 511
Gly Leu Phe Val Phe Ser Leu Thr Ala Phe Asn Asn Leu Glu Asn Leu
-5 1 5




gtc 


ttt 


ggc 




gga 


ttc 






aag ate ttc cct 


gag 


att 


etc 


ctg 


559 


Val 


Phe 


Gly 


Lys 


Gly 


Phe 


Gin 


Ala 


Lys He Phe Pro 


Glu 


He 


Leu 


Leu 








10 










15 




20 










tgc ctc ctg ttg gct ctc ttt gca tct ggc ctc atc cac cga gtc tgt 607
Cys Leu Leu Leu Ala Leu Phe Ala Ser Gly Leu Ile His Arg Val Cys
25 30 35












gtc acc acc tgc ttc atc ttc tcc atg gtt ggt ctg tac tac atc aac 655
Val Thr Thr Cys Phe Ile Phe Ser Met Val Gly Leu Tyr Tyr Ile Asn
40 45 50 55




aag atc tcc tcc acc ctg tac cag gca gca gct cca gtc ctc aca cca 703
Lys Ile Ser Ser Thr Leu Tyr Gln Ala Ala Ala Pro Val Leu Thr Pro
60 65 70






gcc aag gtc aca ggc aag agc aag aag aga aac tgaccctgaa tgttcaataa 756
Ala Lys Val Thr Gly Lys Ser Lys Lys Arg Asn
75 80
agttgattct ttgtaaaaa 775 



<210> 389 
<211> 466 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 166..324

<220>
<221> sig_peptide
<222> 166..204
<223> Von Heijne matrix
score 4.09999990463257
seq MAVAFVLSLGVAA/LY



<400> 389 

attctgcgcc tgcgcgcggc tacagcacgg ttcgtttttc ctttagtcag gaaggacgtt 60 
ggtgttgagg ttagcatacg tatcaaggac agtaactacc atggctcccg aagttttgcc 120 
aaaacctcgg atgcgtggcc ttctggcagg cgtctgcgaa atcat atg gct gta gca 177

Met Ala Val Ala 
-10 



ttc gtg cta tcc ctg ggg gtt gca gct ttg tat aag ttt cgt gtg gct 225
Phe Val Leu Ser Leu Gly Val Ala Ala Leu Tyr Lys Phe Arg Val Ala
-5 1 5

gat caa aga aag aag gca tac gca gat ttc tac aga aac tac gat gtc 273
Asp Gln Arg Lys Lys Ala Tyr Ala Asp Phe Tyr Arg Asn Tyr Asp Val
10 15 20

atg aaa gat ttt gag gag atg agg aag gct ggt atc ttt cag agt gta 321
Met Lys Asp Phe Glu Glu Met Arg Lys Ala Gly Ile Phe Gln Ser Val
25 30 35

aag taatcttgga atataaagaa tttcttcagg ttgaattacc tagaagtttg 374
Lys
40

tcactgactt gtgttcctga actatgacac atgaatatgt gggctaagaa atagttcctc 434
ttgataaata aacaattaac aaatactttt ga 466



<210> 390 
<211> 522 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 94..417

<220>
<221> sig_peptide
<222> 94..213
<223> Von Heijne matrix
seq VTAPSMVAPVTFA/SI



<400> 390 

ctgtaatctc tgcgaaagct tccaggttcg ctcatcgtag gtcactggaa aaaattgtag 
cattgctaaa gagtatttca gagaaaattt tgc atg caa aaa ttt agg aag atg 
Met Gln Lys Phe Arg Lys Met
-40 -35
agt gaa act cat cat tct gta atc tct gtg aaa gct tcc agt ccc tgg
Ser Glu Thr His His Ser Val Ile Ser Val Lys Ala Ser Ser Pro Trp

-30 -25 -20

cta tct tct tca gtg act gct cca tcc atg gta gcc cca gtc act ttt
Leu Ser Ser Ser Val Thr Ala Pro Ser Met Val Ala Pro Val Thr Phe

-15 -10 -5

gca tct att gta gaa gaa gaa cta caa caa gaa gca gct ctt att aga
Ala Ser Ile Val Glu Glu Glu Leu Gln Gln Glu Ala Ala Leu Ile Arg

1 5 10 15

agt cga gaa aaa ccg ttg gct ctg att cag att gag gag cat gcc ata
Ser Arg Glu Lys Pro Leu Ala Leu Ile Gln Ile Glu Glu His Ala Ile

20 25 30

caa gat tta ttg gtt ttc tat gag gca ttt ggc aac cct gaa gag ttt
Gln Asp Leu Leu Val Phe Tyr Glu Ala Phe Gly Asn Pro Glu Glu Phe
gtc att gtt gaa agg aca ccg cag gga cca ctg gca gta cct atg tgg 402

Val Ile Val Glu Arg Thr Pro Gln Gly Pro Leu Ala Val Pro Met Trp

50 55 60 

aat aag cat gga tgc tagttcactg tggagttgag atgcatttta cataattatg 457 
Asn Lys His Gly Cys 
65 

agtttgttca tataaagaaa gctgtggaaa agagtcttag agatttgaat atcgtctaaa 517 
tagat 522 



<210> 391 
<211> 409 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 9..389

<220>
<221> sig_peptide
<222> 9..149
<223> Von Heijne matrix
score 3.90000009536743
seq LSSDLQQVGGASA/RI



<220> 

<221> misc_feature 

<222> 8,403 

<223> n=a, g, c or t 



tgggagtn atg ttc acg cct mtg ama gtg aaa tam gcg tac tac gac act 50
Met Phe Thr Pro Xaa Xaa Val Lys Xaa Ala Tyr Tyr Asp Thr 
-45 -40 -35 



gaa cgc atc gga gtt gac mtg atc atg aag acc tgm ttt agc ccc aac 98
Glu Arg Ile Gly Val Asp Xaa Ile Met Lys Thr Xaa Phe Ser Pro Asn
-30 -25 -20




aga gtg att gga ctc tca agt gac ttg cag caa gta gga ggg gca tca 146
Arg Val Ile Gly Leu Ser Ser Asp Leu Gln Gln Val Gly Gly Ala Ser
-15 -10 -5










cgc 






gat 


gcc 


ctg 


agt 




gtg 


ttg 




tat gca 


gag 


gat 


194 


Ala 


Arg 
1 


He 


Xaa 


Asp 


Ala 
5 


Leu 


Ser 


Thr 


Val 


Leu 
10 


Gin 


Tyr Ala 


Glu 


Asp 
15 




gta 


ctg 


tct 


gga 


aag 


gtg 


tea 


get 


gac 




act 


gtg 


ggc cgc 


ttc 


ctg 


242 


Val 


Leu 


Ser 


Gly 


Lys 
20 


Val 


Ser 


Ala 


Asp 


Asn 
25 


Thr 


Val 


Gly Arg 


Phe 

30 


Leu 




atg 






gtt 




caa 


gta 


ccg 


aaa 


ata 


gtt 




gat gac 


ttt 


gag 


290 


Met 


Ser 




Val 
35 


Asn 


Gin 


Val 


Pro 


Lys 
40 


He 


Val 


Pro 


Asp Asp 
45 


Phe 


Glu 




acc atg ctc aac agc aac atc aat gac mtt ttg atg gtg acc tac mtg 338
Thr Met Leu Asn Ser Asn Ile Asn Asp Xaa Leu Met Val Thr Tyr Xaa
50 55 60








gcc aac ctc aca cag tca cag att gca ctc aat gaa aaa ctt gta aac 386
Ala Asn Leu Thr Gln Ser Gln Ile Ala Leu Asn Glu Lys Leu Val Asn
65 70 75





ctg tgaatggacc ccangcagta 409
Leu 



<210> 392 
<211> 431 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS
<222> 60..314

<220>
<221> sig_peptide
<222> 60..155
<223> Von Heijne matrix
score 3.79999995231628
seq LAHIQALISGIEA/QL

<220> 

<221> misc_feature 
<222> 381 

<223> n=a, g, c or t 
<400> 392 

cataagaaca gggaggttag aagtagggtc ttgagattga gctgcagtca cagctgagc 59 



atg 


aaa 


get 


gcc 


ttg 


gaa 


gac 




ctg 


gca 


gaa 


acg 


gag 


gcg 


cgc 


ttt 


107 


Met 


Lys 


Ala 
-30 


Ala 


Leu 


Glu 


Asp 


Thr 
-25 


Leu 


Ala 


Glu 


Thr 


Glu 
-20 


Ala 


Arg 


Phe 




gga gcc cag ctg gcg cat atc cag gcg ctg atc agc ggt att gaa gcc 155
Gly Ala Gln Leu Ala His Ile Gln Ala Leu Ile Ser Gly Ile Glu Ala
-15 -10 -5




cag ctg ggc gat gtg cga gct gat agt gag cgg cag aat cag gag tac 203
Gln Leu Gly Asp Val Arg Ala Asp Ser Glu Arg Gln Asn Gln Glu Tyr
1 5 10 15






cag cgg ctc atg gac atc aag tcg cgg ctg gag cag gag att gcc acc 251
Gln Arg Leu Met Asp Ile Lys Ser Arg Leu Glu Gln Glu Ile Ala Thr
20 25 30




tac cgc agc ctg ctc gag gga cag gaa gat cac tac aac aat ttg tct 299
Tyr Arg Ser Leu Leu Glu Gly Gln Glu Asp His Tyr Asn Asn Leu Ser
35 40 45





gcc tcc aag gtc ctc tgaggcagca ggctctgggg cttctgctgt cctttggagg 354
Ala Ser Lys Val Leu
50

gtgtcttctg ggtagaggga tgggaangaa gggaccctta cccccggctc ttctcctgac 414
ctgccaataa aaattta 431

<210> 393 
<211> 551 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 232..534

<220>
<221> sig_peptide
<222> 232..306
<223> Von Heijne matrix
score 3.70000004768372
seq AKTCLVLCSRVLS/VI

<400> 393 

tatcactgtt acgaaccaag gatttacaga tcactggcaa aaattctgag aactttcaca 60 
ccagtatact gtccaagccc attaagtggc atcacacctc tcttttatgt agctcagaca 120 
agacagtcta atatcttcaa aatactactg caatatggaa tcttagaaag agaaaaaaac 180
cctatcaaca ttgtcttaac aatagtactc tacccttcga gagtaagagt a atg gtt 237

Met Val 
-25 
gat


cgt 


gaa 


ttg 


get 


gae 


ate 


eat 


gaa 


Asp 


Arg 


Glu 


Leu 


Ala 


Asp 


He 


His 


Glu 








-20 










-15 


eta 


tgt 


tec 


aga 


gtg 


ett 


tct 


gtc 


att 


Leu 


Cys 


Ser 
-5 


Arg 


Val 


Leu 


Ser 


Val 
1 


He 


cag 


ctg 


agt 


tta 


gga 


aga 


cat 


cca 


att 


Gin 


Leu 


Ser 


Leu 


Gly 


Arg 


His 


Pro 


He 


10 










15 








att 


cct 


tea 


aca 


aga 


tac 


aaa 


gat 


cca 


He 


Pro 


Ser 


Thr 


Arg 


Tyr 


Lys 


Asp 


Pro 










30 










aga 


eta 


acc 


ate 


agg 


aat 


caa 


eta 


tta 


Arg 


Leu 


Thr 


He 


Arg 


Asn 


Gin 


Leu 


Leu 








45 










50 


gga 


ata 


ttt 


tea 


ctt 


eta 


att 


cct 


get 


Gly 


lie 


Phe 


Ser 


Leu 


Leu 


He 


Pro 


Ala 






60 










65 





tta gaa atc taacatacgt cagtgtc
Leu Glu Ile
75
gat gcc aaa aca tgt ttg gta 285

Asp Ala Lys Thr Cys Leu Val
-10

tca gtc aag gaa ata aag aca 333

Ser Val Lys Glu Ile Lys Thr
5

att tca aat tgg ttt gat tac 381

Ile Ser Asn Trp Phe Asp Tyr

20 25

tgt gaa cta tta cat ctt tgc 429

Cys Glu Leu Leu His Leu Cys

35 40

acc aac aat atg ctc cca gat 477

Thr Asn Asn Met Leu Pro Asp
55

cgt cta caa aac tat ctg aat 525

Arg Leu Gln Asn Tyr Leu Asn
70

551 



<210> 394 
<211> 529 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 78..281

<220>
<221> sig_peptide
<222> 78..218
<223> Von Heijne matrix
score 3.70000004768372
seq PLLSLHSRGGSSS/ES



<400> 394 

ttttttgcga acggcgagca gcggcgcgcg cggagagacg agcggaggtt ttcctggttt 60

cggaccccag cggccgg atg gtg aaa tcc tcc ctg cag cgg atc ctc aat 110
Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn
-45 -40 





cac 


tgc 


ttc 


gcc 


aga 


gag 






ggg 


gat 










acc 


158 


Ser 


His 


Cys 


Phe 


Ala 


Arg 


Glu 




Glu 


Gly 


Asp 


Lys 


Pro 


Ser 


Ala 


Thr 






-35 










-30 










-25 












atc cac gcc agc cgc acc atg ccg ctc cta agc ctg cac agc cgc ggc 206
Ile His Ala Ser Arg Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly
-20 -15 -10 -5




ggc agc agc agt gag agt tcc agg gtc tcc ctc cac tgc tgt agt aac 254
Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn
1 5




ccg 


ggt 


ccg 


ggg 




egg 


tgg 


tgc 


tec 


tgatgcccct cacccacccc 




301 


Pro 


Gly 


Pro 


Gly 


Pro Arg 


Trp 


Cys 


Ser 



















15 20 

tgaagatccc aggtgggcga gggaatagtc agagggatca caatctttca gctaacttat 361

tctactccga tgatcggctg aatgtaacag aggaactaac gtccaacgac aagacgagga 421

ttctcaacgt ccagtccagg ctcacagacg ccaaacgcat taactggcga acagtgctga 481

gtggcggcas tcctacatcg agatcccggg cggcgcgctg ccgagggg 529



<210> 395 
<211> 491 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 80..283

<220>
<221> sig_peptide
<222> 80..220
<223> Von Heijne matrix
score 3.70000004768372
seq PLLSLHSRGGSSS/ES

<400> 395 

aktttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 60 
ttcggacccc agcggccgg atg gtg aaa tcc tcc ctg cag cgg atc ctc aat 112
Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn
-45 -40 



agc cac tgc ttc gcc aga gag aag gaa ggg gat aaa ccc agc gcc acc 160
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr
-35 -30 -25










atc cac gcc agc cgc acc atg ccg ctc cta agc ctg cac agc cgc ggc 208
Ile His Ala Ser Arg Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly
-20 -15 -10 -5




ggc agc agc agt gag agt tcc agg gtc tcc ctc cac tgs tgt agt aac 256
Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu His Xaa Cys Ser Asn
1 5




ccg ggt ccg ggg cct cgg tgg tgc tcc tgatgcccct cacccacccc 303
Pro Gly Pro Gly Pro Arg Trp Cys Ser
15 20

















tgaagatccc aggtgggcga gggaatagtc agagggatca caatctttca gctaacttat 363 

tctactccga tgatcggctg aatgtaacag aggaactaac gtscaacgac aagacgagga 423

ttctcaacgt tcagtccagg ctcacaggaa cgccaaacgc attaactgcc gaacagtgct 483

gagtggcg 491

<210> 396 
<211> 503 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 9..395

<220>
<221> sig_peptide
<222> 9..56
<223> Von Heijne matrix
score 3.59999990463257
seq LLLLLRALRRGPG/PG

<400> 396 

aggccaac atg gcc gtg ctg ctg ctg ctg ctc cgt gcc ctc cgc cgg ggt 50
Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly
-15 -10 -5

cca ggc ccg ggt cct cgg ccg ctg tgg ggc cca ggc ccg gcc tgg agt 98
Pro Gly Pro Gly Pro Arg Pro Leu Trp Gly Pro Gly Pro Ala Trp Ser

1 5 10
cca ggg ttc ccc gcc
Pro Gly Phe Pro Ala

15 20 25 30

agg cct ccg ggg gac ctc gcc gag gct gga ggc cga gct ctg cag agc
Arg Pro Pro Gly Asp Leu Ala Glu Ala Gly Gly Arg Ala Leu Gln Ser
tta caa ttg aga ctg cta acc cct acc ttt gaa ggg atc aac gga ttg

Leu Gln Leu Arg Leu Leu Thr Pro Thr Phe Glu Gly Ile Asn Gly Leu

50 55 60

ttg ttg aaa caa cat tta gtt cag aat cca gtc aga ctc tgg caa ctt

Leu Leu Lys Gln His Leu Val Gln Asn Pro Val Arg Leu Trp Gln Leu

65 70 75

tta ggt ggt act ttc tat ttt aac acc tca agg ttg aag cag aag aat

Leu Gly Gly Thr Phe Tyr Phe Asn Thr Ser Arg Leu Lys Gln Lys Asn

80 85 90

aag gag aag gat aag tcg aag ggg aag gcg cct gaa gag gac gaa ggt

Lys Glu Lys Asp Lys Ser Lys Gly Lys Ala Pro Glu Glu Asp Glu Gly
95 100 105 110
ata ttc atc tgatgttctt cagtcagtag ctgcctctgg atgtctttac
Ile Phe Ile

atttctgttt wccttttagc aaggtgaaac cagtctggac aatggggaga tgggccgggt
gcagtggc



<210> 397 
<211> 517 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 93..419

<220>
<221> sig_peptide
<222> 93..191
<223> Von Heijne matrix
score 3.59999990463257
seq LAVFQMLKSMCAG/QR



<400> 397 

gaaaaaaagc gaaggccggc cgggcgggga agggaaatgg cgaggcagga gtgcggggga 60
gggastggtc cttagctgaa tgcgcctgcg tt atg gcg gcc tcc ggc gcc cca 113
Met Ala Ala Ser Gly Ala Pro
-30

agg atc ctg gtg gac ctg ctg aag ctg aac gtg gcc ccc ctc gcc gtc 161
Arg Ile Leu Val Asp Leu Leu Lys Leu Asn Val Ala Pro Leu Ala Val
-25 -20 -15

ttc cag atg ctc aag tcc atg tgt gcc ggg cag agg cta gcg agc gag 209
Phe Gln Met Leu Lys Ser Met Cys Ala Gly Gln Arg Leu Ala Ser Glu
-10 -5 1 5

ccc cag gac cct gcg gcc gtg tct ctg ccc acg tcg agc gtg ccc grg 257
Pro Gln Asp Pro Ala Ala Val Ser Leu Pro Thr Ser Ser Val Pro Xaa
10 15 20

acc cga ggg aga aac aaa ggc agc gct gcc ctc ggg gga gca ttg gcc 305
Thr Arg Gly Arg Asn Lys Gly Ser Ala Ala Leu Gly Gly Ala Leu Ala
25 30 35










ctg 


gcg 








age 




gaa 


gga 


tec 


age cag 


agg 


atg 






353 


Leu 


Ala 


Glu 


Xaa 


Xaa 


Ser 


Arg 


Glu 


Gly 


Ser 


Ser Gin 


Arg 


Met 


Pro 


Arg 






40 










45 








50 












cag 






get 




agg 


ctg 




aag 


ggg 


ggc ggg 


cct 


ggg 


aag 


age 


401 


Gin 


Pro 


Ser 


Ala 


Thr 


Arg 


Leu 


Pro 


Lys 


Gly 


Gly Gly 


Pro 


Gly 


Lys 


Ser 




55 










60 










65 








70 




cct aca cgg ggc agc acc taggatgggg cagagacttg ttgcatcttt 449
Pro Thr Arg Gly Ser Thr
75

gtccccagca aaggctacat gttacctcct tcaattgata ataaaccttt ctgagatgag 509
agggtcca 517

<210> 398
<211> 507 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 78..386

<220>
<221> sig_peptide
<222> 78..185
<223> Von Heijne matrix
score 3.59999990463257
seq RALSTFLFGSIRG/AA

<400> 398 

aagtgcgcag acgcaggggt cggcgccggg tgagagcgtg cggccgggta agggcgtgtg 60 
gccggattca ccacaac atg gca aat ctt ttt ata agg aaa atg gtg aac 110 
Met Ala Asn Leu Phe Ile Arg Lys Met Val Asn
-35 -30

cct ctg ctc tat ctc agt cgt cac acg gtg aag cct cga gcc ctc tcc 158
Pro Leu Leu Tyr Leu Ser Arg His Thr Val Lys Pro Arg Ala Leu Ser
-25 -20 -15 -10

aca ttt cta ttt gga tcc att cga ggt gca gcc ccc gtg gct gtg gaa 206
Thr Phe Leu Phe Gly Ser Ile Arg Gly Ala Ala Pro Val Ala Val Glu
-5 1 5

ccc ggg gca gca gtg cgc tca ctt ctc tca ccc ggc ctc ctg ccc cat 254
Pro Gly Ala Ala Val Arg Ser Leu Leu Ser Pro Gly Leu Leu Pro His
10 15 20

ctg ctg cct gcg ctg ggg ttc aaa aac aag act gtc ctt aag aag cgc 302
Leu Leu Pro Ala Leu Gly Phe Lys Asn Lys Thr Val Leu Lys Lys Arg
25 30 35

tgc aag gac tgt tac ctg gtg aag agg cgg ggt cgg tgg tac gtc tac 350
Cys Lys Asp Cys Tyr Leu Val Lys Arg Arg Gly Arg Trp Tyr Val Tyr
40 45 50 55




tgt 


aaa 


acc 


cat 


cog 


agg 


cac 


aag 


cag 


aga 




atg 


tagaecettt 




396 


Cys 


Lys 


Thr 


His 


Pro 


Arg 


His 


Lys 


Gin 




Gin 


Met 













60 65 



ccctccagag tcacgcacat actcgtcatc gcatcacttg ggagaatggt tgtatcttat 456
ggaaggaatt atcacatcaa ggagtcaggg gamagtgact gggaagcaaa c 507 

<210> 399 
<211> 493 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 58..315

<220>
<221> sig_peptide
<222> 58..165
<223> Von Heijne matrix
score 3.59999990463257
seq SVSLSGFTVGTLS/ET

<400> 399 

ctgtgggaga ggccaggatg ctgcattagg cacaggataa cctgggaacc caggcac 57 
atg ggt cct gct ctc cga agt ctg caa gtc aag aag gga aca gag cac 105
Met Gly Pro Ala Leu Arg Ser Leu Gln Val Lys Lys Gly Thr Glu His
-35 -30 -25

gcc gac act ctc cct ttc ccc tct gtc tct ctt agt ggc ttt aca gtg 153
Ala Asp Pro Leu Pro Phe Pro Ser Val Ser Leu Ser Gly Phe Thr Val
-20 -15 -10 -5

ggt acc ctg tca gaa acc agc act ggg ggc cct gcc acc ccc aca tgg 201
Gly Thr Leu Ser Glu Thr Ser Thr Gly Gly Pro Ala Thr Pro Thr Trp
1 5 10

aag gag tgt cct atc tgt aag gag cgc ttt cct gct gag agt gac aag 249
Lys Glu Cys Pro Ile Cys Lys Glu Arg Phe Pro Ala Glu Ser Asp Lys
15 20 25

gat gcc ctg gag gac cac atg gat gga cac ttc ttt ttc agc acc cag 297
Asp Ala Leu Glu Asp His Met Asp Gly His Phe Phe Phe Ser Thr Gln
30 35 40

gac ccc ttc acc ttt gag tgatcttact ccctcgtaca tgcacaaata 345
Asp Pro Phe Thr Phe Glu
45 50

cacactcatg cacacacaca ctcacacaca tgcatacact taggtttcat gcccattttc 405
tatcacactg ggctccatga tattctgttc cctaagaact gcttctgtgt gccctgtttt 465
catcccaaga tttctcactt catcctct 493

<210> 400 
<211> 769 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 506..661

<220>
<221> sig_peptide
<222> 506..595
<223> Von Heijne matrix
score 3.5
seq VKLLLKMVSSATT/ER

<400> 400 

cttggctctt aaagccaggt tgagggttat gtgtttttat tcttttawat ctccttttca 60
cttaggcttg gcaactagtt cattgaaaca gagccaacaa gaatgagtga gtcactcagt 120
gatatgaagc aagctgtggt catttatcta ctaccttcca tacctacttc ctaggtttga 180
taagtagtgt ttgtagtgta gttgtcagta acttttctca gcttacataa gtttatgtgg 240
aattcattta caaatgtgtg attattacga tagatgttat tttataaatt tcaaggttct 300
ttaatacagc atggacatgt ttctaaatca ataaatatag cttamccaca gactttttat 360
gggatgtaaa gtatgtatta caacaggtaa atccatttct ctgttttggg aaagcaggtt 420
aatgttttgt ttgttttgtt gctgcatata atgttgcact atatatatat kctaatgcag 480
ttggtctcct aatyctacag gataa atg tct agc tgt ggc att gtt gga tca 532
Met Ser Ser Cys Gly Ile Val Gly Ser
-30 -25

tca gtt tcg ttt cag tta gat gct gtg aaa ttg ctc ttg aaa atg gtg 580
Ser Val Ser Phe Gln Leu Asp Ala Val Lys Leu Leu Leu Lys Met Val
-20 -15 -10

tcc tct gcc acc aca gaa cgg tgt tgt aat gga agt gcc aat ttt cat 628
Ser Ser Ala Thr Thr Glu Arg Cys Cys Asn Gly Ser Ala Asn Phe His
-5 1 5 10

aaa aac ttg tgt gca aca ggt att aag aat ttt tgaggtcggg cgcagtagct 681
Lys Asn Leu Cys Ala Thr Gly Ile Lys Asn Phe
15 20

cacgcctgta atcccagcac tttgggaggc ctaggcaggt ggatcacctg aggtcaggag 741
ttccagacca gcctgaccaa catggtga 769

<210> 401
<211> 646
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 76..621

<400> 401 

attacgagat tggcttggat tctgtcggat ggacttgggg ctagctgcgg cggggctgga 60 
ggaggccaga taacc atg tca gcc aca gtt gta gat gca gtt aat gct gca 111
Met Ser Ala Thr Val Val Asp Ala Val Asn Ala Ala
1 5 10

ccc cta tcg ggg tcc aaa gaa atg agt ttg gaa gaa cca aag aag atg 159
Pro Leu Ser Gly Ser Lys Glu Met Ser Leu Glu Glu Pro Lys Lys Met
15 20 25

acc aga gag gac tgg aga aag aag aag gag cta gaa gaa cag cga aaa 207
Thr Arg Glu Asp Trp Arg Lys Lys Lys Glu Leu Glu Glu Gln Arg Lys
30 35 40

ttg ggc aat gct cct gca gaa gtt gat gaa gaa gga aaa gac atc aac 255
Leu Gly Asn Ala Pro Ala Glu Val Asp Glu Glu Gly Lys Asp Ile Asn
45 50 55 60

ccc cat att cct cag tat att tct tca gtg cca tgg tat att gat cct 303
Pro His Ile Pro Gln Tyr Ile Ser Ser Val Pro Trp Tyr Ile Asp Pro
65 70 75

tca aaa aga cct act tta aaa cac cag aga cca caa cca gaa aaa caa 351
Ser Lys Arg Pro Thr Leu Lys His Gln Arg Pro Gln Pro Glu Lys Gln
80 85 90

aag cag ttc agc tca tct gga gaa tgg tac aag agg ggt gta aaa gag 399
Lys Gln Phe Ser Ser Ser Gly Glu Trp Tyr Lys Arg Gly Val Lys Glu
95 100 105

aat tcc ata att act aag tac cgc aaa gga gca tgt gaa aat tgt ggg 447
Asn Ser Ile Ile Thr Lys Tyr Arg Lys Gly Ala Cys Glu Asn Cys Gly
110 115 120

gcc atg aca cac aaa aag aaa gac tgc ttt gag aga cct agg cga gtt 495
Ala Met Thr His Lys Lys Lys Asp Cys Phe Glu Arg Pro Arg Arg Val
125 130 135 140

gga gcc aaa ttt aca ggt act aat ata gct cca gat gaa cat gtc cag 543
Gly Ala Lys Phe Thr Gly Thr Asn Ile Ala Pro Asp Glu His Val Gln
145 150 155

cct caa ctg atg ttt gac tat gat ggg aag agg gat cgg tgg aat ggc 591
Pro Gln Leu Met Phe Asp Tyr Asp Gly Lys Arg Asp Arg Trp Asn Gly
160 165 170

tac aat cca gaa gaa cac atg aaa att gtt tgaagagtat gcaaagtgat 641
Tyr Asn Pro Glu Glu His Met Lys Ile Val
175 180

ttggc 646



<210> 402 
<211> 484 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 29..469

<400> 402 

acttggcgag tgagacgctg atgggagg atg gac gta ctg gtg tct gag tgc 52 
Met Asp Val Leu Val Ser Glu Cys 

1 5 

tcc gcg cgg ctg ctg cag cag gaa gaa gag att aaa tct ctg act gct 100
Ser Ala Arg Leu Leu Gln Gln Glu Glu Glu Ile Lys Ser Leu Thr Ala
10 15 20

gaa att gac cgg ttg aaa aac tgt ggc tgt tta gga gct tct cca aat 148
Glu Ile Asp Arg Leu Lys Asn Cys Gly Cys Leu Gly Ala Ser Pro Asn

ttg gag cag tta caa gaa gaa aat tta aaa tta aag tat cga ctg aat
Leu Glu Gln Leu Gln Glu Glu Asn Leu Lys Leu Lys Tyr Arg Leu Asn
45 50 55

att ctt cga aag agt ctt cag gca gaa agg aac aaa cca act aaa aat
Ile Leu Arg Lys Ser Leu Gln Ala Glu Arg Asn Lys Pro Thr Lys Asn
60 65 70

atg att aac att att agc cgc cta caa gag gtc ttt ggt cat gca att
Met Ile Asn Ile Ile Ser Arg Leu Gln Glu Val Phe Gly His Ala Ile
75 80 85

aag gct gca tat cca gat ttg gaa aat cct cct ctg cta gtg aca cca
Lys Ala Ala Tyr Pro Asp Leu Glu Asn Pro Pro Leu Leu Val Thr Pro
90 95 100

agt cag cag gcc aag ttt ggg gac tat cag tgt aat agt gct atg ggt
Ser Gln Gln Ala Lys Phe Gly Asp Tyr Gln Cys Asn Ser Ala Met Gly
105 110 115 120

att tct cag gtg atg tat tgt cat gac tct tgg ctg ttt gat ttt ttt
Ile Ser Gln Val Met Tyr Cys His Asp Ser Trp Leu Phe Asp Phe Phe
125 130 135

aag tat tat tat cat cat tgc cat tta cag aaa taatactatt acaag
Lys Tyr Tyr Tyr His His Cys His Leu Gln Lys
140 145



<210> 403 
<211> 345 
<212> DNA 
<213> Homo sapiens 



<400> 403 

ctttctccag ctcaagcgtc tggtatttgg tgtagggaca gcgctttttc cgggtggagc 60 

gagcgtggat ccagttcgcg gcggggttgt ttgggtcaag ttgctgctgc tgctgcggct 120

gcgaatgctg cttctcctcc tccttcagcg aacagcctgg gatcgggtgg tcgctgcaag 180

ctgagggctc cgacgagccg cccgtcccag tcgcggcccc gatccctgcc ccaggcccgg 240

ttccccccgt cgccgctgcc gccttctcct gcaggaacga gttgcacatg ttactgcttc 300

ctactaataa atgctgacct gatcaaatgg agcccaaaar aaaaa 345

<210> 404 
<211> 432 
<212> DNA 
<213> Homo sapiens 



<400> 404 

accaggactc caaaatggcg tcagttgtac
aactggggga gctgccaagc tggatcttga
agcgtttcaa agaggttact accggtacta
catctcgggg attaccatgg tgctggcatg
caagcatctc aagcacgagc ggctccgcaa 
ccccacccca cgaccttggc ccgagcccct 
atcctttcat atcctaatag gaattaacct 
aaaaaaaaaa ag 



cagtgaagga 


caagaaactt 


ctggaggtca 


60 


tgcgggactt 


cagtctagtg 


gcattttcgg 


120 




atcaatgtga 


agaaggggag 


180 


ctacgtgctc 


tttagctact 


ccttttccta 


240 


ataccactga 


agaggacaca 


ctctgcaccc 


300 


ccgtgagaac 


acaatctcaa 


tcgttgctga 


360 


ccaaataaaa 


catgactggt 


acgyraaaaa 


420 








432 



<210> 405 
<211> 428 
<212> DNA 
<213> Homo sapiens 



<400> 405 

actccgcctt ccacgtgcac ccactgcctc ttcccttctc gcttgggaac tctagtctcg 60

cctcgggttg caatggaccc caactgctcc tgtgccgctg caggtgtctc ctgcacctgc 120

gccagctcct gcaagtgcaa agagtgcaaa tgcacctcct gcaagaagag ctgctgctcc 180

tgctgccctg tgggctgtgc caagtgtgcc cagggctgca tctgcaaagg ggcatcggag 240

aagtgcagct gctgcgcctg atgtcgggac agccctgctc ccaagtacaa atagagtgac 300

ccgtaaaatc caggattttt tgttttttgc tacaatcttg acccctttgc tacattcctt 360

tttttctgtg aaatatgtga ataataatta aacacttaga caaaaaaaaa aaaaaaaggt 420

caaaagct 428

<210> 406 
<211> 391 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 196..390

<220>
<221> sig_peptide
<222> 196..279
<223> Von Heijne matrix
score 5.90000009536743
seq IILLLLKIWKSRS/CA

<400> 406 

ctcttcccgg ctccagctcc gccgccagct ccagcctttg ctccccctcc caaagtcccc 60 
tccccggagc ggagcgcacc tagggtccct cttccgtccc cccagcccag ctacccgttc 120 
agaccagcag cctcgggggg cacccccccg ccagcctgcc tccctcccgc tcagccctgc 180 
cagggttccc cagcc atg aat ctc ttc cga ttc ctg gga gac ctc tcc cac 231
Met Asn Leu Phe Arg Phe Leu Gly Asp Leu Ser His
-25 -20

ctc ctc gcc atc atc ttg cta ctg ctc aaa atc tgg aag tcc cgc tcg 279
Leu Leu Ala Ile Ile Leu Leu Leu Leu Lys Ile Trp Lys Ser Arg Ser
-15 -10 -5

tgc gcc gcc cac ccc cag ctt cct ctc tcc ttc tgt ctg tct gtc tgt 327
Cys Ala Ala His Pro Gln Leu Pro Leu Ser Phe Cys Leu Ser Val Cys
1 5 10 15

ctg tct gtc tct ctc tct ctc tct gtc tct ctc tct ctc tct ttc tct 375
Leu Ser Val Ser Leu Ser Leu Ser Val Ser Leu Ser Leu Ser Phe Ser
20 25 30

gtc tca aaa aaa aaa t 391
Val Ser Lys Lys Lys



























<210> 407 
<211> 500 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 81..500

<220>
<221> sig_peptide
<222> 81..200
<223> Von Heijne matrix
score 5.19999980926514
seq LLMRALHLLKSGC/SP



<400> 407 

aaccccaccg cctaactctc ccaactctgc agtccgtgac cttcaggtgc ccttcaaggt
cagaaaaact cggcaagaat atg gtt tct agt ttt agg gtt tct gaa cta caa
Met Val Ser Ser Phe Arg Val Ser Glu Leu Gln
-40 -35 -30

gta tta cta ggc ttt gct gga cgg aat aaa agt gga cgc aag cat gac
Val Leu Leu Gly Phe Ala Gly Arg Asn Lys Ser Gly Arg Lys His Asp
-25 -20 -15

ctc ctg atg agg gcg ctg cat tta ttg aag agc ggc tgc agc cct gcg 209
Leu Leu Met Arg Ala Leu His Leu Leu Lys Ser Gly Cys Ser Pro Ala
-10 -5 1

gtt cag att aaa atc cga gaa ttg tat aga cgc cga tat cca cga act 257
Val Gln Ile Lys Ile Arg Glu Leu Tyr Arg Arg Arg Tyr Pro Arg Thr
5 10 15

ctt gaa gga ctt tct gat tta tcc aca atc aaa tca tcg gtt ttc agt 305
Leu Glu Gly Leu Ser Asp Leu Ser Thr Ile Lys Ser Ser Val Phe Ser
20 25 30 35

ttg gat ggt ggc tca tca cct gta gaa cct gac ttg gcc gtg gct gga 353
Leu Asp Gly Gly Ser Ser Pro Val Glu Pro Asp Leu Ala Val Ala Gly
40 45 50

atc cac tcg ttg cct tcc act tca gtt aca cct cac tca cca tcc tct 401
Ile His Ser Leu Pro Ser Thr Ser Val Thr Pro His Ser Pro Ser Ser
55 60 65

cct gtt ggt tct gtg ctg ctt caa gat act aag ccc aca ttt gag atg 449
Pro Val Gly Ser Val Leu Leu Gln Asp Thr Lys Pro Thr Phe Glu Met
70 75 80

cag cag cca tct ccc cca att cct cct gtc cat cct gat gtg cag tta 497
Gln Gln Pro Ser Pro Pro Ile Pro Pro Val His Pro Asp Val Gln Leu
85 90 95

aaa 500
Lys
100



<210> 408 
<211> 497 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 35..496



<400> 408 

tcctaatcga aawctcttct gaggatccgg caag atg gca gaa gta gag cag aag 55 
Met Ala Glu Val Glu Gln Lys
1 5

aag aag cgg acc ttc cgc aag ttc acc tac cgc ggc gtg gac ctc gac 103
Lys Lys Arg Thr Phe Arg Lys Phe Thr Tyr Arg Gly Val Asp Leu Asp
10 15 20

cag ctg ctg gac atg tcc tac gag cag ctg atg cag ctg tac agt gcg 151
Gln Leu Leu Asp Met Ser Tyr Glu Gln Leu Met Gln Leu Tyr Ser Ala
25 30 35




cgc 


eag 


gcg 


gcg 


get 


gaa 




ggg 




gcg 


gcg 


gaa 


gca 


gca 


etc 


cct 


199 


Arg 


Gin 


Ala 


Ala 


Ala 


Glu 


Pro 


Gly 


Pro 


Ala 


Ala 


Glu 


Ala 


Ala 


Leu 


Pro 




40 










45 










50 










55 




get 


gaa 


gcg 




gcg 




ggc 




gaa 


gga 


ggc 


gee 


gee 


cat 


gga 


gaa 


247 


Ala 


Glu 


Ala 


Pro 


Ala 
60 


Gin 


Gly 


Gin 


Glu 


Gly 
65 


Gly 


Ala 


Ala 


His 


Gly 
70 


Glu 




gcc gga agt ggt gaa gac gca cct gcg gga cat gat cat cct acc cga 295
Ala Gly Ser Gly Glu Asp Ala Pro Ala Gly His Asp His Pro Thr Arg
75 80 85

gat ggt ggg cag cat ggt ggg cgt cta caa cgg caa gac ctt caa cca 343
Asp Gly Gly Gln His Gly Gly Arg Leu Gln Arg Gln Asp Leu Gln Pro
90 95 100

ggt gga gat caa gcc cga gat gat cgg cca cta cct ggg cga gtt ctc 391
Gly Gly Asp Gln Ala Arg Asp Asp Arg Pro Leu Pro Gly Arg Val Leu
105 110 115




cat 


cac 


eta 


caa 


gee 


egt 


aaa 


gca 


tgg 


ccg 


gee 


egg 


eat 


egg 


ggc 




439 


His 


His 


Leu 


Gin 


Ala 


Arg 


Lys 


Ala 


Trp 


Pro 


Ala 


Arg 


His 


Arg 


Gly 


Xaa 




120 










125 










130 










135 




cma 


mte 


etc 




ctt 


cat 




tct 


caa 


gta 


atg 


get 




eta 






487
Xaa Xaa Leu Xaa Leu His Pro Ser Gln Val Met Ala Gln Leu Ile Lys
140 145 150 

gcg cac atg a 497 
Ala His Met 

<210> 409 
<211> 304 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 110..304

<220>
<221> misc_feature
<222> 296..297
<223> n=a, g, c or t



<400> 409 

acgtacggca gggcggggtc ggcccgcagt ggccagctgg gagctacgag cgcggasttg 60 
cgcagaagac ccccatcagg gtgcggggtg cagttgcggc tccagggcc atg gcg gag 118
Met Ala Glu

gag cag ggc cgg gaa cgg gac tcg gtt ccc aag ccg tcg gtg ctg ttc 166
Glu Gln Gly Arg Glu Arg Asp Ser Val Pro Lys Pro Ser Val Leu Phe
5 10 15

ctc cac cca gac ctg ggc gtg ggc ggc gct gag cgg ctg gtg ctc ttg 214
Leu His Pro Asp Leu Gly Val Gly Gly Ala Glu Arg Leu Val Leu Leu
20 25 30 35

cct ttt ccc act gag aga agg ctg ctc ttt tgt act gcc ccc cgc tca 262
Pro Phe Pro Thr Glu Arg Arg Leu Leu Phe Cys Thr Ala Pro Arg Ser
40 45 50

tta aac agc ctc ccc cta aaa aaa aaa aaa att nna aaa agc 304
Leu Asn Ser Leu Pro Leu Lys Lys Lys Lys Ile Xaa Lys Ser
55 60 65



<210> 410 
<211> 553 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 155..553

<400> 410 

aatagtgagc ggcagacaga ggggtggcca gtttgttgtt ttacctaaaa ggtgttttcc 60
acatggagtg ggactgagag gcacaggtgg ctgaggggac ccgcctggga cgtgaggcgc 120
aggtcctcgg ccccttcatc tgtcagctgc agca atg gaa tac gtg ctg gaa gtg 175
Met Glu Tyr Val Leu Glu Val
1 5

aag aac tct ccg cgg cac ctc ctg aag caa ttc aca gtg tgt gac gtt 223
Lys Asn Ser Pro Arg His Leu Leu Lys Gln Phe Thr Val Cys Asp Val
10 15 20

cct ctg tat gac att tgt gac tac aac gtc tcc aga gac cga tgc cag 271
Pro Leu Tyr Asp Ile Cys Asp Tyr Asn Val Ser Arg Asp Arg Cys Gln
25 30 35










gag 


etc 


ggg 




tgc 


ttc 


tac 


gaa 


ggc 


gtc 


tgc 


tac aag 




gcg 


gtt 


319 


Glu 


Leu 


Gly 


Cys 


Cys 


Phe 


Tyr 


Glu 


Gly 


Val 


Cys 


Tyr Lys 


Lys 


Ala 


Val 




40 










45 










50 








55 






att 


tac 


ate 


cac 


gtg 


ttc 


tct 


gee 


ttg 


att 


gtg ate 


ata 


get 


ggg 


367 


Pro 


He 


Tyr 


He 


His 


Val 


Phe 


Ser 


Ala 


Leu 


He 


Val He 


He 


Ala 


Gly
60 65 70

gcc ttc gtc atc acc atc atc tac aga gtc att cag gag agc agg aaa 415
Ala Phe Val Ile Thr Ile Ile Tyr Arg Val Ile Gln Glu Ser Arg Lys
75 80 85

gaa aag gcc atc cct gtg tat gtc gcg ctg cca cag aag tcc agc gaa 463
Glu Lys Ala Ile Pro Val Tyr Val Ala Leu Pro Gln Lys Ser Ser Glu
90 95 100

aag gcg gag ttg gcc tca tcc agc agc aag tta ggg ctg aag ctg cga 511
Lys Ala Glu Leu Ala Ser Ser Ser Ser Lys Leu Gly Leu Lys Leu Arg
105 110 115

gtc ctg ggc ctc aaa gtg ctg ggc ctc gat gag agt gac gag 553
Val Leu Gly Leu Lys Val Leu Gly Leu Asp Glu Ser Asp Glu
120 125 130



<210> 411 
<211> 470 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 23..469



<400> 411 

gggttcccgt tccccgcgga sc atg cgg tac aac gag aag gag ctg cag gct 52
Met Arg Tyr Asn Glu Lys Glu Leu Gln Ala
1 5 10

ctg tcc cgg cag ccg gcc gag atg gcg gcc gag ctg ggc atg agg ggc 100
Leu Ser Arg Gln Pro Ala Glu Met Ala Ala Glu Leu Gly Met Arg Gly
15 20 25

ccc aag aag ggc agc gtg ctg aag cgg cgg ctg gtg aag ctg gtg gtg 148
Pro Lys Lys Gly Ser Val Leu Lys Arg Arg Leu Val Lys Leu Val Val
30 35 40

aat ttc ctc ttc tac ttt cgg aca gac gag gcc gag ccc gtc gga gcc 196
Asn Phe Leu Phe Tyr Phe Arg Thr Asp Glu Ala Glu Pro Val Gly Ala
45 50 55










ctg 


ctg 


ctg 


gag 




tgc 


aga 


gtc 


gtc 


egg 


gaa 


gag 


eec 


gge 




ttc 


244 


Leu 


Leu 


Leu 


Glu 


Arg 


Cys 




Val 


Val 




Glu 


Glu 


Pro 


Gly 


Thr 


Phe 






60 










65 










70 












tcc atc agc ttc att gag gac cct gag agg aag tat cac ttt gag tgc 292
Ser Ile Ser Phe Ile Glu Asp Pro Glu Arg Lys Tyr His Phe Glu Cys
75 80 85 90

agc agc gag gag cag tgt cag gag tgg atg gag gct ctg cgt cgg gcc 340
Ser Ser Glu Glu Gln Cys Gln Glu Trp Met Glu Ala Leu Arg Arg Ala
95 100 105

agc tac gag ttc atg cgg aga agc ctc atc ttc tac agg aac gaa atc 388
Ser Tyr Glu Phe Met Arg Arg Ser Leu Ile Phe Tyr Arg Asn Glu Ile
110 115 120

cgg aag gtg acg ggc aag gac ccc ctg gaa cag ttc ggc ata tcc gag 436
Arg Lys Val Thr Gly Lys Asp Pro Leu Glu Gln Phe Gly Ile Ser Glu
125 130 135

gag gcc agg ttc cag ctg agt ggc ttg cag gcg t 470
Glu Ala Arg Phe Gln Leu Ser Gly Leu Gln Ala
140 145



<210> 412
<211> 486
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 211..486

<400> 412

agattcttaa ctacgtgtct tacagcaaag ttgcccatgg gaacaaaggc tgtagggtgt 60 

cattgatctt gtaatacaac acatatttga tgggtgaatc ttattagcac tcctttgaaa 120 

atgttccagt atggagtcag ttgctgataa agaggaagtt cacagctagg aatccgaagt 180 

taagatttct caattgccgc ctgggcactc atg cct gta gtc cca gca ctt ggg 234 
Met Pro Val Val Pro Ala Leu Gly

agg cca agg tgg gca gat cac ctg agg tcg gga gtt cgg gac cag cct 282
Arg Pro Arg Trp Ala Asp His Leu Arg Ser Gly Val Arg Asp Gln Pro
10 15 20

ggc caa cct ggt gaa gcc ccc cca tct cta cta aaa ata caa aaa ttg 330
Gly Gln Pro Gly Glu Ala Pro Pro Ser Leu Leu Lys Ile Gln Lys Leu
25 30 35 40

gcc ggg tat ggt ggt ggc tgc ctg tgg tcc cag cta ctc ggg agg ctg 378
Ala Gly Tyr Gly Gly Gly Cys Leu Trp Ser Gln Leu Leu Gly Arg Leu
45 50 55

agg cgg gag aat cac ttg agc ccg gga ggc gga ggt tgc agt gag ccg 426
Arg Arg Glu Asn His Leu Ser Pro Gly Gly Gly Gly Cys Ser Glu Pro
60 65 70

aga ttg tgc cac tgc act cca gcc tgg gtg aca gag caa gac tcc atc 474
Arg Leu Cys His Cys Thr Pro Ala Trp Val Thr Glu Gln Asp Ser Ile
75 80 85

tca aaa ata gaa 486
Ser Lys Ile Glu
90























<210> 413 
<211> 335 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 73..252

<220>
<221> sig_peptide
<222> 73..135
<223> Von Heijne matrix
score 4.30000019073486
seq AGVSCTCAXPASA/KS

<400> 413 

actccgcctt ccacgtgcac ccactgcctc ttcccttctc gcttgggaac tctagtctcg 60
cctcgggttg ca atg gac ccc aac tgc tcc tgt gcc gct ggt gtc tcc tgc 111
Met Asp Pro Asn Cys Ser Cys Ala Ala Gly Val Ser Cys
-20 -15 -10

acc tgc gcc ast cct gca agt gca aag agt gca aat gca cct cct gca 159
Thr Cys Ala Xaa Pro Ala Ser Ala Lys Ser Ala Asn Ala Pro Pro Ala
-5 1 5

aga aga gct gct gct cct ggt gca ctg gcc cca ccc tcg tgg aca cct 207
Arg Arg Ala Ala Ala Pro Gly Ala Leu Ala Pro Pro Ser Trp Thr Pro
10 15 20

gcc ctg ccc tgc cac ctg tct gtc tgt ccc aaa gaa gtt ctg gta 252
Ala Leu Pro Cys His Leu Ser Val Cys Pro Lys Glu Val Leu Val
25 30 35

tgaacttgag gacacatgtc cagtgggagg tgagaccacc tctcaatatt caataaagct 312
gctgagaatc tagcctcgaa aaa 335

<210> 414
<211> 494
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 63..422

<220>
<221> sig_peptide
<222> 63..167
<223> Von Heijne matrix
score 4.09999990463257
seq LSQSLAMSPQTHS/QT

<400> 414 

cacggatctc acaggtggag gtggggcgag cttctttcta gtgtggacca gaggtggggg 60 
ca atg agt aaa cat ctt cac tgt aag gtt ttg gga ctg ccc ctc agg 107
Met Ser Lys His Leu His Cys Lys Val Leu Gly Leu Pro Leu Arg
-35 -30 -25

ctt gag gtc tcg gca tcg tgt ctc agt cag tcc ttg gcg atg tcc cct 155
Leu Glu Val Ser Ala Ser Cys Leu Ser Gln Ser Leu Ala Met Ser Pro
-20 -15 -10 -5

cag acg cac tca cag act tgc ata cgt aat tta gta aca tgc atc aac 203
Gln Thr His Ser Gln Thr Cys Ile Arg Asn Leu Val Thr Cys Ile Asn
1 5 10

tat cct aga aca tct aca ggg tgc aaa gga acc acc act cag aga atc 251
Tyr Pro Arg Thr Ser Thr Gly Cys Lys Gly Thr Thr Thr Gln Arg Ile
15 20 25

atg gag cca gtg gag tta gaa gtg gaa ggg aca gaa caa gac aat gct 299
Met Glu Pro Val Glu Leu Glu Val Glu Gly Thr Glu Gln Asp Asn Ala
30 35 40

aaa acc tgt ggt tca cta gga agg ggg aat gaa aac acc atg ctc cga 347
Lys Thr Cys Gly Ser Leu Gly Arg Gly Asn Glu Asn Thr Met Leu Arg
45 50 55 60

ggt gga ttc agc atg aac aca act gtg ggg caa gga att tcc aag caa 395
Gly Gly Phe Ser Met Asn Thr Thr Val Gly Gln Gly Ile Ser Lys Gln
65 70 75

aca cac cac act agt acc act tct tcc taaacagaaa agggaaagag 442
Thr His His Thr Ser Thr Thr Ser Ser
80 85

catccttcga cttccccctc cctccatcag caccgggttt tcctccaaaa aa 494

<210> 415 
<211> 366 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 112..306

<220>
<221> sig_peptide
<222> 112..153
<223> Von Heijne matrix
score 3.70000004768372
seq LCWLKLVSITTVA/II

<400> 415 

cctttctcgt tccccggcca tcttatcggc tgctgttggt tgggggccgt cccgctccta 60
aggcaggaag atggtggccg caaagaagac ggaaatctga aatagagtac t atg cta 117
Met Leu

tgt tgg cta aaa ctg gtg tcc atc act aca gtg gca ata ata ttg aac 165
Cys Trp Leu Lys Leu Val Ser Ile Thr Thr Val Ala Ile Ile Leu Asn
-10 -5 1

tgg gca cag cat gcg gaa aat act aca gag tgt gca cac tgg cta tca 213
Trp Ala Gln His Ala Glu Asn Thr Thr Glu Cys Ala His Trp Leu Ser
5 10 15 20

ttg atc cag gtg act ctg aca tca tta gaa gca tgc cag aac aga ctg 261
Leu Ile Gln Val Thr Leu Thr Ser Leu Glu Ala Cys Gln Asn Arg Leu
25 30 35

gtg aaa agt aaa cct ttt cac cta caa aat ttc acc tgc aaa cct 306
Val Lys Ser Lys Pro Phe His Leu Gln Asn Phe Thr Cys Lys Pro
40 45 50

taaacctgca aaattttcct ttaataaaat ttgcttgttc taaaaaaaaa aaaaaaaaag 366 



<210> 416 
<211> 472 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 115..411



<400> 416 

aagaaagggg tgaggcctaa gggacaatca ggatgttttt cagagagaag tgtggatgct 
ggaeaggaag aaccacagat accagatacg ggtactgttg taactctgtt ctcc atg 

Met 
1 

aaa aaa aag gaa gaa aca aca ctt ■ 
Lys Lys Lys Glu Glu Thr Thr Leu : 

5 10 15 

cag tac caa cta gtc aat gct gaa tcg act tct ccc ttt cta cat tgc 213
Gln Tyr Gln Leu Val Asn Ala Glu Ser Thr Ser Pro Phe Leu His Cys
        20                  25                  30

ctg aga gaa gtc att ggg gaa tac tct gta cac gaa ttt tca ctg ttg 261
Leu Arg Glu Val Ile Gly Glu Tyr Ser Val His Glu Phe Ser Leu Leu
    35                  40                  45

ggg aaa aca gag agt caa ggg att gga ttg tgg att gca ttg gtg gtt 309
Gly Lys Thr Glu Ser Gln Gly Ile Gly Leu Trp Ile Ala Leu Val Val
50                  55                  60                  65

ttc ctc agt ttc ctc atc ttc tcc aca agt ttc tac ata tcg aat gca 357
Phe Leu Ser Phe Leu Ile Phe Ser Thr Ser Phe Tyr Ile Ser Asn Ala
                70                  75                  80

gag cag ccc ttc ttc aaa gaa cct cct acg gaa gct gct aag gaa ctc 405
Glu Gln Pro Phe Phe Lys Glu Pro Pro Thr Glu Ala Ala Lys Glu Leu
            85                  90                  95

agt ctg tagctctgcg tggagccatg tgtaaacact gaactgagac ctgccacctc 461
Ser Leu

ctactaccta a 472



<210> 417 
<211> 532 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 125..508

<400> 417
agctrcctgg gtaaggccca agatggctgt cttcgcctta gtactcgtgt gaagttggcg 60
gggacggttc ctgtcatctt cttgggctta tttggtgtgc tgttgaaggg gggagactag 120
agaa atg gca ggg aac ctc tta tcc ggg gca ggt agg cgc ctg tgg gac 169
     Met Ala Gly Asn Leu Leu Ser Gly Ala Gly Arg Arg Leu Trp Asp
     1               5                   10                  15

tgg gtg cct ctg gcg tgc aga agc ttc tct ctt ggt gtg cct aga ttg 217
Trp Val Pro Leu Ala Cys Arg Ser Phe Ser Leu Gly Val Pro Arg Leu
                20                  25                  30

atc ggt ata agg ctc act ctc ccg ccc ccc aaa gtg gtt gat cgt tgg 265
Ile Gly Ile Arg Leu Thr Leu Pro Pro Pro Lys Val Val Asp Arg Trp
            35                  40                  45

aac gag aaa agg gcc atg ttc gga gtg tat gac aac atc ggg atc ctg 313
Asn Glu Lys Arg Ala Met Phe Gly Val Tyr Asp Asn Ile Gly Ile Leu
        50                  55                  60

gga aac ttt gaa aag cac mcc aaa gaa ctg atc agg ggg ccc ata tgg 361
Gly Asn Phe Glu Lys His Xaa Lys Glu Leu Ile Arg Gly Pro Ile Trp
    65                  70                  75

ctt cga ggt tgg aaa ggg aat gaa ttg caa cgt tgt ... ... aag agg 409
Leu Arg Gly Trp Lys Gly Asn Glu Leu Gln Arg Cys Ile Arg Lys Arg
80                  85                  90                  95

aaa atg gtt gga agt aga atg ttc gct gat gac ctg cac aac ctt aat 457
Lys Met Val Gly Ser Arg Met Phe Ala Asp Asp Leu His Asn Leu Asn
                100                 105                 110

aaa cgc atc cgc tat ctc tac aaa cac ttt aac cga cat ggg aag ttt 505
Lys Arg Ile Arg Tyr Leu Tyr Lys His Phe Asn Arg His Gly Lys Phe
            115                 120                 125

cga tagaagagaa agctgagaac ttcg 532
Arg



<210> 418 
<211> 470 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 50..205

<400> 418
actctttcct tggctgctca ggtcataggt gctgtgatct agagacaaa atg ttc ctt 58
                                                      Met Phe Leu
                                                      1

aca ctg gca gat act tgc aaa cta agg gga atg agc ttc ctt ctg aat 106
Thr Leu Ala Asp Thr Cys Lys Leu Arg Gly Met Ser Phe Leu Leu Asn
    5                   10                  15

gtt tat gaa gga gag gcc act gtg tca tct gtc tta gag cta ttg gaa 154
Val Tyr Glu Gly Glu Ala Thr Val Ser Ser Val Leu Glu Leu Leu Glu
20                  25                  30                  35

tcc tgg atc att gtg gga aat gaa aga tac ttc gat gga atc agc agc 202
Ser Trp Ile Ile Val Gly Asn Glu Arg Tyr Phe Asp Gly Ile Ser Ser
                40                  45                  50

cat tgatccaatg ccaactccaa gactggaacg tcgcaatgat agttccaagg 255
His

cggaaatttg acgtaattct tttcgacaca gttttacagg tctggatgcc cattttaatt 315
cttctgaaag catgcctyct ccttctggct tcaggactcc atctscagcc tcttgatcta 375
aaaataatcc ccaaaccaaa aaaaaattag atactatttc ctcaaaatta ggtattttaa 435
tcaaaaacat cttaacataa atacattatt atcac 470



<210> 419 
<211> 559 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 175..426

<400> 419
actcggcggg cgctgttgag ggagtcgggc cgcgactgtg gtcgttttta taccttcccg 60
cgcggacgcc ggcgctgcca acggaagggc gggtagggcg agacggagtt tcgtcatgtt 120
ggccaggccc atttgagatc tttgaagata tcctcaacgt gaggctctgc tgcc atg 177
                                                            Met
                                                            1



aag gtg aag att aag tgc tgg aac ggc gtg gcc act tgg ctc tgg gtg 225
Lys Val Lys Ile Lys Cys Trp Asn Gly Val Ala Thr Trp Leu Trp Val
            5                   10                  15

gcc aac gat gag aac tgt ggc atc tgc agg atg gca ttt aac gga tgc 273
Ala Asn Asp Glu Asn Cys Gly Ile Cys Arg Met Ala Phe Asn Gly Cys
        20                  25                  30

tgc cct gac tgc aag gtg ccc ggc gac gac tgc ccg ctg gtg tgg ggc 321
Cys Pro Asp Cys Lys Val Pro Gly Asp Asp Cys Pro Leu Val Trp Gly
    35                  40                  45

cag tgc tcc cac tgc ttc cac atg cat tgc atc ctc aag tgg ctg cac 369
Gln Cys Ser His Cys Phe His Met His Cys Ile Leu Lys Trp Leu His
50                  55                  60                  65

gca cag cag gtg cag cag cac tgc ccc atg tgc cgc cag gaa tgg aag 417
Ala Gln Gln Val Gln Gln His Cys Pro Met Cys Arg Gln Glu Trp Lys
                70                  75                  80

ttc aag gag tgaggcccga cctggctctc gctggagggg catcctgaga 466
Phe Lys Glu

ctccttcctc atgctggcgc cgatggctgc tggggacagc gcccctgagc tgcaacaagg 526

tggawacaag ggctggagct gcgtttgttt tgc 559 

<210> 420 
<211> 462 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 29..268

<400> 420
atccagcagc cccctgctcc ggcccagc atg gcg acc ccg acc cag acc ccc 52
                               Met Ala Thr Pro Thr Gln Thr Pro
                               1               5

aca aag gct cct gag gaa cct gac cca ttt tac tat gac tac aac acg 100
Thr Lys Ala Pro Glu Glu Pro Asp Pro Phe Tyr Tyr Asp Tyr Asn Thr
    10                  15                  20

gtg cag act gtg ggc atg ... ctg gca acc atc ttg ttc ctg ctg ggt 148
Val Gln Thr Val Gly Met Thr Leu Ala Thr Ile Leu Phe Leu Leu Gly
25                  30                  35                  40

atc ctc atc gtc atc ... aag aag gtg aag tgc agg aag gcg gac tcc 196
Ile Leu Ile Val Ile Ser Lys Lys Val Lys Cys Arg Lys Ala Asp Ser
                45                  50                  55

agg tct gag agc ... ... ... aaa tcc tgt aag tct gag ctt ccc tct 244
Arg Ser Glu Ser Pro Thr Cys Lys Ser Cys Lys Ser Glu Leu Pro Ser
            60                  65                  70

tca gcc cct ggt ggc ggc ggc gtg taacaccttc ccgaggaaac tccgctgccg 298
Ser Ala Pro Gly Gly Gly Gly Val
        75                  80

accctgcctg agcgcgggag cctgaggacc gggtggaggc ggtggggacc cagccgcgcg 358
ccgggagcgc tccccggaat gagcgcccca cccaccccaa ggctggagcc gctgcaccct 418
gctgtccctc tccaggcctt ggcaatgacg atcccccaaa gagc 462

<210> 421 
<211> 547 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 99..479

<400> 421
tttcgaaggg aagggagccc cctataaaac agcctacagt ggacagtctg gtcggcagag 60
ccgcaggtca gtcgtgaaga gggagctcta ttgccacc atg agt ttc tcc ggc aag 116
                                          Met Ser Phe Ser Gly Lys
                                          1               5



tac caa ctg cag agc cag gaa aac ttt gaa gcc ttc atg aag gca atc 164
Tyr Gln Leu Gln Ser Gln Glu Asn Phe Glu Ala Phe Met Lys Ala Ile
            10                  15                  20

ggt ctg ccg gaa gag ctc atc cag aag ggg aag gat atc aag ggg gtg 212
Gly Leu Pro Glu Glu Leu Ile Gln Lys Gly Lys Asp Ile Lys Gly Val
        25                  30                  35

tcg gaa atc gtg cag aat ggg aag cac ttc aag ttc acc atc acc gct 260
Ser Glu Ile Val Gln Asn Gly Lys His Phe Lys Phe Thr Ile Thr Ala
    40                  45                  50

ggg tcc aaa gtg atc caa aac gaa ttc acg gtg ggg gag gaa tgt gag 308
Gly Ser Lys Val Ile Gln Asn Glu Phe Thr Val Gly Glu Glu Cys Glu
55                  60                  65                  70

ctg gag aca atg aca ggg gag aaa gtc aag aca gtg gtt cag ttg gaa 356
Leu Glu Thr Met Thr Gly Glu Lys Val Lys Thr Val Val Gln Leu Glu
                75                  80                  85

ggt gac aat aaa ctg gtg aca act ttc aaa aac atc aag tct gtg acc 404
Gly Asp Asn Lys Leu Val Thr Thr Phe Lys Asn Ile Lys Ser Val Thr
            90                  95                  100

gaa ctc aac ggc gac ata atc acc aat ... atg ... ttg ggt gac att 452
Glu Leu Asn Gly Asp Ile Ile Thr Asn Thr Met Thr Leu Gly Asp Ile
        105                 110                 115

gtc ttc aag aga atc agc aag aga att taaacaagtc tgcatttcat 499
Val Phe Lys Arg Ile Ser Lys Arg Ile
    120                 125

attattttag tgtgtaaaat taatgtaata aagtgaactt tgttttaa 547



<210> 422 
<211> 669 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 29..259

<400> 422
gacgcakagc cgtggggaag ttttcgca atg gcc gcc gga acg tcg ccg gcc 52
                               Met Ala Ala Gly Thr Ser Pro Ala
                               1               5



gat gct ... ... gac gga aag aaa gcc gcc gga ccc act tcc ggt gcg 100
Asp Ala Leu Cys Asp Gly Lys Lys Ala Ala Gly Pro Thr Ser Gly Ala
    10                  15                  20

ttt tac aat gaa ctc cgg gca ctc gag tcg ctc ccc gga aac tat agt 148
Phe Tyr Asn Glu Leu Arg Ala Leu Glu Ser Leu Pro Gly Asn Tyr Ser
25                  30                  35                  40

tct gct tcc acc gtt gtc cta ctt tgg ata gca tca aga caa aaa tct 196
Ser Ala Ser Thr Val Val Leu Leu Trp Ile Ala Ser Arg Gln Lys Ser
                45                  50                  55

cag tct tcg agg aac tca cag tgt agc aga aaa gac aaa gga ggt aaa 244
Gln Ser Ser Arg Asn Ser Gln Cys Ser Arg Lys Asp Lys Gly Gly Lys
            60                  65                  70

cag gag gag cat gaa taattcatct tgatgtaatc atgcaagact cctaagaaga 299
Gln Glu Glu His Glu
        75

tgttaggaga ccctgtcatc attcttttgg cctctaatgt ccgttatctt catgtttgct 359
ggcttccttt tactactatg gaaatacaac atatgcaaaa gtagtgtttt cgctgtttaa 419
aaacatcgtc agcagcatca tcatcattaa ctgcaccagg aagccatgac tcctttctgg 479
acaaggtgca cttgcctgtc atccttccat ctctttcttc ctccttgaag ttcacaatac 539
ccagtccagg agcattgcag agaatgctag gtgtgtctgc agacttgact ttaaaagaaa 599
acaagttcct tcaagtgagt tcatcatgat ggggatgata tttgtttgag agaatgtgtg 659
tctttggcaa 669



<210> 423 
<211> 546 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 78..302

<400> 423
ctygccagct gccggtcttt cgggggctcc gtaactttct atccgtccgc gtcascyttg 60
ccaccctcat ctccaat atg cct ggt ccg acc ccc agt ggc act aac gtg 110
                   Met Pro Gly Pro Thr Pro Ser Gly Thr Asn Val
                   1               5                   10

gga tcc tca ggg cgc tct ccc agc aaa gca gtg gcc gcc cgg gcg cgg 158
Gly Ser Ser Gly Arg Ser Pro Ser Lys Ala Val Ala Ala Arg Ala Arg
            15                  20                  25

gat cca ctg tcc ggc aga gga aaa atg cca gct gtg gga caa gga gtg 206
Asp Pro Leu Ser Gly Arg Gly Lys Met Pro Ala Val Gly Gln Gly Val
        30                  35                  40

cag gcc gca caa cct cgg cag gca ccg ggg gga tgt ggc gat tct aca 254
Gln Ala Ala Gln Pro Arg Gln Ala Pro Gly Gly Cys Gly Asp Ser Thr
    45                  50                  55

cag aag att cac ctg grc tca aag ttg gcc ctg ttc cag tat tgg tta 302
Gln Lys Ile His Leu Xaa Ser Lys Leu Ala Leu Phe Gln Tyr Trp Leu
60                  65                  70                  75

tgagtcttct gttcatcgct tctgtattta tgttgcacat ttggggcaag tacactcgtt 362
cgtagattca gttacatcca tctgtcatct gaagaakgag gaaawaaccc aacatttctt 422
ggacccaaaa gtatagtgac tatctgttca tgagagaaat tttctgtaag cttgctgttt 482
ttaamcaggg gatttatcaa ttaattgatt tgaggaatca gtttttttct atggctaata 542
aact 546



<210> 424 
<211> 499 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 16..183

<400> 424
tttattttgc cttct atg tca tct tgg agc tta atg tgc atg caa gat atg 51
                 Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met
                 1               5                   10

gga aat act aaa gca aga cta ctc tat gaa gcc aat ctt cca gag aac 99
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn
        15                  20                  25

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc atc aga gat 147
Phe Arg Arg Pro Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp
    30                  35                  40

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro
45                  50                  55

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggasc cactgtggat 433
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493
gtggaa 499

<210> 425 
<211> 512 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 122..487

<400> 425
aaacacatcc aagcttaaga cggtgaggtc agcttcacat tctcaggaac tctccttctt 60
tgggtctggc tgaagttgag gatctcttac tctctaggcc acggaattaa cccgagcagg 120
c atg gag gcc tct gct ctc acc tca tca gca gtg acc agt gtg gcc aaa 169
  Met Glu Ala Ser Ala Leu Thr Ser Ser Ala Val Thr Ser Val Ala Lys
  1               5                   10                  15



gtg gtc agg gtg gcc tct ggc tct gcc gta gtt ttg ccc ctg gcc agg 217
Val Val Arg Val Ala Ser Gly Ser Ala Val Val Leu Pro Leu Ala Arg
            20                  25                  30

att gct aca gtt gtg att gga gga gtt gtg gcc atg gcg gct gtg ccc 265
Ile Ala Thr Val Val Ile Gly Gly Val Val Ala Met Ala Ala Val Pro
        35                  40                  45

atg gtg ctc agt gcc atg ggc ttc act gcg gcg gga atc gcc tcg tcc 313
Met Val Leu Ser Ala Met Gly Phe Thr Ala Ala Gly Ile Ala Ser Ser
    50                  55                  60

tcc ata gca gcc aag atg atg tcc gcg gcg gcc att gcc aat ggg ggt 361
Ser Ile Ala Ala Lys Met Met Ser Ala Ala Ala Ile Ala Asn Gly Gly
65                  70                  75                  80

gga gtt gcc tcg ggc agc ctt gtg gct act ctg cag tca ctg gga gca 409
Gly Val Ala Ser Gly Ser Leu Val Ala Thr Leu Gln Ser Leu Gly Ala
                85                  90                  95

act gga ctc tcc gga ttg acc aag ttc atc ctg ggc tcc att ggg tct 457
Thr Gly Leu Ser Gly Leu Thr Lys Phe Ile Leu Gly Ser Ile Gly Ser
            100                 105                 110

gcc att gcg gct gtc att gcg agg ttc tac tagctccctg ccctcgcctg 507
Ala Ile Ala Ala Val Ile Ala Arg Phe Tyr
        115                 120

cagag 512



<210> 426 
<211> 502 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 57..278

<400> 426
attcaccggc gacgcgatac ggttcctcca ccgaggccca tgcgaagctt tccact atg 59
                                                              Met
                                                              1

gct tcc agc act gtc ccg gtg agc gct gct ggc tcg gct aat gaa act 107
Ala Ser Ser Thr Val Pro Val Ser Ala Ala Gly Ser Ala Asn Glu Thr
            5                   10                  15

ccc gaa ata ccg gac aac gtg gga gat tgg ctt cgg ggc gtc tac cgc 155
Pro Glu Ile Pro Asp Asn Val Gly Asp Trp Leu Arg Gly Val Tyr Arg
        20                  25                  30

ttt gcc act gat agg aat gac ttc cgg agg aac ttg ata cta aat ttg 203
Phe Ala Thr Asp Arg Asn Asp Phe Arg Arg Asn Leu Ile Leu Asn Leu
    35                  40                  45

gga ctc ttt gct gcg gga gtt tgg ctg gcc agg aac ttg agt gac att 251
Gly Leu Phe Ala Ala Gly Val Trp Leu Ala Arg Asn Leu Ser Asp Ile
50                  55                  60                  65

gac ctc atg gca cct cag cca ggg gtg tagccaagta gttctaatgc 298
Asp Leu Met Ala Pro Gln Pro Gly Val
                70

cacctgtcgt cttatcatct gattgcagac aaatggaatc ctgtgctgaa cccgaatctt 358 
ccaaaaaaca gcctacaatc tgtgaccacc acaagatgtg ccctgatggc agctgaagtt 418 
tgattcagat gggcactttt cttccccttc cctgcctagt ttccttttgt tccttgagtc 478 
acgcagaatt ccattctctg gtca 502 



<210> 427 
<211> 575 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 50..340

<400> 427
gaggagaaga ggtgaccccg ctaaagataa taaatcatca ttgatcaag atg tct tca 58
                                                      Met Ser Ser
                                                      1



gca cct gag cct cca aca ttc aaa aag ... ... ccc aaa gar aaa gag 106
Ala Pro Glu Pro Pro Thr Phe Lys Lys Glu Pro Pro Lys Glu Lys Glu
    5                   10                  15

ttt caa agc cca ggg ctc aga ggg gtg cgc aca ... acc tta ttt cgt 154
Phe Gln Ser Pro Gly Leu Arg Gly Val Arg Thr Thr Thr Leu Phe Arg
20                  25                  30                  35

gct gtg aat cca gag ctc ttc att aaa cct aac aaa cct gta atg gct 202
Ala Val Asn Pro Glu Leu Phe Ile Lys Pro Asn Lys Pro Val Met Ala
                40                  45                  50

ttc gga ttg gta act ctt tca ctt tgc gtg gca tat att ggt tat cta 250
Phe Gly Leu Val Thr Leu Ser Leu Cys Val Ala Tyr Ile Gly Tyr Leu
            55                  60                  65

cat gca ata caa gag aat aaa aag gac ctc tat gaa gct att gat agt 298
His Ala Ile Gln Glu Asn Lys Lys Asp Leu Tyr Glu Ala Ile Asp Ser
        70                  75                  80

gag ggg cac agt tat atg agg cgg aaa aca tcc aaa tgg gat 340
Glu Gly His Ser Tyr Met Arg Arg Lys Thr Ser Lys Trp Asp
    85                  90                  95

tagtagtgct ggttagtgca ratggacctt tattaaaggt tctgaaatct tcaaatgaaa 400
gaccttgtga gtgtacagta tcatgtttct tgttctagaa catgctaatg aagagagaag 460
atagcagttg caaccagaca actgtcgtaa attttgtcct ttcacagctg cagccattat 520
ctcattcttt ttccacagag tgagcgtcat aatattttct ttccttactc ttata 575



<210> 428
<211> 493
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 177..470

<400> 428
atcgaccacg gaaatttgac acctccgggc ttggaagcag ctctctcctc cttccccgct 60
gcttataaac ctcagccctg aggctccagc tcactctacc ccatctcctt gccgggtcag 120
ccctgacaaa ggtcagctag ccccttgagg acatcagctt tggcctcagg gtccta atg 179
                                                              Met
                                                              1

gca gca gaa cca ctg aca gag cta gag gag tcc att gag aac gtg gtc 227
Ala Ala Glu Pro Leu Thr Glu Leu Glu Glu Ser Ile Glu Asn Val Val
            5                   10                  15

acc acc ttc ttc acc ttt gca agg cag gag ggc cgg aag gat agc ctc 275
Thr Thr Phe Phe Thr Phe Ala Arg Gln Glu Gly Arg Lys Asp Ser Leu
        20                  25                  30

agc gtc aac gag ttc aaa gag ctg gtt acc cag cag ttg ccc cat ctg 323
Ser Val Asn Glu Phe Lys Glu Leu Val Thr Gln Gln Leu Pro His Leu
    35                  40                  45

... aag ... gtg ggc ... ... ... gag aag atg aag agc ttg ... gtg 371
Leu Lys Asp Val Gly Ser ... Asp Glu Lys Met Lys Ser Leu Asp Val
50                  55                  60                  65

aat cag gac tcg gag ctc aag ttc aat gag tac tgg aga ttg att ggg 419
Asn Gln Asp Ser Glu Leu Lys Phe Asn Glu Tyr Trp Arg Leu Ile Gly
                70                  75                  80

gag ctg gcc aag gaa atc agg aag aag aaa gac ctk aag atc agg aag 467
Glu Leu Ala Lys Glu Ile Arg Lys Lys Lys Asp Leu Lys Ile Arg Lys
            85                  90                  95

aag taaagccgcc tggctgagat ggg 493
Lys



<210> 429 
<211> 505 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 114..458

<400> 429
agttccggct ctgcstgaag agctttgcat tgtgggaagt ytttytttct cgttccccgg 60
ccatcttagc ggctgctgtt ggttgggggc cgtcccgctc ctaaggcagg aag atg 116
                                                           Met
                                                           1



gtg gcc gca aag aag acg aaa aag tcg ctg gag tcg atc aac tct agg 164
Val Ala Ala Lys Lys Thr Lys Lys Ser Leu Glu Ser Ile Asn Ser Arg
            5                   10                  15

ctc caa ctc gtt atg aaa agt ggg aag tac gtc ctg ggg tac aag cag 212
Leu Gln Leu Val Met Lys Ser Gly Lys Tyr Val Leu Gly Tyr Lys Gln
        20                  25                  30

act ctg aag atg atc aga caa ggc aaa gcg aaa ttg gtc att ctc gct 260
Thr Leu Lys Met Ile Arg Gln Gly Lys Ala Lys Leu Val Ile Leu Ala
    35                  40                  45

aac aac tgc cca gct ttg agg aaa tct gaa ata gag tac tat gct atg 308
Asn Asn Cys Pro Ala Leu Arg Lys Ser Glu Ile Glu Tyr Tyr Ala Met
50                  55                  60                  65

ttg gct aaa act ggt gtc cat cac tac agt ggc aat aat att gaa ctg 356
Leu Ala Lys Thr Gly Val His His Tyr Ser Gly Asn Asn Ile Glu Leu
                70                  75                  80

ggc aca gca tgc gga ... ... ... aga gtg tgc ... ctg ... atc att 404
Gly Thr Ala Cys Gly ... Tyr Tyr Arg Val Cys Thr Leu Ala Ile Ile
            85                  90                  95

gat cca ggt gac tct gac atc att aga agc atg cca gaa cag act ggt 452
Asp Pro Gly Asp Ser Asp Ile Ile Arg Ser Met Pro Glu Gln Thr Gly
        100                 105                 110

gaa aag taaacctttt cacctacaaa atttcacctg caaaaaaaaa aaaaaav 505
Glu Lys
    115



<210> 430 
<211> 772
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 324..629

<400> 430
ataatcgcgt catttccggg aggggacgaa ggggtagttc tttcacctcg gctgggcgcc 60
tagaaaagcc tagaaacagc tccttttttc ttccgcctcc gagtcttcgc gtcagcgtcc 120
tgcgcagggc ccttggggcg aatcgcggtg cgcgtcgggg cgaccgccct ccctccctgg 180
gaggggcgag ggggctagcg gcgacsgctg gggcgagcgc gcctgcgcgc tgggtgattt 240
tttcacgtgt cgccagggcc ggactgcgag tctctttgcg gcgctacact agagcagagt 300
acgagtctga ggcggaggga gta atg gca gga caa gcg ttt aga aag ttt ctt 353
                          Met Ala Gly Gln Ala Phe Arg Lys Phe Leu
                          1               5                   10



cca ctc ttt gac cga gta ttg gtt gaa agg agt gct gct gaa act gta 401
Pro Leu Phe Asp Arg Val Leu Val Glu Arg Ser Ala Ala Glu Thr Val
                15                  20                  25

acc aaa gga ggc att atg ctt ... gaa ... tct ... gga ... gta ttg 449
Thr Lys Gly Gly Ile Met Leu Pro Glu Lys Ser Gln Gly Lys Val Leu
            30                  35                  40

caa gca aca gta gtc gct gtt gga tcg ggt tct aaa gga aag ggt gga 497
Gln Ala Thr Val Val Ala Val Gly Ser Gly Ser Lys Gly Lys Gly Gly
        45                  50                  55

gag att caa cca gtt agc gtg aaa gtt gga gat aaa gtt ctt ctc cca 545
Glu Ile Gln Pro Val Ser Val Lys Val Gly Asp Lys Val Leu Leu Pro
    60                  65                  70

gaa tat gga ggc acc aaa gta gtt cta gat rac aag gat tat ttc cta 593
Glu Tyr Gly Gly Thr Lys Val Val Leu Asp Xaa Lys Asp Tyr Phe Leu
75                  80                  85                  90

ttt aga gat ggt gac att ctt gga aag tac gta gac tgaaataagt 639
Phe Arg Asp Gly Asp Ile Leu Gly Lys Tyr Val Asp
                95                  100

cactattgaa atggcatcaa catgatgctg cccattccac tgaagttctg aaatctttcg 699
tcatgtaaat aatttccata tttctctttt ataataaact aatgataact aatgaccaaa 759
aaaaaaaaaa aaa 772 

<210> 431 
<211> 672 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 224..529

<400> 431
cataagaaca gggaggttag aagtagggtc ttggtgacaa aatatgttgt gtagagttca 60
ggggagagtg cgtcatatgt tgttctagga agattgtagt ggtgagggtg tttattataa 120
taatgtttgt gtattcggct atgaagaata ggccagggcc ggactgcgag tctctttgcg 180
gcgctacact agagcagagt acgagtctga ggcggaggga gta atg gca gga caa 235
                                                Met Ala Gly Gln
                                                1

gcg ttt aga aag ttt ctt cca ctc ttt gac cga gta ttg gtt gaa agg 283
Ala Phe Arg Lys Phe Leu Pro Leu Phe Asp Arg Val Leu Val Glu Arg
5                   10                  15                  20

agt gct gct gaa act gta acc aaa gga ggc att atg ctt cca gaa aaa 331
Ser Ala Ala Glu Thr Val Thr Lys Gly Gly Ile Met Leu Pro Glu Lys
                25                  30                  35

tct caa gga aaa gta ttg caa gca aca gta gtc gct gtt gga tcg ggt 379
Ser Gln Gly Lys Val Leu Gln Ala Thr Val Val Ala Val Gly Ser Gly
            40                  45                  50

tct aaa gga aag ggt gga gag att caa cca gtt agc gtg aaa gtt gga 427
Ser Lys Gly Lys Gly Gly Glu Ile Gln Pro Val Ser Val Lys Val Gly
        55                  60                  65

gat aaa gtt ctt ctc cca gaa tat gga ggc acc aaa gta gtt cta gat 475
Asp Lys Val Leu Leu Pro Glu Tyr Gly Gly Thr Lys Val Val Leu Asp
    70                  75                  80

gac aag gat tat ttc cta ttt aga gat ggt gac att ctt ggr aag tac 523
Asp Lys Asp Tyr Phe Leu Phe Arg Asp Gly Asp Ile Leu Gly Lys Tyr
85                  90                  95                  100

gta gac tgaaataagt cactattgaa atggcatcaa catgatgctg cccattccac 579
Val Asp

tgaagttctg aaatctttcg tcatgtaaat aatttccata tttctctttt ataataaact 639
aatgataact aatgaccaaa aaaaaaaaaa aaa 672 



<210> 432 
<211> 517 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 97..444

<400> 432
atctttccgt cccgggcagc cagcgccagt cggagccagc gcgagccgcc gccgccatca 60
ctgccgctgc caagtcctcc acccgctgcc cccgcc atg tct gct acc gct gcc 114
                                        Met Ser Ala Thr Ala Ala
                                        1               5

acg gcc ccc cct gct gcc ccg gct ggg gag ggt ggt ccc cct gca ccc 162
Thr Ala Pro Pro Ala Ala Pro Ala Gly Glu Gly Gly Pro Pro Ala Pro
            10                  15                  20

cct cca aac ctc acc agt aac agg aga ctg cag cag acc cag gcc cag 210
Pro Pro Asn Leu Thr Ser Asn Arg Arg Leu Gln Gln Thr Gln Ala Gln
        25                  30                  35

gtg gat gag gtg gtg gac atc atg agg gtg aac gtg gac aag gtc ctg 258
Val Asp Glu Val Val Asp Ile Met Arg Val Asn Val Asp Lys Val Leu
    40                  45                  50

gag cga gac cag aag ctg tcg gag ctg gac gac cgt gca gat gca ctc 306
Glu Arg Asp Gln Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu
55                  60                  65                  70

cag gcg ggg gcc tcc cag ttt gaa aca agc gca gcc aag ctc aag cgc 354
Gln Ala Gly Ala Ser Gln Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg
                75                  80                  85

... tac tgg tgg aaa ... ctc aag atg atg atc atc ttg gga gtg att 402
Lys Tyr Trp Trp Lys Asn Leu Lys Met Met Ile Ile Leu Gly Val Ile
            90                  95                  100

tgc ... atc ... ctc atc atc atc ata gtt tac ttc agc act 444
Cys Ala Ile Ile Leu Ile Ile Ile Ile Val Tyr Phe Ser Thr
        105                 110                 115

taaatccccg aggagtctgc cctgcctaga gaagggcctc tcccccaacc ctcagccgtt 504
cctccacctc tct 517



<210> 433 
<211> 485 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 21..311

<400> 433
tttgttttct aatagggcaa atg tct cta agc ttg gtg ttt aga gct gct tca 53
                      Met Ser Leu Ser Leu Val Phe Arg Ala Ala Ser
                      1               5                   10

tat ttt aaa cta gtt cca ttc cac agt tct agt tca aac cag ttt tta 101
Tyr Phe Lys Leu Val Pro Phe His Ser Ser Ser Ser Asn Gln Phe Leu
            15                  20                  25

cag cct cct ggg tgg gtc gtc ttg acc caa act ctt gtg ttg tta cat 149
Gln Pro Pro Gly Trp Val Val Leu Thr Gln Thr Leu Val Leu Leu His
        30                  35                  40

ttt gag agg ttt tca tac cag aat gta cca raa agt gca caa ggt aaa 197
Phe Glu Arg Phe Ser Tyr Gln Asn Val Pro Xaa Ser Ala Gln Gly Lys
    45                  50                  55

ggt aat tta cag cca gaa aca aat ata cat ttg ttt cat ttc ttg act 245
Gly Asn Leu Gln Pro Glu Thr Asn Ile His Leu Phe His Phe Leu Thr
60                  65                  70                  75

ttc cct aag cag ata agc aga aac ttg ttt aac tca tta ctt tgt ttg 293
Phe Pro Lys Gln Ile Ser Arg Asn Leu Phe Asn Ser Leu Leu Cys Leu
                80                  85                  90

atg tgt ctt aca tat ttt tgactaaaag ttatagaatg ttattcctct 341
Met Cys Leu Thr Tyr Phe
            95

gggggaatct tttacaggtg gaggaatggg gatagcagta ttgcctcaga ttcaaactgg 401
catcaccata aaccctgtag gccagggtgg aatgaagtca gctccttttt atagttgaaa 461
tacaattttt tattcaccat tgtc 485



<210> 434 
<211> 511 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 31. .369 



<400> 434 

acttgctctg cccgctcaaa cagttgcagg atg teg atg aca gac ttg ctg aac 54 

Met Ser Met Thr Asp Leu Leu Asn 



get 


gag 


gac 


ate 


aag 


aag 


gcg 


gtg 


1 

gga 


gcc 


ttt 


age 


5 
get 


acc 


gac 


tec 


102 


Ala 


Glu 


Asp 


He 


Lys 


Lys 


Ala 


Val 


Gly 


Ala 


Phe 


Ser 


Ala 


Thr 


Asp 


Ser 






10 










15 










20 












ttc 


gac 


cac 




aag 


ttc 


ttc 


caa 


atg 


gtc 


ggc 


etg 


aag 


aaa 


aag 


agt 


150 


Phe 


Asp 


His 


Lys 


Lys 


Phe 


Phe 


Gin 


Met 


Val 


Gly 


Leu 


Lys 


Lys 


Lys 


Ser 




25 










30 










35 










40 




gcg 


gat 




gtg 




aag 


gtg 


ttt 


cae 


atg 


ctg 


gae 


aag 


gae 


aaa 


agt 


198 


Ala 


Asp 


Asp 


Val 


Lys 


Lys 


Val 


Phe 


His 


Met 


Leu 


Asp 


Lys 


Asp 


Lys 


Ser 












45 










50 










55 






ggc ttc atc gag gag gat gag ctg gga ttc atc cta aaa ggc ttc tcc 246
Gly Phe Ile Glu Glu Asp Glu Leu Gly Phe Ile Leu Lys Gly Phe Ser
60 65 70

cca gat gcc aga gac ctg tct gct aaa gaa acc aag atg ctg atg gct 294
Pro Asp Ala Arg Asp Leu Ser Ala Lys Glu Thr Lys Met Leu Met Ala
75 80 85

gct gga gac aaa gat ggg gac ggc aaa att ggg gtt gac gaa ttc tcc 342
Ala Gly Asp Lys Asp Gly Asp Gly Lys Ile Gly Val Asp Glu Phe Ser
90 95 100

act ctg gtg gct gaa ast aag aag cac tgactgcccc tggtcttcca 389
Thr Leu Val Ala Glu Xaa Lys Lys His
105 110

cctctctgcc ctgaacaccc aatctcggcc cctctcgcca ccctcctgca tttctgttca 449

gttcgtttat gttatttttt actcccccat cccctgtggc cctctaatga caccattctt 509

ct 511 



<210> 435 
<211> 511
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 82..417

<400> 435 

cccttctgcg cggtcacgcg agccagcgcc tgggcctgga accgggccgt agcccccagt 60 



ttcgcccacc acctccctac c atg gac ccc cgc aaa gtg aac gag ctt cgg 111
Met Asp Pro Arg Lys Val Asn Glu Leu Arg
1 5 10




gcc 


ttt 


gtg 


aaa 


atg 


tgt 


aag 










gtt 








gag 




Ala 


Phe 


Val 


Lys 


Met 


Cys 


Lys 


Gin 


Asp 


Pro 


Ser 


Val 


Leu 


His 


Thr 


Glu 












15 










20 










25 






gaa atg cgc ttc ctg agg gag tgg gtg gag agc atg ggt ggt aaa gta 207
Glu Met Arg Phe Leu Arg Glu Trp Val Glu Ser Met Gly Gly Lys Val
30 35 40








cca 


cot 


get 


act 


caa 


aaa 


get 


aaa 


tea 




gaa 


aat 




aag 


gaa 


gaa 


255 


Pro 


Pro 


Ala 


Thr 


Gin 


Lys 


Ala 


Lys 


Ser 


Glu 


Glu 


Asn 


Thr 


Lys 


Glu 


Glu 








45 










50 










55 










aaa cct gat agt aag aag gtg gag gaa gac tta aag gca gac gaa cca 303
Lys Pro Asp Ser Lys Lys Val Glu Glu Asp Leu Lys Ala Asp Glu Pro
60 65 70

tca agt gag gaa agt gat cta gaa att gat aaa gaa ggt gtg att gaa 351
Ser Ser Glu Glu Ser Asp Leu Glu Ile Asp Lys Glu Gly Val Ile Glu
75 80 85 90

cca gac act gat gct cct caa gaa atg gga gat gaa aat gcg gag ata 399
Pro Asp Thr Asp Ala Pro Gln Glu Met Gly Asp Glu Asn Ala Glu Ile
95 100 105

acg gct caa cat ttt ttg tagccacagg cttccacgga cttcacgtca 447
Thr Ala Gln His Phe Leu
110 

ttattggctc aactttcctc actat 

ccea 

<210> 436 
<211> 475 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 22..375

<400> 436 

agtcgagcca gcgtcgccgc g atg 
Met 
1 

acg gag ctg acc aga ctt ttc 

Thr Glu Leu Thr Arg Leu Phe 
15 

tat ate aee ttg aag aag tat 
Tyr He Thr Leu Lys Lys Tyr 
30 

aaa ggt act gtg gag ggc ttt 
Lys Gly Thr Val Glu Gly Phe 
45 

aga get ace gat ggg aag aag 
Arg Ala Thr Asp Gly Lys Lys 

60 65 
gaa gtg aat aag ttt cag atg 






ctgct tcatecgcca aetaatattt cactttacat 507 

511 



gtg 


ttg 


ttg 


gag 


age 


gag 


cag 


ttc 


Ctg 


51 


Val 


Leu 


Leu 


Glu 


Ser 


Glu 


Gin 


Phe 












5 










10 




cag 


aag 


tge 


egg 


aeg 


teg 


ggc 


age 


gte 


99 


Gin 




Cys 


Arg 


Thr 


Ser 


Gly 


Ser 


Val 








20 










25 






gac 


ggt 


cga 


acc 


aaa 


ccc 


att 


cca 


aag 


147 


Asp 


Gly 




Thr 


Lys 


Pro 


He 


Pro 


Lys 






35 










40 








gag 


ccc 


gca 


gac 


aac 


aag 


tgt 


Ctg 


tta 


195 


Glu 


Pro 


Ala 


Asp 


Asn 


Lys 


Cys 


Leu 


Leu 




50 










55 










aag 


ate 


age 


act 


gtg 


gtg 


age 


tee 


aag 


243 


Lys 


He 


Ser 


Thr 


Val 


Val 


Ser 


Ser 


Lys 





70 



gct tat tca aac ctc ctt aga gct aac 291
Glu Val Asn Lys Phe Gln Met Ala Tyr Ser Asn Leu Leu Arg Ala Asn

75 80 85 90

atg gat ggg ttg aag aag aga gac aaa aag aac gaa act aag aag acc 339

Met Asp Gly Leu Lys Lys Arg Asp Lys Lys Asn Glu Thr Lys Lys Thr

95 100 105

aaa gca gca gca gca gca gca gca aca gca gca cag taaagggcat 385
Lys Ala Ala Ala Ala Ala Ala Ala Thr Ala Ala Gln

110 115
acatttcctg ctttcaccaa ttaaccactg aattgctatt ttttcctttt ggccagatag 445
ctaggtttct ggttccccca cagtaggtgt 475 



<210> 437 

<211> 896 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> 351..677



<220> 

<221> misc_feature
<222> 234 

<223> n=a, g, c or t 



<400> 437 

gcacgagacc attacataaa aacaaacagg aatgcattct tgacatttcc gaacacacat 60 

taagtgaaaa tgacttagaa gaactaaggg tagatcacta taaatgtaac atacaggcat 120 

ctgtacatgt ttctgatttc agtacagata atagtggatc tcaaccaaaa cagaagtcag 180 

atactgtgct ttttccagca aaggatctca aggaaaagga ccttcattca atanttactc 240 

atgattctgg tctgataaca ataaacagtt cacaagagca cctaactgtt caggcaaagg 300 

ctccattcca tactcctcct gaggaaccca atgaatgtga cttcaagaat atg gat 356 

Met Asp 



agt 


tta 


cct 


tct 


ggt 


aaa 


ata 


ert 


ega 


aaa 


gtg 




ata 


ata 


tta 


gga 


404 


Ser 


Leu 


Pro 
5 


Ser 


Gly 


Lys 


He 


Xaa 
10 


Arg 


Lys 


Val 


Lys 


He 
15 


He 


Leu 


Gly 




cga 


aat 


aga 


aaa 


gaa 


aat 


ctg 


gaa 




aat 


get 


gaa 


ttt 


gat 


aaa 


aga 


452 


Arg 


Asn 
20 


Arg 


Lys 


Glu 


Asn 


Leu 
25 


Glu 


Pro 


Asn 


Ala 


Glu 
30 


Phe 


Asp 


Lys 


Arg 




act gaa ttt wtt aca cra gaa gaa aac aga att tgt agt tca ccg gta 500
Thr Glu Phe Xaa Thr Xaa Glu Glu Asn Arg Ile Cys Ser Ser Pro Val
35 40 45 50

cag tct tta cta gac ttg ttt cag act agt gaa gag aaa tca gaa ttt 548
Gln Ser Leu Leu Asp Leu Phe Gln Thr Ser Glu Glu Lys Ser Glu Phe
55 60 65

ttg ggt ttc aca agc tac aca gaa aag agt ggt ata tgc aat gtt tta 596
Leu Gly Phe Thr Ser Tyr Thr Glu Lys Ser Gly Ile Cys Asn Val Leu
70 75 80

gat att tgg gaa gag gaa aat tca gat aat ctg tta aca gcg ttw ttc 644
Asp Ile Trp Glu Glu Glu Asn Ser Asp Asn Leu Leu Thr Ala Xaa Phe
85 90 95

tcg tcc cct tca act tct aca ttt act ggc ttt tagaatttaa aaaatgcata 697
Ser Ser Pro Ser Thr Ser Thr Phe Thr Gly Phe

100 105
cttttcagaa gtgataagga tcatattctt gaaattttta taaatatgta tggaaattct 757
taggattttt ttaccagctt tgtttacaga cccaaatgta aatattaaaa ataaatattt 817
gcaattttct acagaattga atacctgtta aagaaaaatt acagaataaa cttgtgactg 877
gtaaaaaaaa aaaaaaatg 896 



<210> 438 
<211> 576 
<212> DNA 
<213> Homo sapiens

<220> 
<221> CDS 
<222> 162..470

<400> 438 

cattttaatt catatytgag tgggcggtgg cgattggtgt tggcggtctg gctcagctgg 60 
gcagggggta actttactga tttgggggtg gtttttagtt taatttttct tttctagctt 120 
cccatcgacg gtcagtgcgc acgttgtaat cagctgaggc c atg tca gga gac gga 176

Met Ser Gly Asp Gly 



gcc 


acg 


gag 


cag 


gca 


get 


gag 


tat 


gtc 




gag 


1 

aag 


gtg 


aag 


aaa 


5 
gcg 


224 


Ala 


Thr 


Glu 


Gin 


Ala 
10 


Ala 


Glu 


Tyr 


Val 


Pro 
15 


Glu 


Lys 


Val 


Lys 


Lys 
20 


Ala 




gaa aag aaa tta gaa gag aat cca tat gac ctt gat gct tgg agc att 272
Glu Lys Lys Leu Glu Glu Asn Pro Tyr Asp Leu Asp Ala Trp Ser Ile
25 30 35




etc 


att 




gag 






aat 




cct 


ata 


gac 




gca 


egg 


aag 


act 


320 


Leu 


He 


4o'^ 


Glu 


Ala 


Gin 


Asn 


Gin 
45 


Pro 


He 


Asp 


Lys 


Ala 
50 


Arg 


Lys 


Thr 




tat gaa cgc ctt gtt gcc cag ttc ccc agt tct ggc aga ttc tgg aaa 368
Tyr Glu Arg Leu Val Ala Gln Phe Pro Ser Ser Gly Arg Phe Trp Lys
55 60 65




ctg 


tac 


att 


gaa 


gca 


gag 


gtt 


act 


att 


tta 


ttt 


tat 


ttt 




tta 


tat 


416 


Leu 


Tyr 


He 


Glu 


Ala 


Glu 


Val 


Thr 


He 


Leu 


Phe 


Tyr 


Phe 


Phe 


Leu 


Tyr 




70 










75 










80 










85 




cag 


tat 


tgc 


age 


att 


cae 


tgt 








aaa 


caa 


gtt 


agg 


aac 


ata 


464 


Gin 


Tyr 


Cys 


Ser 


He 

90 


His 


Cys 


Ser 


Asp 


Arg 

95 


Lys 


Gin 


Val 




Asn 
100 


He 





gcc aat taggacaagg aggatttaaa tgtgtcttac ctttattttg taaaataggt 520
Ala Asn

ataaaggagt aattaaaatg aatttttgaa atttgggtct tttacaagct gatgat 576



<210> 439 
<211> 484 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 32..349

<400> 439 



actaactagc aaacggggac tagaaatagg g atg ctg aaa agc aac ggg gag 52
Met Leu Lys Ser Asn Gly Glu 
1 5 





cgc 


agt 


cgt 




gea 


ctt 


ceg 


gcg gtc 




gcg 


agg 


aag 




get 


100 


Arg 


Arg 


Ser 


Arg 


Asn 


Ala 


Leu 


Pro 


Ala 


Val 


Tyr 


Ala 




Lys 


Met 


Ala 








10 










15 










20 










gca tcc cag cag caa gct tca gcg gct tcc tca gct gct ggt gta tcg 148
Ala Ser Gln Gln Gln Ala Ser Ala Ala Ser Ser Ala Ala Gly Val Ser
25 30 35

ggt cct agt tcg gct ggc ggc ccg ggt ccc cag cag cag ccg caa ccg 196
Gly Pro Ser Ser Ala Gly Gly Pro Gly Pro Gln Gln Gln Pro Gln Pro
40 45 50 55




cea 


gca 




etg 


gtg 


ggc 


eet 


gee 


eag 


age 


ggc 


ete 


etg 


eag 


caa 


eag 


244 


Pro 


Ala 


Gin 


Leu 


Val 


Gly 


Pro 


Ala 


Gin 


Ser 


Gly 


Leu 


Leu 


Gin 


Gin 


Gin 












60 










65 










70 






caa cag gac ttc gat cct gtg cag cgt tat aag atg ctc atc ccg cag 292
Gln Gln Asp Phe Asp Pro Val Gln Arg Tyr Lys Met Leu Ile Pro Gln
75 80 85




ctg 


aag 


gag 


agt 


eta 


cag 


gtg 


att 


ggc 


ctt 


aag 


cag 


cga 


gaa 


gea 




340 



WO 02/083898 PCT/IB01/00914

Leu Lys Glu Ser Leu Gln Val Ile Gly Leu Lys Gln Arg Glu Ala Asn

90 95 100

tgg att tgg tagtctagag gggcaggccg rgggccaggt tagtcggaga 389
Trp Ile Trp
105 

gagcgctaag cagaaagagg aatagaacct gctgttttgg cctggctggt gcacgcctgt 449 
gttccagcac tttgggaggc cgaggcgkgc ggatc 484 

<210> 440 
<211> 523 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114..371

<400> 440 

ctctctcagc ttccggctgg tagtagttcc gcttcctgtc cgactgtggt gtctttgctg 60 
agggtcacat tgagctgcag gttgaatccg gggtgccttt aggattcagc acc atg 116 

Met 



gcg gaa gac atg gag acc aaa atc aag aac tac aag acc gcc cct ttt 164
Ala Glu Asp Met Glu Thr Lys Ile Lys Asn Tyr Lys Thr Ala Pro Phe
5 10 15

gac agc cgc ttc ccc aac cag aac cag act aga aac tgc tgg cag aac 212
Asp Ser Arg Phe Pro Asn Gln Asn Gln Thr Arg Asn Cys Trp Gln Asn
20 25 30

tac ctg gac ttc cac cgc tgt cag aag gca atg acc gct aaa gga ggc 260
Tyr Leu Asp Phe His Arg Cys Gln Lys Ala Met Thr Ala Lys Gly Gly
35 40 45

gat atc tct gtg tgc gaa tgg tac cag cgt gtg tac cag tcc ctc tgc 308
Asp Ile Ser Val Cys Glu Trp Tyr Gln Arg Val Tyr Gln Ser Leu Cys
50 55 60 65




ccc 




tec 


tgg 


gtc 


aca 


gac 


tgg 


gat 


gag 


caa 


egg 


get 


gaa 


gge 


aeg 


356 


Pro 


Thr 


Ser 


Trp 


Val 


Thr 


Asp 


Trp 


Asp 


Glu 


Gin 


Arg 


Ala 


Glu 


Gly 


Thr 












70 










75 










80 







ttt ccc ggg aag atc tgaactggct gcatctccct ttcctctgtc ctccatcctt 411
Phe Pro Gly Lys Ile
85

ctcccaggat ggtgaagggg gacctggtac ccagtgatcc ccaccccagg atcctaaatc 471
atgacttacc tgctaataaa aactcattgg aaaagyraaa aaaaaaaaaa aa 523



<210> 441 
<211> 499 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 95..409

<400> 441 

ttcgaagtgg ttgtstctgc actatttact gaaaagttca tcctttcccc agtgatttgt 60
aacactgcct ctttcataaa ttaaattttt gtgt atg tgt gtg ggt ctt ttg atg 115
Met Cys Val Gly Leu Leu Met
1 5

gtt tct att ctg act gac atc aat ttg tct aat ctt gta gca gta cag 163

Val Ser Ile Leu Thr Asp Ile Asn Leu Ser Asn Leu Val Ala Val Gln

10 15 20

tac agt cct gat tat tgc aac ttt agg aaa agg tct gat aaa aac caa 211
Tyr Ser Pro Asp Tyr Cys Asn Phe Arg Lys Arg Ser Asp Lys Asn Gln
25 30 35
gat gcc tcc aca ttt tgt cat aat tgt aac caa ttt cac ctc tgt ctc 259

Asp Ala Ser Thr Phe Cys His Asn Cys Asn Gln Phe His Leu Cys Leu

40 45 50 55

cag tat cac cac aaa att gtt ctt cct tgg agt gtc ttg gct att ctt 307

Gln Tyr His His Lys Ile Val Leu Pro Trp Ser Val Leu Ala Ile Leu

60 65 70

agc caa ctg ttc ctc cat att act tct aga att aga cca aca att tat 355
Ser Gln Leu Phe Leu His Ile Thr Ser Arg Ile Arg Pro Thr Ile Tyr

75 80 85

aaa tca aac aaa ccc aag agc att gaa att ttg att gga ttt gta ttg 403
Lys Ser Asn Lys Pro Lys Ser Ile Glu Ile Leu Ile Gly Phe Val Leu

90 95 100

aat tta tagattaatc tggtaaacca tgtcatcttt acaatgttgt cttccaatac 459
Asn Leu
105

atgaatatgg tacagctctt catttactta ggcctttgaa 499



<210> 442 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 75..377



<400> 442 

ccctcgcgga gcttactgag cgcggccccg agcccagctc cgccgccgag cgcctgtgcc 60
ggcacggmwa cacc atg gag cgc ccg gat aag gcg gcg ctg aac gca ctg 110
Met Glu Arg Pro Asp Lys Ala Ala Leu Asn Ala Leu
1 5 10

cag cct cct gag ttc aga aat gaa agc tca tta gca tct aca ctg aag 158
Gln Pro Pro Glu Phe Arg Asn Glu Ser Ser Leu Ala Ser Thr Leu Lys
15 20 25

acg ctc ctg ttc ttc aca gct tta atg atc act gtt cct att ggg tta 206
Thr Leu Leu Phe Phe Thr Ala Leu Met Ile Thr Val Pro Ile Gly Leu
30 35 40




tat 


ttc 




act 




tct 


tac 


ata 


ttt 


gaa 


ggc 


gcc 


ett 


ggg 


atg 


tec 


254 


Tyr 


Phe 


Thr 


Thr 


Lys 


Ser 


Tyr 


He 


Phe 


Glu 


Gly 


Ala 


Leu 


Gly 


Met 


Ser 




45 










50 










55 










60 




aat agg gac agc tat ttt tac gct gct att gtt gca gtg gtc gcc gtc 302
Asn Arg Asp Ser Tyr Phe Tyr Ala Ala Ile Val Ala Val Val Ala Val
65 70 75




cat 


gtg 


gtg 


ctg 


gcc 


etc 


ttt 


gtg 


tat 


ktg 




tgg 


aat 


gaa 


ggc 


tea 


350 


His 


Val 


Val 


Leu 
80 


Ala 


Leu 


Phe 


Val 


Tyr 
85 


Xaa 


Ala 


Trp 


Asn 


Glu 
90 


Gly 


Ser 




cga 




tgk 


egt 


gaa 


ggc 


aaa 


cag 


gat 


taaagtgaac ateacctttt 




397 


Arg 


Gin 


Xaa 


Arg 


Glu 


Gly 


Lys 


Gin 


Asp 



















95 100 
tatagcatta aattcatttt ttaaaatgat aaatgctgga gggggccatc tgatttgaat 457 
aaa 460 



<210> 443 
<211> 470 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 30..305



<400> 443 

agaatgtcgt actctcagga ggctccacc atg ttc agg gat ttc gga cgc cga 53
Met Phe Arg Asp Phe Gly Arg Arg
1 5

ctg cag agg gat ttg aag aga gtg gtg gat gct agg ctg agg ctc agc 101
Leu Gln Arg Asp Leu Lys Arg Val Val Asp Ala Arg Leu Arg Leu Ser
10 15 20

gag gag ctc agc ggc ggg agg atc aag ccg aag cct gtg gag gtc cag 149
Glu Glu Leu Ser Gly Gly Arg Ile Lys Pro Lys Pro Val Glu Val Gln
25 30 35 40

gtg gtc acg cat cac atg cag cgc tac gcc gtg tgg ttc gga ggc tcc 197
Val Val Thr His His Met Gln Arg Tyr Ala Val Trp Phe Gly Gly Ser
45 50 55

atg ctg gcc tcg act ccc gag ttc ttt cag gtc tgc cac acc aag aag 245
Met Leu Ala Ser Thr Pro Glu Phe Phe Gln Val Cys His Thr Lys Lys
60 65 70

gac tat gaa gag tac ggg ccc agc atc tgc cgc cac aac ccc gtc ttt 293
Asp Tyr Glu Glu Tyr Gly Pro Ser Ile Cys Arg His Asn Pro Val Phe
75 80 85

gga gtc atg tcc tagtgtctgc ctgaacgcgt cgttcgatgg tgtcacgttg 345
Gly Val Met Ser
90

gggaacaagt gtccttcaga acccagagaa ggccgccgtt ctgtaaatag cgacgtcggt 405
gttgctgccc agcagcgtgc ttgcattgcc ggtgcatgag gcgcggcgcg ggccttcagt 465
aaaag 470 

<210> 444 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 162..455

<400> 444 

agaatacgta caccaatgcg gacaagctcc tagaagcagc agagcagttg gctcagacgg 60 
gggaatgtga ccccgaggag atctacaagg cagctcgaca cctggaggtg cgcatccaag 120 
acttcgtgcg cagggtggag cagcggaagc ttctcctgga c atg tct gtt tcc ttc 176

Met Ser Val Ser Phe 



cac 




cac 


ace 


aaa 


gag 


ttg 


tgg 


aca 


tgg 


atg 


1 

gaa 


gac 


ctt 


cag 


5 

aag 


224 


His 


Thr 


His 


Thr 


Lys 
10 


Glu 


Leu 


Trp 


Thr 


Trp 

15 


Met 


Glu 


Asp 


Leu 


Gin 
20 


Lys 




gag atg ttg gag gat gtc tgt gca gat tct gtg gat gca gtc cag gaa 272
Glu Met Leu Glu Asp Val Cys Ala Asp Ser Val Asp Ala Val Gln Glu
25 30 35

ctg atc aag cag ttc cag cag cag cag acc gcc act cta kat gcc aca 320
Leu Ile Lys Gln Phe Gln Gln Gln Gln Thr Ala Thr Leu Xaa Ala Thr
40 45 50

ctc aat gtc atc aag gaa ggc gaa gac ctt atc cag cag ctc agg tca 368
Leu Asn Val Ile Lys Glu Gly Glu Asp Leu Ile Gln Gln Leu Arg Ser
55 60 65




geg 


eet 


eec 


tee 




ggg 


gag 




age 


gag 




agg 


tea 




tgg 


gea 


416 


Ala 


Pro 


Pro 


Ser 




Gly 


Glu 


Pro 


Ser 


Glu 


Ala 


Arg 


Ser 


Ala 


Trp 


Ala 




70 










75 










80 










85 






ctt 


tec 


agt 


ggg 


aaa 


tgc 


etc 


ggg 


eta 


gac 


gtg 


aga 


taagtcctgt 


465 


Xaa 


Leu 


Ser 


Ser 


Gly 


Lys 


Cys 


Leu 


Gly 


Leu 


Asp 


Val 


Arg 











90 95 



cccagtcttg actcagcaaa atcttgcttt gtggtcttgt gtgagccttc ggtttc 521

<210> 445 
<211> 625 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS
<222> 281..547

<400> 445 

atgctcgcca gaatacccct cgttaaaggc aaggcggctt ctggctcttc cgcagctcag 60 
ttatgggttc ctgtgtgctt agtcatcggg tccgacacag gctggactga tctggggagc 120 
cgcgaagggc ctgccttcac aaagggacgt aacgcaagta ctgcgggcag tgtttgaata 180 
tggccctgaa caatgtgtcc ctgtcctccg gtgatcagag gagcagggtg gcctaccgct 240 
cttcccatgg cgacctcaga ccgcgggcgt casgttggcg atg gtc tcc gga gac 295

Met Val Ser Gly Asp
1 5

ggc ttc ctc gtt tcc agg cct gaa gcg att cat cta gga cct cgg cag 343
Gly Phe Leu Val Ser Arg Pro Glu Ala Ile His Leu Gly Pro Arg Gln
10 15 20

gcg gtg cga cca agc gtt cgg gcc gag agc cgt cga gtg gat ggt ggc 391
Ala Val Arg Pro Ser Val Arg Ala Glu Ser Arg Arg Val Asp Gly Gly
25 30 35

ggc cgg agc cca agg gaa cca gat ggc cgg ggc cgg agc cgc caa gcs 439
Gly Arg Ser Pro Arg Glu Pro Asp Gly Arg Gly Arg Ser Arg Gln Ala
40 45 50

aga ttc tca cct tac cca atc cct gcc gtt gaa ccc gat ctc cta aga 487
Arg Phe Ser Pro Tyr Pro Ile Pro Ala Val Glu Pro Asp Leu Leu Arg
55 60 65

agt gtg ctg caa cag cgt ttg att gca tta gga ggt gtt atc gca gct 535
Ser Val Leu Gln Gln Arg Leu Ile Ala Leu Gly Gly Val Ile Ala Ala
70 75 80 85

cga att tca gtt taaacgaaca cctttcctct ggccctcact tagcttgtga 587
Arg Ile Ser Val

acaggccttt ttaaaatcct ttcttggtgt agcagcaa 625 

<210> 446 
<211> 534 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 8..523

<400> 446 



agtcaag atg gcg gga gca gct acc cag gct tcc ctg gag tcg gcc cca 49
Met Ala Gly Ala Ala Thr Gln Ala Ser Leu Glu Ser Ala Pro
1 5 10

cgg atc atg cgg ctg gtg gcc gaa tgc agc cgc tcc agg gcc cgg gca 97
Arg Ile Met Arg Leu Val Ala Glu Cys Ser Arg Ser Arg Ala Arg Ala
15 20 25 30

ggc gag ctg tgg ctg ccg cat ggg aca gtg gcc act cct gtg ttc atg 145
Gly Glu Leu Trp Leu Pro His Gly Thr Val Ala Thr Pro Val Phe Met
35 40 45




cca 


gtg 


ggc 


acg 


cag 


gcc 




atg 




ggc 


ate 


acg 


acc gaa cag ctg 


193 


Pro 


Val 


Gly 


Thr 
50 


Gin 


Ala 


Thr 


Met 


Lys 
55 


Gly 


He 


Thr 


Thr Glu Gin Leu 
60 




gac 




ctg 


ggt 


tgc 


cgc 


ate 


tgc 


ctg 


ggc 


aat 


acc 


tac cat ctg ggt 


241 


Asp 


Ala 


Leu 
65 


Gly 


Cys 


Arg 


He 


Cys 
70 


Leu 


Gly 


Asn 


Thr 


Tyr His Leu Gly 
75 




eta 


agg 


ccg 


gga 


cce 


gag 


etg 


ate 


cag 




gcc 




ggt etc cac ggc 


289 


Leu 


Arg 
80 


Pro 


Gly 


Pro 


Glu 


85 


He 


Gin 


Lys 


Ala 


Asn 
90 


Gly Leu His Gly 




ttc atg aat tgg cct cat aat ctg cta acg gac agc ggc ggt ttc cag 337
Phe Met Asn Trp Pro His Asn Leu Leu Thr Asp Ser Gly Gly Phe Gln
95 100 105 110

atg gtg tcg ctg gtg tct ctg tcc gag gtg acg gag gag ggc gtc cgc 385
Met Val Ser Leu Val Ser Leu Ser Glu Val Thr Glu Glu Gly Val Arg

115 120 125

ttc cgc tcc ccc tac gac ggc aat gag acc ctg ctg agc ccg gag aaa 433
Phe Arg Ser Pro Tyr Asp Gly Asn Glu Thr Leu Leu Ser Pro Glu Lys

130 135 140

tcc gtg cag atc cag aat gcg ctg ggc tcg gac atc atc atg cag ctg 481
Ser Val Gln Ile Gln Asn Ala Leu Gly Ser Asp Ile Ile Met Gln Leu

145 150 155

gac gac gtg gtt agc agt act gtg act ggg cca cgt gtg gag 523
Asp Asp Val Val Ser Ser Thr Val Thr Gly Pro Arg Val Glu

160 165 170 

taggccatgt a 534 

<210> 447 
<211> 111 
<212> DNA 
<213> Homo sapiens 

<400> 447 

agagctgagc tgaagcggga cccggagccc gagcagccgc cgccatggca atcaaatttc 60 
tggaagtcat caagcccttc tgtgtcatcc tgcaactaac atctgtgaag c 111 

<210> 448 
<211> 150 
<212> DNA 
<213> Homo sapiens 

<400> 448 

ttttctcaac aattcctcac cgcagagcct ctgaagctcc caccaggcca gctctcctcc 60 

acccaccgct aactctcagc cccagtcacc ctcttggagc ttccctgctt tgaattaaag 120

accactcatg cgcaaaaaaa aaaaaaaaaa 150 

<210> 449 
<211> 465 
<212> DNA 
<213> Homo sapiens 

<400> 449 

aaccaaagca tgggctcttt gtttcctctt tgccaggcct gggtccttcc cagtgacccc 60 

tcagccttca cttcctcctc actcccgggg agctgtgaca ggaagaggag atcccagaac 120 

tggagattga cgtggatgag ctcctggaca tggagagtga cgatgcccgg gctgccaggg 180 

tcaaggagct gctggttgac tgttacaaac ccacagaggc cttcatttct ggcctgctgg 240

acaagatccg gggcatgcag aagctgagca caccccagaa gaagtgaggg tccccgaccc 300

aggagaacgg tggctcccac aggacaawtc gctgcccccc aacctcgtag caacakgcaa 360 

taccggggga ccctgcggcc aggctggtgc catgagcagg gctcctcgtg cccctggccc 420 

aggggtctct tcccctgccc cctcagtttt ccacttttgg ggtyt 465 

<210> 450 
<211> 394 
<212> DNA 
<213> Homo sapiens 

<400> 450 

agtctgaaga tggcggcctc agcagcgcga ggtgctgcgg cgctgcgtag aagtatcaat 60

cagccggttg cttttgtgag aagaattcct tggactgcgg cgtcgagtca gctgaaagaa 120

cactttgcac agttcggcca tgtcagaagg tgcattttac cttttgacaa ggagactggc 180 

tttcacagag gtttgggttg ggttcagttt tcttcagaag aaggacttcg gaatgcacta 240 

caacaggaaa atcatattat agatggagta aaggtccagg ttcacactag aaggccaaaa 300 

cttccgcaaa catctgatga tgaaaagaaa gatttttgag actgcagcct attaataaag 360 

ttaacataac tgacaaaaaa aaaaaaaaaa arsc 394



<210> 451 
<211> 391
<212> DNA 
<213> Homo sapiens 

<400> 451 

aagtctgaag atggcggcct cagcagcgcg aggtgctgcg gcgctgcgta gaagtatcaa 60 

tcagccggtt gcttttgtga gaagaattcc ttggactccg gcgtcgagtc agctgaaaga 120 

acactttgca cagttcggcc atgtcagaag gtgcatttta ccttttgaca aggagactgg 180 

ctttcacaga ggtttgggtt gggttcagtt ttcttcagaa gaaggacttc ggaatgcact 240 

acaacaggaa aatcatatta tagatggagt aaaggtccag gttcacacta gaaggccaaa 300 

acttccgcaa acatctgatg atgaaaagaa agatttttga gactgcagcc tattaataaa 360 

gttaacataa ccaaaaaaaa aaaaaaaaag s 391 

<210> 452 
<211> 427 
<212> DNA 
<213> Homo sapiens 

<400> 452 

cccartcaga gggctgaaca tgtccgagtt tgtctgtaac ctgtcagcaa ggccttatgt 60

gtacgacctc attgccgtgt ccaatcatta tggagccatg ggggttggcc actactaaag 120 

cagcttatgt gctattttac caacgtcgag atgatgaatt ttataagaca ccttcactta 180 

gcagttctgg ttcctctgat ggagggacac gaccaagcag ctctcagcag ggctttgggg 240 

atgatgaggc ttgcagcatg gacaccaact aatgctgact ccacgatcct gccaccctgt 300 

agcgccagtg taatccccca ggagaacatc tttgacactc tgcagactgc tagtgttctg 360 

tctaaaaacc agacaaggaa atacccttct tttatgagca gaaggaaaca aaaaaaaaaa 420 

aaaaagg 427 

<210> 453 
<211> 438 
<212> DNA 
<213> Homo sapiens 

<400> 453 

agatgttagg ctttgccaaa aactggatct ttagtatcac ttgagcactc attggacaca 60

agaggtcaac tgttttgtca tttgtgtagt aagaacaaaa taaagtgtgt ggcattaaag 120 

aaataacact gtctggccgg gatcggtggc tcacgcctgt aatcccagca ctttgggagg 180 

ccgaggcggg cagatcacga ggtcaggagt tcgagaccag cctgaccaac atgatgaaac 240 

cctgtctcta ctaaaaatac aaaaattagc caggcatggt ggtgcacacc tgtagtccca 300 

gctgctcagg aggctgaggc aggagaatcg ctcgaacccg ggaggtggag gttgcagtga 360 

gctgagatgg cgccactgca ttccaggcct ggggggtgac agagcaagac tcagtctcag 420

aaaaaaaaaa aaaaaaag 438 

<210> 454 
<211> 394 
<212> DNA 
<213> Homo sapiens 

<400> 454 

agtctgaaga tggcggcctc agcagcgcga ggtgctgcgg cgctgcgtag aagtatcaat 60

cagccggttg cttttgtgag aagaattcct tggactgcgg cgtcgagtca gctgaaagaa 120 

cactttgcac agttcggcca tgtcagaagg tgcattttac cttttgacaa ggagactggc 180 

tttcacagag gtttgggttg ggttcagttt tcttcagaag aaggacttcg gaatgcacta 240 

caacaggaaa atcatattat agatggagta aaggtccagg ttcacactag aaggccaaaa 300 

cttccgcaaa catctgatga tgaaaagaaa gatttttgag actgcagcct attaataaag 360

ttaacataac tgamaaaaaa aaaaaaaaaa aggc 394 

<210> 455 
<211> 713 
<212> DNA 
<213> Homo sapiens 



<400> 455 
atataggcgt ggctgaatac gggcctcctg cgtacgtgcg tggtgtacgt acgtgcgtga 60

ttacgcgcam acgtacgttc ctcatgaaag ggacgacggg agctgcatga aagccgaagt 120

tatggaccgc tagcatctgt cactggccac cggtttccgg gagtaagcgg cagctacctt 180 

acagccctga cacgagccgg gtgctctctc ttctcaccgc ggcccacgtc tcctcgctgg 240 

ctccggtggc ctcgctgggt cgcgaggagg cggaggactg tactctgagg ccaaaagcca 300 

gagtcggccc tgaacgccca cgactctcag ggtccagagg ccgtgagacc ggccggctga 360 

aaggtaaaga aaccaagtgg aagagtgttt cctcctctgg ccgtaaagca ggtactctct 420 

gcagcaccag ctgtccccgc cctactccgg accgccmcaa agactccatg ggatggacct 480

gagtcagccg aatcctagcc ccttcccttg ggcctgctgt ggtgctcgac atcagtgaca 540 

gacggaagca gcagaccatc aaggctacgg gaggcccggg gcgcttgcga agatgaagtt 600 

tggctgcctc tccttccggc agccttatgc tggctttgtc ttaaatggaa tcaagactgt 660 

ggagacgcgc tggcgtcctc tgctgagcag ccagcggaac tgtaccatcg ccg 713 



<210> 456 
<211> 128 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -34..-1



<400> 456 
























Met 


Ser 


Leu 


Arg 


Leu Asp 


Thr Thr 


Pro 


Ser 


Cys 


Asn 


Ser 


Ala 




Pro 










-30 






-25 










-20 




Leu His Ala Leu Gln Val Leu Leu Leu Leu Ser Leu Leu Leu Thr Ala
-15 -10 -5

Leu Ala Ser Ser Thr Lys Gly Gln Thr Lys Arg Asn Leu Ala Lys Gly
1 5 10

Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys Met
15 20 25 30

Cys Ile Lys Thr Thr Ser Gly Ile His Pro Lys Asn Ile Gln Ser Leu
35 40 45

Glu Val Ile Gly Lys Gly Thr His Cys Asn Gln Val Glu Val Ile Ala
50 55 60

Thr Leu Lys Asp Gly Arg Lys Ile Cys Leu Asp Pro Asp Ala Pro Arg
65 70 75

Ile Lys Lys Ile Val Gln Lys Lys Leu Ala Gly Asp Glu Ser Ala Asp
80 85 90











<210> 457 

<211> 65 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -18..-1



<400> 457 

Met Lys Ala Ala Val Leu Thr Leu Ala Val Leu Phe Leu Thr Gly Ser
-15 -10 -5

Gln Ala Arg His Phe Trp Gln Gln Asp Glu Pro Pro Gln Ser Pro Trp
1 5 10

Asp Arg Val Lys Asp Leu Ala Thr Val Tyr Val Asp Val Leu Lys Asp
15 20 25 30

Ser Gly Arg Asp Tyr Val Ser Gln Phe Glu Gly Ser Ala Trp Glu Asn
35 40 45

Ser



<210> 458 
<211> 80 
<212> PRT 
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -26..-1

<220> 

<221> UNSURE 
<222> 31 

<223> Xaa = Ala,Val 
<400> 458 

Met Gly Arg Ala Met Val Ala Arg Leu Gly Leu Gly Leu Leu Leu Leu
-25 -20 -15

Ala Leu Leu Leu Pro Thr Gln Ile Tyr Ser Ser Glu Thr Thr Thr Gly
-10 -5 1 5

Thr Ser Ser Asn Ser Ser Gln Ser Thr Ser Asn Ser Gly Leu Ala Pro
10 15 20

Asn Pro Thr Asn Ala Thr Thr Lys Xaa Ala Gly Gly Ala Leu Gln Ser
25 30 35

Thr Ala Ser Leu Phe Val Val Ser Leu Ser Leu Leu His Leu Tyr Ser
40 45 50



<210> 459 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -27. .-1 



<400> 459 
























Met 


Ser 


Ser 


Ala 


Ala 


Gly Phe Cys Ala 


Ser 


Arg 


Pro 


Gly Leu Leu 


Phe 






-25 










-20 










-15 






Gly 
-10 


Leu 


Leu 


Leu 


Leu 


Pro 
-5 


Leu 


Val 


Val 


Ala 


Phe 
1 


Ala Ser Ala 


Glu 
5 


Ala 


Glu 


Glu 


Asp 


Gly Asp 


Leu 


Gin 


Cys 


Leu 


Cys 


Val 


Lys Thr Thr 


Ser 










10 










15 






20 




Gin 


Val 


Arg 


Pro 
25 




His 


He 


Thr 


Ser 
30 


Leu 


Glu 


Val 


Ile Lys Ala
35 


Gly 


Pro 


His 


Cys 


Pro 


Thr 


Ala 


Gin 


Leu 


He 


Ala 


Thr 


Leu 


Lys Asn Gly Arg 






40 










45 










50 




Lys 


He 
55 


Cys 


Leu 


Asp 


Leu 


Gin 
60 


Ala 


Pro 


Leu 


Tyr 


Lys 
65 


Lys He He 


Lys 


Lys 


Leu 


Leu 


Glu 


Ser 




















70 





























<210> 460 
<211> 86 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -18. .-1 



<400> 460 

Met Ile Pro Ala Val Val Leu Leu Leu Leu Leu Leu Val Glu Gln Ala
-15 -10 -5

Ala Ala Leu Gly Glu Pro Gln Leu Cys Tyr Ile Leu Asp Ala Ile Leu
1 5 10

Phe Leu Tyr Gly Ile Val Leu Thr Leu Leu Tyr Cys Arg Leu Lys Ile
15 20 25 30

Gln Val Arg Lys Ala Ala Ile Thr Ser Tyr Glu Lys Ser Asp Gly Val
35 40 45

Tyr Thr Gly Leu Ser Thr Arg Asn Gln Glu Thr Tyr Glu Thr Leu Lys
50 55 60

His Glu Lys Pro Pro Gln
65

<210> 461

<211> 92 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -24. .-1 



<400> 461 




















Met 


Val 


Met 


Gly 


Leu 
-20 


Gly Val Leu 


Leu 


Leu 
-15 


Val 


Phe Val Leu 


Gly 
-10 


Leu 


Gly 


Leu 


Thr 


Pro 
-5 


Pro 


Thr Leu Ala 


Gin 
1 


Asp 


Asn 


Ser Arg Tyr 
5 


Thr 


His 


Phe 


Leu 


Thr 


Gin 


His 


Tyr Asp Ala 


Lys 


Pro 


Gin 


Gly Arg Asp Asp Arg 




10 








15 








20 






Tyr 


Cys 


Glu 


Ser 


He 


Met Arg Arg 


Arg Gly 


Leu 


Thr Ser Pro 


Cys 


Lys 


25 










30 






35 






40 


Asp 


He 


Asn 


Thr 


Phe 


He His Gly 


Asn 


Lys 


Arg Arg Ser Arg 


Pro 


Ser 










45 






50 






55 




Val 


Lys 


Thr 


Arg 
60 


Met 


Glu Thr Leu 


Thr 
65 


Glu 


Lys 


Thr 







<210> 462 
<211> 111 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16. .-1 

<220> 

<221> UNSURE 
<222> 50 

<223> Xaa = Asp, Asn 
<400> 462 



Met 


Leu 


Leu 


He 


Leu 


Leu 


Ser 


Val 


Ala 


Leu 


Leu Ala 


Phe Ser Ser 


Ala 




-15 










-10 








-5 






Gin 


Asp 




Asp 


Glu 


Asp Val 


Ser 


Gin 


Glu 


Asp Val 


Pro Leu Val 


He 


1 








5 










10 




15 




Ser 


Asp 


Gly 


Gly 


Asp 


Ser 


Glu 


Gin 


Phe 


He 


Asp Glu 


Glu Arg Gin 


Gly 








20 










25 






30 




Pro 


Pro 




Gly 


Gly 


Gin 


Gin 


Ser 


Gin 


Pro 


Ser Ala 


Gly Asp Gly Asn 






35 










40 








45 




Gin 


Xaa 


Asp 


Gly 


Pro 


Gin 


Gin 


Gly 


Pro 


Pro 


Gin Gin 


Gly Gly Gin 


Gin 




50 










55 








60 






Gin 


Gin 


Gly 


Pro 


Pro 


Pro 


Pro 


Gin 


Gly Lys 


Pro Gin Gly Pro Pro 


Pro 


65 










70 










75 




80 


Gin 


Gly 


Gly 


Arg 


Pro 


Gin 


Gly Pro 


Pro 


Gin 


Gly Gin 


Ser Pro Gin 





85 90 95 



<210> 463 
<211> 106 
<212> PRT

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16. .-1 



<400> 463 














Met 


Leu 


Leu He 


Leu Leu Ser 


Val Ala 


Leu Leu Ala 


Phe Ser 


Ser 


Ala 




-15 




-10 




-5 








Gin 


Asp 


Leu Asp 


Glu Asp Val 


Ser Gin 


Glu Asp Val 


Pro Leu 


Val 


He 


1 






5 




10 




15 




Ser 


Asp 


Gly Gly 


Asp Ser Glu 


Gin Phe 


He Asp Glu 


Glu Arg 


Gin 


Gly 






20 




25 




30 






Pro 


Pro 


Leu Gly 


Gly Gin Gin Ser Gin 


Pro Ser Ala 


Gly Asp 


Gly Asn 






35 




40 




45 






Gin 


Asp 


Asp Gly 


Pro Gin Gin 


Gly Pro 


Pro Gin Gin 


Gly Gly 


Gin 


Gin 




50 




55 




60 








Gin 


Gin 


Gly Pro 


Pro Pro Pro 


Gin Gly 


Lys Pro Gin 


Gly Pro 


Pro 


Gin 


65 






70 




75 






80 


Gin 


Gly 


Gly Gin 


Ser Cys Cys 


Cys Asp 


Lys 









85 90 



<210> 464 
<211> 141 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -24. .-1 



<400> 464 

Met Pro Ser Ser Val Ser Trp Gly Ile Leu Leu Leu Ala Gly Leu Cys
-20 -15 -10

Cys Leu Val Pro Val Ser Leu Ala Glu Asp Pro Gln Gly Asp Ala Ala
-5 1 5

Gln Lys Thr Asp Thr Ser His His Asp Gln Asp His Pro Thr Phe Asn
10 15 20

Lys Ile Thr Pro Asn Leu Ala Glu Phe Ala Phe Ser Leu Tyr Arg Gln
25 30 35 40

Leu Ala His Gln Ser Asn Ser Thr Asn Ile Phe Phe Ser Pro Val Ser
45 50 55

Ile Ala Thr Ala Phe Ala Met Leu Ser Leu Gly Thr Lys Ala Asp Thr
60 65 70

His Asp Glu Ile Leu Glu Gly Leu Asn Phe Asn Leu Thr Glu Ile Pro
75 80 85

Glu Ala Gln Ile His Glu Gly Phe Gln Glu Leu Leu Arg Thr Leu Asn
90 95 100

Gln Pro Asp Ser Gln Leu Gln Leu Thr Thr Gly Lys Asn
105 110 115



<210> 465 
<211> 124 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -30. .-1 



<400> 465 

Met Pro Ala Cys Arg Leu Gly Pro Leu Ala Ala Ala Leu Leu Leu Ser 
-30


-25 








-20 






-15 


Leu Leu Leu Phe Gly 


Phe 


Thr Leu 


Val 


Ser 


Gly 


Thr Gly 


Ala Glu 


Lys 


-10 








-5 






1 




Thr Gly Val Cys Pro 


Glu 


Leu Gin 


Ala 


Asp 


Gin Asn Cys 


Thr Gin 


Glu 


5 




10 








15 






Cys Val Ser Asp Ser 


Glu 


Cys Ala 


Asp 


Asn 


Leu 


Lys Cys 


Cys Ser 


Ala 


20 




25 








30 






Gly Cys Ala Thr Phe 


Cys 


Ser Leu 


Pro 


Asn 


Asp 


Lys Glu 


Gly Ser 


Cys 


35 


40 








45 






50 


Pro Gin Val Asn He 


Asn 


Phe Pro 


Gin 


Leu 


Gly 


Leu Cys Arg Asp 


Gin 


55 








60 






65 




Cys Gin Val Asp Ser 


Gin 


Cys Pro 


Gly Gin 


Met 


Lys Cys 


Cys Arg 


Asn 


70 






75 








80 




Gly Cys Gly Lys Val 


Ser 


Cys Val 


Thr 


Pro 


Asn 


Phe 






85 




90 














<210> 466 


















<211> 124 


















<212> PRT 


















<213> Homo sapiens 


















<220> 


















<221> SIGNAL 


















<222> -30. .-1 


















<400> 466 


















Met Pro Ala Cys Arg 


Leu Gly Pro 


Leu 


Ala 


Ala 


Ala Leu 


Leu Leu 


Ser 


-30 


-25 








-20 






-15 


Leu Leu Leu Phe Gly 






Val


Ser 


Gly 


Thr Gly 


Ala Glu 


Lys 


-10 








-5 






1 




Thr Gly Val Cys Pro 


Glu 


Leu Gin 


Ala 


Asp 


Gin Asn Cys 


Thr Gin 


Glu 


5 




10 








15 






Cys Val Ser Asp Ser 


Glu 


Cys Ala 


Asp 


Asn 


He 


Lys Cys 


Cys Ser 


Ala 


20 




25 








30 






Gly Cys Ala Thr Phe 


Cys 


Ser Leu 


Pro 


Asn 


Asp Lys Glu Gly Ser 


Cys 


35 


40 








45 






50 


Pro Gin Val Asn He 


Asn 


Phe Pro 


Gin 


Leu 


Gly 


Leu Cys Arg Asp 


Gin 


55 








60 






65 




Cys Gin Val Asp Ser 


Gin 


Cys Pro 


Gly Gin 


Met 


Lys Cys 


Cys Arg 


Asn 


70 






75 








80 




Gly Cys Gly Lys Val 


Ser 


Cys Val 


Thr 


Pro 


Asn 


Phe 






85 




90 














<210> 467 


















<211> 69 


















<212> PRT 


















<213> Homo sapiens 


















<220> 


















<221> SIGNAL 


















<222> -17. .-1 


















<400> 467 


















Met Phe Pro Gly Phe 


Val 


Phe Leu 


Leu 


Phe 


Val 


He Ser 


Leu Ala 


Ala 


-15 




-10 








-5 






Ala Ala His Leu Trp 


Val 


Leu Ala 


Ala 


Phe 


Met 


Gly Arg 


He Thr 


Val 


1 


5 








10 






15 


Lys Val Cys Ser Phe 


Thr 


Pro Glu 


Ala 


Ser 


Lys 


Thr Val 


Ser Pro 


Pro 


20 








25 






30 




Glu Gly Ala Asn Asn 


Ser 


Arg Arg 


Thr 


Ala 


Phe 


Lys Ser 


Cys Asn 


Thr 


35 






40 








45 




His His Lys Gly Leu 



















50 
<210> 468
<211> 77 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -33. .-1 



Met Gly Pro 


Val 
-30 


Lys 


Gin 


Leu 


Lys Arg Met 
-25 


Phe 


Glu 


Pro 


Thr 
-20 


Arg 


Leu 


He Ala Thr 


He 


Met 


Val 


Leu 


Leu Ser Phe 


Ala 


Leu 


Thr 


Leu 


Cys 


Ser 


-15 










-10 






-5 








Ala Phe Trp 


Trp 


His 


Asn 


Met 


Gly Leu Ala 


Leu 


He 


Phe 


Cys 


He 


Leu 


1 






5 






10 










15 


Gin Ser Leu 


Ala 


Leu 
20 


Thr 


Trp 


Tyr Ser Leu 
25 


Ser 


Phe 


He 


Pro 


Phe 
30 


Ala 


Arg Asp Ala 


Val 
35 


Lys 


Lys 


Cys 


Phe Ala Val 
40 


Cys 


Leu 


Ala 









<210> 469 
<211> 146 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26. .-1 



Met 


Thr 


Met 


Arg 


Ser 


Leu Leu Arg Thr Pro 


Phe Leu Cys Gly 


Leu 


Leu 




-25 








-20 


-15 








Trp 


Ala 


Phe 


Cys 


Ala 


Pro Gly Ala Arg Ala 


Glu Glu 


Pro Ala 


Ala 


Ser 


-10 










-5 


1 




5 




Phe 


Ser 


Gin 


Pro 
10 


Gly 


Ser Met Gly Leu Asp 
15 


Lys Asn 


Thr Val 
20 


His 


Asp 


Gin 


Glu 


His 
25 


He 


Met 


Glu His Leu Glu Gly 
30 


Val He 


Asn Lys 
35 


Pro 


Glu 


Ala 


Glu 


Met 


Ser 


Pro 


Gin Glu Leu Gin Leu 


His Tyr 


Phe Lys Met 


His 




40 








45 


50 








Asp 


Tyr 


Asp 


Gly 


Asn 


Asn Leu Leu Asp Gly 


Leu Glu 




Thr 


Ala 


55 










60 


65 






70 


He 


Thr 


His 


Val 


His 
75 


Lys Glu Glu Gly Ser 
80 


Glu Gin 


Ala Pro 


Leu 
85 


Met 


Ser 


Glu 


Asp 


Glu 


Leu 


He Asn He He Asp 


Gly Val 


Leu Arg Asp 


Asp 








90 




95 




100 






Asp 


Lys 


Asn 
105 


Asn 


Asp 


Gly Tyr He Asp Tyr 
110 


Ala Glu 


Phe Ala 
115 


Lys 


Ser 


Leu 


Gin 

120 



















<210> 470 
<211> 62 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -46. .-1 



<400> 470 
Met Asp Gln


Leu 


Val 


Phe 


Lys 


Glu 


Thr He 


Trp Asn Asp 


Ala 


Phe 


Trp 










-40 






-35 










Gin Asn Pro 


Trp Asp 


Gin 


Gly Gly 


Leu Ala 


Val He 


He 


Leu 


Phe 


He 


-30 






-25 








-20 








-15 


Thr Ala Val 




Leu 


Leu 


He 


Leu 


Phe Ala 


He Val 


Phe 


Gly 


Leu 


Leu 






-10 








-5 








1 




Thr Ser Thr 


Glu 


Asn 


Thr 


Gin 


Cys 




Gly Glu 


Glu 


Glu 






5 
















15 








<210> 471 
























<211> 162 
























<212> PRT 
























<213> Homo sapiens 




















<220> 
























<221> SIGNAL 






















<222> -47..-1






















<400> 471
























Met Arg He 


Ala 


Asn 


Arg 


Thr Arg 


Phe Ser 


Ser Pro 


Phe 


Leu Ala Arg 


-45 










-40 






-35 








Gly Ala Gly 


Trp 


Thr 


His 


Gly Arg 


Gly Met 


Met Val 


Val 


Gly Thr Gly 


-30 








-25 






-20 










Thr Ser Leu 


Ala 


Leu


Ser 


Ser 


Leu 


Leu Ser 


Leu Leu 


Leu 


Phe 


Ala 


Gly 


-15 






-10 








-5 








1 


Met Gin Met 


Tyr 


Ser 


Arg 


Gin 


Leu 


Ala Ser 


Thr Glu 


Trp 


Leu 


Thr 


He 




5 










10 






15 






Gin Gly Gly 


Leu 


Leu 


Gly Ser 


Gly 


Leu Phe 


Val Phe 


Ser 


Leu 


Thr 


Ala 


20 










25 






30 








Phe Asn Asn 




Glu 


Asn 


Leu 


Val 


Phe Gly Lys Gly Phe 


Gin 


Ala 


Lys 


35 








40 






45 










He Phe Pro 


Glu 


He 


Leu 


Leu 


Cys 


Leu Leu 


Leu Ala 


Leu 


Phe 


Ala 


Ser 


50 






55 








60 








65 


Gly Leu He 


His 


Arg 


Val 


Cys 


Val 


Thr Thr Cys Phe 


He 


Phe 


Ser 


Met 




70 








75 








80 




Val Gly Leu 


Tyr 


Tyr 


He 


Asn 


Lys 


He Ser 


Ser Thr 


Leu 


Tyr 


Gin 


Ala 




85 










90 






95 






Ala Ala Pro 


Val 


Leu 


Thr 


Pro 


Ala 


Lys Val 


Thr Gly Lys 


Ser 


Lys 


Lys 


100 










105 






110 








Arg Asn 
























115 
























<210> 472 
























<211> 90 
























<212> PRT 
























<213> Homo sapiens 




















<220> 
























<221> SIGNAL 






















<222> -35..-1






















<400> 472 
























Met Glu Thr 


Gly 


Ala 


Ser 


Ala 


Ser 


He Pro 


Glu Leu 


He 


Cys 


Glu 


Ala 


-35 






-30 








-25 








-20 


Met Arg Arg 


He 


Trp 


Ser 




Gly 


Leu Gly 


Leu Val 


Thr 




Thr 


Ala 






-15 








-10 








-5 




Ser Trp Ala 


Ala 


Leu 


Phe 


His Asp 


Gly Phe 


Ala Val 


Leu 


Gly Gly Asn 




1 








5 






10 








He Val Ser 


Asp 


Leu 


Ser 


Thr 


Val 


Arg Phe 


Val Ala 


Gin 


Gin 


Gin 


His 


15 








20 






25 










Phe Gin Leu 


Leu 


Asp 


Val 


Trp 


Thr 


Arg Asn 


Phe Arg 


Lys 


Pro 


Leu 


Gly 


30 






35 








40 








45 


Ser Met Cys 


Phe 


Val 


Phe 


Leu 
50 55

<210> 473 
<211> 61 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -25. .-1 

<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = Ala,Val 
<400> 473 

Met Arg Leu Leu Gln Leu Leu Phe Arg Ala Ser Pro Ala Thr Leu Leu
-25 -20 -15 -10

Leu Val Leu Cys Leu Gln Leu Gly Ala Asn Lys Ala Gln Asp Asn Thr
-5 1 5

Arg Lys Ile Ile Ile Lys Asn Phe Asp Ile Pro Lys Ser Val Arg Pro
10 15 20

Asn Asp Glu Xaa Leu Gln Cys Leu Gln Phe Lys Gln Asn
25 30 35

<210> 474 
<211> 120 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -32. .-1 

<400> 474 



Met 


Ala 


Ser 
-30 


Pro 


Lys 


Gly 


Phe 


Phe 
-25 


Asn Tyr 


Leu 


Thr 


Tyr 
-20 


Phe 


Leu 


Ala 


Ala 


Gly Ala Val 


Thr 




Gly 


He 


Gly Phe 


Phe 


Ala 


Leu 


Ala 


Ser 


Ala 




-15 










-10 








-5 










Leu 


Trp 


Phe 


Leu 


He 


Cys 


Lys 


Arg 


Arg Glu 


He 


Phe 


Gin 


Asn 


Ser 




1 








5 








10 










15 




Phe 


Lys 


Ala 


He 


Asp 


Glu 


Arg 


Cys 


Arg Gin Arg 


Pro 


Ser 


Met 


Ala 


Lys 








20 










25 








30 






Ile


Lys 


Ser 


His 


Ser 


Gin 


Cys 


Val 


Phe He 


Ser Arg 


Asn 


Phe 


His 


Thr 






35 










40 








45 








Gly 




Phe 


Gin 


Leu 


Gin 


Gin 


Leu Lys He 


He 


Leu 


Lys 


Met 


Asn 


Pro 


50










55 








60 










Asn 


Leu 


Gin 


Gin 


Lys 


He 


Ser 


Phe 


Val He 


Pro 


Gin 


Arg 


Pro 


Ala 


Pro 


65 










70 








75 










80 


Gin 


Gin 


He 


Ala 


Ala 


Val 


Leu 


His 

















85 



<210> 475 

<211> 78 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -19. .-1 

<400> 475 
Met Lys Phe Phe






-15 


lie S Al A 


er 


e er a ^sp 




Phe His Glu Lys 


His 


15 




Glu Glu Ser Ser 


Lys 


T^r 




yr r eu ys 






50


<210> 476 




<211> 87 




<212> PRT 




<213> Homo sapiens 


<220> 




<221> SIGNAL 




<222> -20. .-1 




<400> 476 




Met Gin Lys Val 


Thr 


-20 




Val Leu Asp Ala 


Asn 




1 


Asp Trp His Ser 


Leu 


15 




Cys Ala Met Gly 


He 


30 




Phe Gly Gin Lys 


Ser 


45 




Thr Pro Gly Ser 


Ala 




65 



<210> 477 
<211> 105 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19. .-1 

<220> 

<221> UNSURE 
<222> 37 

<223> Xaa = * ,Lys 
<400> 477 

Met Arg Gly Ala Thr 
-15 

Ser Asp Cys Ala Val 
1 

Gly Ala Gly Thr Cys 
15 

Met Cys Thr Pro Leu 
30 

His Lys He Pro Phe 
50 

Leu Pro Asn Leu Leu 
65 

Ser Met Asp Leu Lys 
80 






Phe Ala Leu Val Leu Ala Leu Met He Ser Met 

-10 -5 
His Glu Lys Arg His His Gly Tyr Arg Arg Lys 

5 10 
His Ser Tyr His He Thr Leu Leu Pro Leu Phe 

20 25 
Ser Asn Ala Asn Glu Lys His Tyr Asn Leu Leu 
35 40 45 

Arg He Leu Ala Phe Ser He Val Thr 
55 



Leu Gly Leu Leu Val 
-15 

Asp Leu Glu Asp Lys 
5 

Gin Val Gly Gly Leu 
20 

He He Val Met Ser 
35 

Gly His His Pro Gly 
50 

Gin Ser 



Phe Leu Ala Gly Phe Pro 

-10 -5 

Asn Ser Pro Phe Tyr Tyr 
10 

He Cys Ala Gly Val Leu 
25 

Ala Lys Cys Lys Cys Lys 
40 

Glu Thr Pro Pro Leu He 

55 60 



Arg Val Ser He Met 
-10 

He Thr Gly Ala Cys 
5 

Cys Ala He Ser Leu 
20 

Gly Arg Xaa Gly Glu 
35 

Phe Arg Lys Arg Lys 
55 

Cys Ser Arg Phe Pro 
70 

Asn He Asn Phe 
85 



Leu Leu Leu Val Thr Val 
-5 

Glu Arg Asp Val Gin Cys 
10 

Trp Leu Arg Gly Leu Arg 
25 

Glu Cys His Pro Gly Ser 
40 45 
His His Thr Cys Pro Cys 
60 

Asp Gly Arg Tyr Arg Cys 
75 
<210> 478
<211> 102 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -36. .-1 

<220> 

<221> UNSURE 
<222> 31 

<223> Xaa = His,Arg 



<400> 478 



Met 


Ser 

-35 


Asn 


Thr 


Thr 


Val 


Pro 

-30 


Asn 


Ala 


Pro 


Gin 


Ala 

-25 


Asn 


Ser 


Asp 


Ser 


Met 


Val 


Gly 


Tyr 


Val 


Leu 


Gly 


Pro 


Phe 


Phe 




He 


Thr 


Leu 


Val 


Gly 


-20 










-15 










-10 










-5 


Val 


Val 


Val 


Ala 


Val 


Val 


Met 


Tyr Val Gin Lys 


Lys 


Lys 


Arg 


Val 


Asp 










1 








5 










10 








Leu 


Arg 


His 


His 


Leu 


Leu 


Pro 


Met 


Tyr 


Ser 


Tyr Asp 


Pro 


Ala 


Glu 






15 










20 










25 








Glu 


Leu 


Xaa 


Glu 


Ala 


Glu 


Gin 


Glu 


Leu 


Leu 


Ser Asp 


Met 


Gly 


Asp 


Pro 




30 










35 










40 










Lys 


Val 


Val 


His 


Gly Trp Gin 


Ser 


Gly 


Tyr 


Gin 


His 


Lys 


Arg 


Met 


Pro 


45 










50 










55 










60 


Leu 


Leu 


Asp 


Val 


Lys 


Thr 























65 



<210> 479 
<211> 137 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -18. .-1 



<400> 479 



Met 


Lys 


Ala 


Leu He Val Leu 


Gly 


Leu 


Val Leu 


Leu Ser 


Val 


Thr 


Val 








-15 




-10 






-5 






Gin 


Gly 


Lys 


Val Phe Glu Arg 


Cys 


Glu Leu Ala Arg Thr 


Leu 


Lys 


Arg 






1 


5 








10 








Leu 


Gly 


Met 


Asp Gly Tyr Arg 


Gly 


He 


Ser Leu 


Ala Asn 


Trp 


Met 


Cys 


15 






20 






25 








30 


Leu 


Ala 


Lys 


Trp Glu Ser Gly 


Tyr 


Asn 


Thr Arg 


Ala Thr 


Asn 


Tyr 










35 






40 






45 




Ala 


Gly 


Asp 


Arg Ser Thr Asp 


Tyr 


Gly 


He Phe 


Gin He 


Asn 


Ser 


Arg 








50 




55 






60 






Tyr 


Trp 


Cys 


Asn Asp Gly Lys 


Thr 


Pro 


Gly Ala 


Val Asn 


Ala 


Cys 


His 






65 




70 






75 








Leu 


Ser 


Cys 


Ser Gly Trp His 


Gly Glu 


He Val 


Val Lys 


Thr 


Glu 


Met 




80 




85 








90 








Ser 


Val 


Ser 


Met Phe Lys Val 


Val 


Glu 


Cys Asn 


Ser Arg 


He 


Phe 


Leu 


95 






100 






105 








110 


Leu 


Gin 




He Leu Ser Leu 


Ser 


His 













115 



<210> 480 
<211> 101 
<212> PRT 
<213> Homo sapiens
<220> 

<221> SIGNAL 
<222> -29. .-1 

<400> 480 



Met 


Arg 


Glu 


Glu 


Lys 


Lys 


Pro 


Phe 


Glu Arg 


Glu 


Arg 


Glu Ser 


Val 


Cys 










-25 








-20 








-15 




Val 


Cys 


Met 


Cys 


Val 


Phe 


Ser 


Thr 


Gin Gly 


Ala 


Leu 


Gly Glu 


Met 


Ala 








-10 










-5 






1 






Ala 


His 


Phe 


He 


Asp 


Glu 


Lys 


Leu 


Arg Pro 


Ser 


Glu 


Gly Asn 


Gly 


His 




5 










10 








15 








Arg 


Gly Thr 


Leu 


Asp 


Ser 


Leu 


Ser 


Ser Asp 


Gin 


Glu 


Ser Tyr 


He 


Pro 


20 










25 








30 








35 


Ser 


Thr 


Ala 


Asp 


Pro 


Thr 


Gin 


Ala 


Gly Pro 


Glu 


Leu 


Leu His 


Lys 


Asn 










40 








45 








50 




Leu 


Pro 


Val 


Thr 


Ser Arg 


Ser 


Gin 


Pro Leu 


Pro 


Ser 


Asp Leu Ala 


He 








55 










60 






65 






Pro 


Ala 


Ala 


Ala 


Leu 





















70 



<210> 481 
<211> 128 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -26. .-1 



<400> 481 



Met 


Cys 


Trp 


Leu Arg Ala Trp Gly 


Gin 


He 


Leu 


Leu 


Pro 


Val 


Phe 


Leu 




-25 




-20 








-15 










Ser 


Leu 


Phe 


Leu He Gin Leu Leu 


He 


Ser 


Phe 


Ser 


Glu 


Asn 


Gly 


Phe 


-10 






-5 






1 








5 




He 


His 


Ser 


Pro Arg Asn Asn Gin 


Lys 


Pro 


Arg Asp 


Gly Asn 


Glu 


Glu 








10 


15 










20 






Glu 


Cys 


Ala 


Val Lys Lys Ser Cys 


Gin 


Leu 


Cys 


Thr 


Glu 


Asp 


Lys 


Lys 






25 


30 










35 








Cys 


Val 


Trp 


Cys Ser Glu Glu Lys 


Ala 


Cys 


Lys 


Lys 


Tyr 


Cys 


Phe 


Pro 




40 




45 








50 










Tyr 


Phe 


Gly 


Cys Arg Phe Ser Ser 


He 


Tyr 


Trp 


Leu 


Asn 


Cys 


Lys 


Val 


55 






60 






65 










70 


Asp 


Met 


Phe 


Gly He Met Met Leu 




Leu 


He 


Ala 


Val 


Leu 


He 


Thr 








75 




80 










85 




Gly 


Phe 


Val 


Trp Tyr Cys Cys Ala 


Tyr 


His 


Phe 


Tyr 


Leu 


Gin Asp 


He 








90 


95 










100 







<210> 482 
<211> 142 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -24. .-1 

<220> 

<221> UNSURE 
<222> 80 

<223> Xaa = Leu, Arg, Trp 
<220>

<221> UNSURE 
<222> 79 

<223> Xaa = Lys,Asn 
<400> 482 



Met Arg Leu Ser 


Trp 


Phe 


Arg 


Val 


Leu 




-20 










Ser Ala Val Ala 


Thr 


Ala 


Thr 


Gly 


Ala 


-5 










1 


He Gly Val Lys 


Lys 


Arg 


Val 


Asp 


His 


10 






15 






Gly Asp Val Leu 


His 


Met 


His 


Tyr 


Thr 


25 




30 








Glu Phe Asp Ser 


Ser 


Leu 


Pro 


Gin 


Asn 




45 










Gly Thr Gly Gin 


Val 


He 


Lys 


Gly 


Trp 


60 










65 


Cys Glu Gly Glu 


Lys 


Arg 


Xaa 


Xaa 


Val 


75 








80 




Gly Glu Arg Gly Ala 


Pro 


Pro 




He 


90 






95 






Phe Glu Val Glu 


Leu 


Leu 


Lys 


He 


Glu 


105 




110 








<210> 483 












<211> 70 












<212> PRT 












<213> Homo sapiens 










<220> 












<221> SIGNAL 












<222> -24. .-1 












<400> 483 












Met Ala Ser Ser 


Thr 


Ser 


Val 


Ser 


Thr 




-20 










Pro Ala Cys Leu 


Gly 


Ser 


Trp 


Ala 


Glu 


-5 










1 


He Ser He Ala 


Ser 


Asp 


Phe 


Pro 


Thr 


10 






15 






Pro Pro Ala Pro 


Gly 


Met 


Val 


Pro 


He 


25 




30 








Trp Ala Phe Ser 


Pro 


Ala 










45 










<210> 484 












<211> 55 












<212> PRT 












<213> Homo sapiens 










<220> 












<221> SIGNAL 












<222> -23. .-1 












<400> 484 












Met Ala Gly Pro 


Ala 


Ala 


Ala 


Phe 


Arg 


-20 










-15 


Ala Ala Ala Leu 


Gly 


Phe 


Ala 


Ser 


Tyr 


-5 








1 




Gin Met Pro Thr 


Gly 


Arg 


Ser 


Cys 


Leu 


10 




15 








Ser Tyr Thr Ala 


Trp 


Pro 


Cys 
Thr


Val 


Leu 


Ser 


He 


Cys Leu 


-15 










-10 


Glu 


Gly 


Lys 




Lys 


Leu Gin 








5 






Cys 


Pro 


He 


Lys 


Ser 


Arg Lys 






20 








Gly 


Lys 




Glu 


Asp 


Gly Thr 




35 








40 


Gin 


Pro 


Phe 


Val 


Phe 


Ser Leu 


50 










55 




Gin 


Gly 


Leu 




Gly Met 










70 




He 


Pro 


Ser 


Glu 


Leu 


Gly Tyr 








85 






Pro 


Gly 


Gly Ala 


Thr 


Leu Val 






100 








Arg 


Arg 


Thr 


Glu 


Leu 






115 










Gly 


Gin 


He 






Thr Arg 


-15 










-10 


He 


Arg 


Ser 


Pro 


Val 


Arg Thr 








5 






Ala 


Arg 


Val 


Ser 


Leu 


Trp Val 






20 








Lys 


He 


Ser Gly Cys 


Ala Asn 




35 








40 


Arg 


Leu 


Gly Ala 


Leu Ser Gly 










-10 




Gly 


Ala 


His 


Gly Ala 


Asn Ser 






5 








Thr 


Arg 


Pro 


Thr 


Asn 


Thr Thr 




20 








25 
<210> 485
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -23. .-1 



<400> 485 

Met Ala Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly Ala Leu Ser Gly
-20 -15 -10

Ala Ala Ala Leu Gly Phe Ala Ser Tyr Gly Ala His Gly Ala Asn Ser
-5 1 5

Gln Met Pro Thr Gly Arg Ser Cys Leu Thr Arg Pro Thr Asn Thr Thr
10 15 20 25

Ser Tyr Thr Ala Trp Pro Cys
30



<210> 486 
<211> 54 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -17..-1



<400> 486 

Met Phe Arg Arg Leu Thr Phe Ala Gln Leu Leu Phe Ala Thr Val Leu
-15 -10 -5

Gly Ile Ala Gly Gly Val Tyr Ile Phe Gln Pro Val Phe Glu Gln Tyr
1 5 10 15

Ala Lys Asp Gln Lys Glu Leu Lys Glu Lys Met Gln Leu Val Gln Glu
20 25 30

Ser Glu Glu Lys Lys Ser
35



<210> 487 
<211> 83 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -15. .-1 



Met 


Arg 


Leu 


Phe 


Leu Ser 


Leu 


Pro 


Val Leu 


Val 


Val Val Leu 


Ser 


He 


-15 








-10 








-5 






1 


Val 


Leu 


Glu 


Gly 


Pro Ala 


Pro 


Ala 


Gin Gly 


Thr 


Pro Asp Val 


Ser 


Ser 








5 








10 




15 






Ala 


Leu 


Asp 


Lys 


Leu Lys 


Glu 


Phe 


Gly Asn 


Thr 


Leu Glu Asp 


Lys 


Ala 






20 








25 






30 






Arg 


Glu 


Leu 


He 


Ser Arg 


He 


Lys 


Gin Ser 


Glu 


Leu Ser Ala 


Lys 


Met 


35 








40 








45 






Arg 


Glu 


Trp 


Phe 


Ser Glu 


Thr 


Phe 


Gin Lys 


Val 


Lys Asp Lys 


Leu 


Lys 


50 








55 








60 






65 


He 


Asp 


Ser 





















<210> 488 
<211> 98
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -43. .-1 



<400> 488




















Met 


Phe 


Arg 


He 

-40 


Glu 


Gly 


Leu 


Ala 


Pro Lys Leu 

-35 


Asp 


Pro 


Glu Glu Met 

-30 


Lys 


Arg 


Lys 
-25 


Met 


Arg 


Glu 


Asp 


Val 
-20 


He Ser Ser 


He 


Arg 
-15 


Asn Phe Leu 


He 


Tyr 
-10 


Val 


Ala 


Leu 


Leu 


Arg 
-5 


Val 


Ser Glu Cys 


Leu 
1 


Pro 


Gly Cys Asp 
5 


Cys 


Asp 


Thr 


Ser 


Gly 


Glu Leu Thr Asp Gly His 


Pro 


Leu 


Thr Leu Arg 










10 








15 






20 


Gly 


His 




Gly 


Leu 


Arg 


Thr 


Glu 


Leu Asn Gly 


Ser Gly 


Glu Gin Gly 








25 










30 






35 


Gly 


Ser 


He 
40 


Tyr 


Leu 


Lys 


Glu 


He 
45 


Gly Gin His 


Met 


Lys 
50 


Thr Gly His 


His 


Ile
55 























<210> 489 
<211> 82 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -43. .-1 



<400> 489 
















Met 


Ala 


Lys 


Tyr 


Gin 


Gly Glu Val Gin 


Ser Leu 


Lys 


Leu 


Asp Asp Asp 








-40 




-35 








-30 


Ser 


Val 


He 


Glu 


Gly 


Val Ser Asp Gin Val Leu 


Val 


Ala 


Val Val Val 






-25 






-20 






-15 




Ser 


Phe 


Ala 


Leu 


He 


Ala Thr Leu Val 


Tyr Ala 


Leu 


Phe 


Arg Asn Val 




-10 








-5 




1 




5 


His 


Gin 


Asn 


He 


His 


Pro Glu Asn Gin 


Glu Leu 


Val 


Arg 


Val Leu Arg 










10 




15 






20 


Glu 


Gin 


Leu 


Gin 


Thr 


Glu Gin Asp Ala 


Pro Ala Asp 


Ser 


Thr Ala Val 








25 




30 








35 


Leu 


His 



















<210> 490 
<211> 99 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -28. .-1 



<400> 490 



Met 


Thr 


Ser 


Ala 


Leu 


Thr Gin Gly Leu Glu Arg He 


Pro 


Asp 


Gin 


Leu 








-25 




-20 




-15 






Gly 


Tyr 


Leu 


Val 


Leu 


Ser Glu Gly Ala Val Leu Ala 


Ser 


Ser 


Gly 


Asp 






-10 






-5 


1 








Leu 


Glu 


Asn 


Asp 


Glu 


Gin Ala Ala Ser Ala He Ser 


Glu 


Leu 


Val 


Ser 


5 








10 15 








20 


Thr 


Ala 


Cys 


Gly 


Phe 


Arg Leu His Arg Gly Met Asn 


Val 


Pro 


Phe 


Lys 
25 30 35

Arg Leu Ser Val Val Phe Gly Glu His Thr Leu Leu Val Thr Val Ser

40 45 50

Gly Gln Arg Val Phe Val Val Lys Arg Gln Asn Arg Gly Arg Glu Pro

55 60 65

Ile Asp Val
70 



<210> 491 
<211> 186 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -15. .-1 



<220> 

<221> UNSURE 
<222> 148 

<223> Xaa = Glu, Gin 



<400> 491 










Met 


His 


Val 


Met 


Ala Ala 


Ser Met Ala Arg Gly Gly 


Val Ser Ala Arg 


-15 








-10 


-5 


1 


Val 


Leu 


Leu 


Gin 


Ala Ala 


Arg Gly Thr Trp Trp Asn 


Arg Pro Gly Gly 








5 




10 


15 


Thr 


Ser 


Gly 


Ser 


Gly Glu 


Gly Val Ala Leu Gly Thr 


Thr Arg Lys Phe 






20 






25 


30 


Gin 


Ala 


Thr 


Gly 


Ser Arg 


Pro Ala Gly Glu Glu Asp 


Ala Gly Gly Pro 




35 








40 45 




Glu 


Arg 


Pro 


Gly 


Asp Val 


Val Asn Val Val Phe Val 


Asp Arg Ser Gly 


50 








55 


60 


65 


Gin 


Arg 


He 




Val Ser 


Gly Arg Val Gly Asp Asn 


Val Leu His Leu 










70 


75 


80 


Ala 


Gin 


Arg 


His 


Gly Val 


Asp Leu Glu Gly Ala Cys 


Glu Ala Ser Leu 








85 




90 


95 


Ala 


Cys 


Ser 


Thr 


Cys His 


Val Tyr Val Ser Glu Asp 


His Leu Asp Leu 






100 






105 


110 




Pro 


Pro 


Pro 


Glu Glu 


Arg Glu Asp Asp Met Leu 


Asp Met Ala Pro 




115 








120 125 




Leu 


Leu 


Gin 


Glu 


Asn Ser 


Arg Leu Gly Cys Gin He 


Val Leu Thr Pro 


130 








135 


140 


145 


Glu 


Leu 


Xaa 


Gly 


Ala Glu 


Phe Thr Leu Pro Lys He 


Thr Arg Asn Phe 










150 


155 


160 


Tyr 


Val 


Asp 


Gly 


His Val 


Pro Lys Pro His 










165 




170 





<210> 492 
<211> 123 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -48. .-1 



<400> 492 

Met Glu Ala Leu Gly Lys Leu Lys Gln Phe Asp Ala Tyr Pro Lys Thr

-45 -40 -35

Leu Glu Asp Phe Arg Val Lys Thr Cys Gly Gly Ala Thr Val Thr Ile

-30 -25 -20

Val Ser Gly Leu Leu Met Leu Leu Leu Phe Leu Ser Glu Leu Gln Tyr
-15






-10 






Tyr Leu Thr 


Thr Glu 


Val 


His 


Pro 


Glu 


1 


5 










Gly Asp Lys 


Leu Lys 


He 


Asn 


He 


Asp 












25 


Cys Ala Tyr 


Leu Ser 


He 


Asp 


Ala 


Met 


35 








40 




Leu Asp Val 


Glu His 


Asn 


Leu 


Phe 


Lys 


50 






55 






Ile Pro Val


Ser Ser 




Ala 


Glu Arg 


65 




70 








<210> 493 












<211> 77 












<212> PRT 












<213> Homo sapiens











<220> 

<221> SIGNAL 
<222> -29. .-1 



<220> 

<221> UNSURE 
<222> 36 

<223> Xaa = Gin, Arg 




-5 

Leu Tyr Val Asp Lys Ser Arg 
10 15 
Val Leu Phe Pro His Met Pro 
30 

Asp Val Ala Gly Glu Gin Gin 
45 

Gin Arg Leu Asp Lys Asp Gly 
60 

His Asp 
75 



<400> 493 

Met Ala Ala Thr Asp Phe Val Gln Glu Met Arg Ala Val Gly Glu Arg
-25 -20 -15

Leu Leu Leu Lys Leu Gln Arg Leu Pro Gln Ala Glu Pro Val Glu Ile
-10 -5 1

Val Ala Phe Ser Val Ile Ile Leu Phe Thr Ala Thr Val Leu Leu Leu
5 10 15

Leu Leu Ile Ala Cys Ser Cys Cys Cys Thr His Cys Cys Cys Pro Glu
20 25 30 35

Xaa Arg Gly Arg Lys Val Gln Val Gln Pro Thr Pro Pro
40 45



<210> 494 
<211> 107 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -27..-1

<400> 494

Met Leu Phe Glu Glu Ala Leu Pro Leu Ser Cys Ser Asp Pro Val Leu
-25 -20 -15

Ser Thr Leu Ser Leu Val Gln Phe Ser Pro Ser Gly Arg Thr Gln Asp
-10 -5 1 5

Leu Leu Ser Pro Gly Val Glu Asn Leu Ser Val Leu Asp Val Ser Pro
10 15 20

Gly Leu Ala Cys Cys Leu Thr Leu Thr Met Ser Cys Pro Gly
25 30 35

Pro Asp Pro Pro Glu Gly Pro Gly Thr Gln Arg Val Trp Gln Gly Ala
40 45 50

Leu Ile Leu Gln Leu Pro Gly Ala Pro Asp Gly Val Ser Pro Tyr
55 60 65

Gln Pro Val Trp Ser Arg Thr Pro Asp Leu Lys
70 75 80

<210> 495
<211> 112
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -39..-1

<220>
<221> UNSURE
<222> -37
<223> Xaa = *, Glu, Lys, Gln

<400> 495

Met Leu Xaa His Leu Ser Ser Leu Pro Thr Gln Met Asp Tyr Lys Gly
-35 -30 -25

Gln Lys Leu Ala Glu Gln Met Phe Gln Gly Ile Ile Leu Phe Ser Ala
-20 -15 -10

Ile Val Gly Phe Ile Tyr Gly Tyr Val Ala Glu Gln Phe Gly Trp Thr
-5 1 5

Val Tyr Ile Val Met Ala Gly Phe Ala Phe Ser Cys Leu Leu Thr Leu
10 15 20 25

Pro Pro Trp Pro Ile Tyr Arg Arg His Pro Leu Lys Trp Leu Pro Val
30 35 40

Gln Ala Gln Thr Thr Arg Asn Gln Gly Lys Glu Lys Leu Arg Gly Met
45 50 55

Leu Lys Ile Ile Glu Val Phe Met Ile Gln His Leu Leu Leu Phe Leu
60 65 70






<210> 496
<211> 103
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -21..-1

<400> 496

Met Ala Leu Cys Ala Leu Thr Ala Leu Arg Ser Leu Asn Leu Ala
-20 -15 -10

Pro Pro Thr Val Ala Ala Pro Ala Pro Ser Leu Phe Pro Ala Ala Gln
-5 1 5 10

Met Met Asn Asn Gly Leu Leu Gln Gln Pro Ser Ala Leu Met Leu Leu
15 20 25

Pro Cys Arg Pro Val Leu Thr Ser Val Ala Leu Asn Ala Asn Phe Val
30 35 40

Ser Trp Lys Ser Arg Thr Lys Tyr Thr Ile Thr Pro Val Lys Met Arg
45 50 55

Lys Ser Gly Gly Arg Asp His Thr Gly Ala Gly Asn Val Arg Ser Asn
60 65 70 75

Ser Arg Pro Ser Ile Gln Arg
80



<210> 497 
<211> 92 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -15..-1

<400> 497

Met Ala Val Leu Ala Gly Ser Leu Leu Gly Pro Thr Ser Arg Ser Ala
-15 -10 -5 1

Ala Leu Leu Gly Gly Arg Trp Leu Gln Pro Arg Ala Trp Leu Gly Phe
5 10 15

Pro Asp Ala Trp Gly Leu Pro Thr Pro Gln Gln Ala Arg Gly Lys Ala
20 25 30

Arg Gly Asn Glu Tyr Gln Pro Ser Asn Ile Lys Arg Lys Asn Lys His
35 40 45

Gly Trp Val Arg Arg Leu Ser Thr Pro Ala Gly Val Gln Val Ile Leu
50 55 60 65

Arg Arg Met Leu Lys Gly Arg Lys Ser Leu Ser His
70 75














<210> 498
<211> 87
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -39..-1

<400> 498

Met Lys Val Leu Thr Gly Leu Gly Ala Leu Phe Phe Ala Tyr
-35 -30 -25

Tyr Trp Asp Asp Asn Phe Asp Pro Ala Ser Leu Gln Gly Ala Arg Val
-20 -15 -10

Leu Leu Thr Gly Ala Asn Ala Gly Val Gly Glu Glu Leu Ala Tyr His
-5 1 5

Tyr Ala Arg Leu Gly Ser Leu Val Leu Thr Ala His Thr Glu Ala
10 15 20 25

Leu Leu Gln Lys Ala Arg Trp Leu Thr Leu Val Val Ser Thr Leu Gly
30 35 40

Gly Arg Gly Lys Trp Ile Thr
45
























<210> 499
<211> 125
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -15..-1

<220>
<221> UNSURE
<222> 100
<223> Xaa = Asn, Thr

<400> 499

Met Lys Leu Leu Thr His Asn Leu Leu Ser Ser His Val Arg Gly Val
-15 -10 -5 1

Gly Ser Arg Gly Phe Pro Leu Arg Leu Gln Ala Thr Glu Val Ile
5 10 15

Cys Pro Val Glu Phe Asn Pro Asn Phe Val Ala Arg Met Ile Pro Lys
20 25 30

Val Glu Trp Ser Ala Phe Leu Glu Ala Ala Asp Asn Leu Arg Ile
35 40 45

Gln Val Pro Lys Gly Pro Val Glu Gly Tyr Glu Glu Glu Glu Phe
50 55 60 65

Leu Arg Thr Met His His Leu Leu Leu Glu Val Glu Val Ile Glu Gly
70 75 80

Thr Leu Gln Cys Pro Glu Ser Gly Arg Met Phe Pro Ile Ser Arg Gly
85 90 95

Ile Pro Xaa Met Leu Leu Ser Glu Glu Glu Thr Glu Ser
100 105 110

<210> 500 

<211> 52 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -22..-1

<220>
<221> UNSURE
<222> 26
<223> Xaa = Ile, Asn

<400> 500

Met Lys Ile Ser Val Ala Ala Ile Pro Phe Phe Leu Leu Ile Thr Ile
-20 -15 -10

Ala Leu Gly Thr Lys Thr Glu Ser Ser Ser Arg Gly Pro Tyr His Pro
-5 1 5 10

Ser Glu Cys Cys Phe Thr Tyr Thr Thr Tyr Lys Ile Pro Arg Gln Xaa
15 20 25

Tyr Gly Leu Leu
30

<210> 501 

<211> 72 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -42..-1

<400> 501

Met Asn Arg Leu Ala Gly Val Gly Trp Arg Val Asp Tyr Thr Leu Ser
-40 -35 -30

Ser Ser Leu Leu Gln Ser Val Glu Glu Pro Met Val His Leu Arg Leu
-25 -20 -15

Glu Val Ala Ala Ala Pro Gly Thr Pro Ala Gln Pro Val Ala Met Ser
-10 -5 1 5

Leu Ser Ala Asp Lys Phe Gln Val Leu Leu Ala Glu Leu Lys Gln Ala
10 15 20

Gln Thr Leu Met Ser Ser Leu Gly
25 30

<210> 502 

<211> 84 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -20..-1

<400> 502

Met Ala Ala Ala Ala Val Pro Ser Leu Leu Leu Ser Leu Pro Pro His
-20 -15 -10 -5

Gln Gly Leu Thr Phe Ser Asn Lys Ile Gln Pro Phe Gly Ala Gln Gly

Val Leu His Pro Glu Pro Gly Leu Arg Asp Trp Leu Leu Pro Thr Cys
15 20 25

Ser Arg Gln Leu Arg Val Ala Leu Pro Glu Lys Gly Ser Glu Gly Ser
30 35 40

Leu Cys Gln Thr Gln Leu Pro Ala Thr Pro Cys Phe Leu Pro Ser Asn
45 50 55 60

Thr Val Arg Thr



















<210> 503 
<211> 54 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -19..-1

<400> 503

Met Lys Phe Asn Phe Val Leu Leu Ile Leu Ser Lys Val Leu Asp Asp
-15 -10 -5

Thr Phe Gln Ser Val Lys Lys Trp Leu Asn Tyr Phe Gln Phe Thr Leu
1 5 10

Arg Asn Gly Leu Met Trp Pro Gly Ala Val Ala His Ala Cys Asn Pro
15 20 25

Ser Thr Gly Ser Arg Leu
30 35

<210> 504 

<211> 91 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -19..-1

<220>
<221> UNSURE
<222> 29
<223> Xaa = Met, Thr

<400> 504

Met Tyr Gly Lys Ile Ile Phe Val Leu Leu Leu Ser Glu Ile Val Ser
-15 -10 -5

Ile Ser Ala Leu Ser Thr Thr Glu Val Ala Met His Thr Ser Thr Ser
1 5 10

Ser Ser Val Thr Lys Ser Tyr Ile Ser Ser Gln Thr Asn Gly Glu Xaa
15 20 25

Gly Gln Leu Val His Arg Phe Thr Val Pro Ala Pro Val Val Ile Ile
30 35 40 45

Leu Ile Ile Leu Cys Val Met Ala Gly Ile Ile Gly Thr Ile Leu Leu
50 55 60

Ile Ser Tyr Ser Ile Arg Arg Leu Ile Lys Ala
65 70



<210> 505 
<211> 114 
<212> PRT 

<213> Homo sapiens

<220>
<221> SIGNAL
<222> -32..-1

<400> 505

Met Gln Met Tyr Ser Arg Gln Ala Ser Thr Glu Trp Leu Thr Ile
-30 -25

Gln Gly Gly Leu Leu Gly Ser Gly Leu Phe Val Phe Ser Leu Thr Ala
-15 -10 -5

Phe Asn Asn Leu Glu Asn Leu Val Phe Gly Lys Gly Phe Gln Ala Lys
1 5 10 15

Ile Phe Pro Glu Ile Leu Cys Leu Leu Leu Ala Leu Phe Ala Ser
20 25 30

Gly Leu Ile His Arg Val Cys Val Thr Thr Cys Phe Ile Phe Ser Met
35 40 45

Val Gly Leu Tyr Tyr Ile Asn Lys Ile Ser Ser Thr Leu Tyr Gln Ala
50 55 60

Ala Ala Pro Val Leu Thr Pro Ala Lys Val Thr Gly Lys Ser Lys Lys
65 70 75 80



<210> 506 
<211> 53 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -13..-1

<400> 506

Met Ala Val Ala Phe Val Leu Ser Leu Gly Val Ala Ala Leu Tyr Lys
-10 -5 1

Phe Arg Val Ala Asp Gln Arg Lys Lys Ala Tyr Ala Asp Phe Tyr Arg
5 10 15

Asn Tyr Asp Val Met Lys Asp Phe Glu Glu Met Arg Lys Ala Gly Ile
20 25 30 35

Phe Gln Ser Val Lys
40



<210> 507 
<211> 108 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -40..-1

<400> 507

Met Gln Lys Phe Arg Lys Met Ser Glu Thr His His Ser Val Ile Ser
-40 -35 -30 -25

Val Lys Ala Ser Ser Pro Trp Leu Ser Ser Ser Val Thr Ala Pro Ser
-20 -15 -10

Met Val Ala Pro Val Thr Phe Ala Ser Ile Val Glu Glu Glu Leu Gln
-5 5

Gln Glu Ala Ala Leu Ile Arg Ser Arg Glu Lys Pro Leu Ala Leu Ile
10 15 20

Gln Ile Glu Glu His Ala Ile Gln Asp Leu Leu Val Phe Tyr Glu Ala
25 30 35 40

Phe Gly Asn Pro Glu Glu Phe Val Ile Val Glu Arg Thr Pro Gln Gly
45 50 55

Pro Leu Ala Val Pro Met Trp Asn Lys His Gly Cys
60 65

<210> 508
<211> 127
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -47..-1

<220>
<221> UNSURE
<222> -22
<223> Xaa = *, Cys

<220>
<221> UNSURE
<222> -39
<223> Xaa = *, Tyr

<220>
<221> UNSURE
<222> 57
<223> Xaa = Ile, Leu

<220>
<221> UNSURE
<222> -43,-27,63
<223> Xaa = Leu, Met

<220>
<221> UNSURE
<222> 3
<223> Xaa = Lys, Gln

<220>
<221> UNSURE
<222> -42
<223> Xaa = Lys, Thr

<400> 508

Met Phe Thr Pro Xaa Xaa Val Xaa Ala Tyr Tyr Asp Thr Glu Arg
-45 -40 -35

Ile Gly Val Asp Xaa Ile Met Lys Thr Xaa Phe Ser Pro Asn Arg Val
-30 -25 -20

Ile Gly Leu Ser Ser Asp Leu Gln Gln Val Gly Gly Ala Ser Ala Arg
-15 -10 -5 1

Ile Xaa Asp Ala Leu Ser Thr Val Leu Gln Tyr Ala Glu Asp Val Leu
5 10 15

Ser Gly Lys Val Ser Ala Asp Asn Thr Val Gly Arg Phe Leu Met Ser
20 25 30

Leu Val Asn Gln Val Pro Lys Ile Val Pro Asp Asp Phe Glu Thr Met
35 40 45

Leu Asn Ser Asn Ile Asn Asp Xaa Leu Met Val Thr Tyr Xaa Ala Asn
50 55 60 65

Leu Thr Gln Ser Gln Ile Ala Leu Asn Glu Lys Leu Val Asn Leu
70 75 80



<210> 509 

<211> 85 

<212> PRT 

<213> Homo sapiens

<220>
<221> SIGNAL
<222> -32..-1

<400> 509

Met Lys Ala Ala Leu Glu Asp Thr Leu Ala Glu Thr Glu Ala Arg Phe
-30 -25 -20

Gly Ala Gln Leu Ala His Ile Gln Ala Leu Ile Ser Gly Ile Glu Ala
-15 -10 -5

Gln Leu Gly Asp Val Arg Ala Asp Ser Glu Arg Gln Asn Gln Glu Tyr
1 5 10 15

Gln Arg Leu Met Asp Ile Lys Ser Arg Leu Glu Gln Glu Ile Ala Thr
20 25 30

Tyr Arg Ser Leu Leu Glu Gly Gln Glu Asp His Tyr Asn Asn Leu Ser
35 40 45

Ala Ser Lys Val Leu
50

















<210> 510 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -25..-1

<400> 510

Met Val Asp Arg Glu Leu Ala Asp Ile His Glu Asp Ala Lys Thr Cys
-25 -20 -15 -10

Leu Val Leu Cys Ser Arg Val Leu Ser Val Ile Ser Val Lys Glu Ile
-5 1 5

Lys Thr Gln Leu Ser Leu Gly Arg His Pro Ile Ile Ser Asn Trp Phe
10 15 20

Asp Tyr Ile Pro Ser Thr Arg Tyr Asp Pro Cys Glu Leu His
25 30 35

Cys Arg Leu Thr Ile Arg Asn Gln Leu Leu Thr Asn Asn Met Leu
40 45 50 55

Pro Asp Gly Ile Phe Ser Leu Leu Ile Pro Ala Arg Leu Gln Asn Tyr
60 65 70

Leu Asn Leu Glu Ile
75





















<210> 511 

<211> 68 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -47..-1

<400> 511

Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn Ser His Cys Phe Ala
-45 -40 -35

Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr Ile His Ala Ser Arg
-30 -25 -20

Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly Gly Ser Ser Ser Glu
-15 -10 -5 1

Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn Pro Gly Pro Gly Pro
5 10 15

Trp Cys Ser

<210> 512
<211> 68
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -47..-1

<220>
<221> UNSURE
<222> 9
<223> Xaa = Cys, Trp

<400> 512

Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn Ser His Cys Phe Ala
-45 -40 -35

Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr Ile His Ala Ser Arg
-30 -25 -20

Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly Gly Ser Ser Ser Glu
-15 -10 -5 1

Ser Ser Arg Val Ser Leu His Xaa Cys Ser Asn Pro Gly Pro Gly Pro
5 10 15

Arg Trp Cys Ser
20



<210> 513 
<211> 129 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -16..-1

<400> 513

Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly Pro Gly
-15 -10 -5

Pro Gly Pro Arg Pro Leu Trp Gly Pro Gly Pro Ala Trp Ser Pro Gly
1 5 10 15

Phe Pro Ala Arg Pro Gly Arg Gly Arg Pro Tyr Met Ala Ser Arg Pro
20 25 30

Pro Gly Asp Leu Ala Glu Ala Gly Gly Arg Ala Leu Gln Ser Leu Gln
35 40 45

Leu Arg Leu Leu Thr Pro Thr Phe Glu Gly Ile Asn Gly Leu Leu Leu
50 55 60

Lys Gln His Leu Val Gln Asn Pro Val Arg Leu Trp Gln Leu Leu Gly
65 70 75 80

Gly Thr Phe Tyr Phe Asn Thr Ser Arg Leu Lys Gln Lys Asn Lys Glu
85 90 95

Lys Asp Lys Ser Lys Gly Lys Ala Pro Glu Glu Asp Glu Gly Ile Phe
100 105 110

Ile



<210> 514 

<211> 109 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> SIGNAL
<222> -33..-1

<220>
<221> UNSURE
<222> 43
<223> Xaa = Asp, Gly, Asn, Ser

<220>
<221> UNSURE
<223> Xaa = Glu, Gly

<220>
<221> UNSURE
<222> 42
<223> Xaa = Pro, Arg

<400> 514

Met Ala Ala Ser Gly Ala Pro Arg Ile Leu Val Asp Leu Leu Lys Leu
-30 -25 -20

Asn Val Ala Pro Leu Ala Val Phe Gln Met Leu Lys Ser Met Cys Ala
-15 -10 -5

Gly Gln Arg Leu Ala Ser Glu Pro Gln Asp Pro Ala Ala Val Ser Leu
1 5 10 15

Pro Thr Ser Ser Val Pro Xaa Thr Arg Gly Arg Asn Lys Gly Ser Ala
20 25 30

Ala Leu Gly Gly Ala Leu Ala Leu Ala Glu Xaa Xaa Ser Arg Glu Gly
35 40 45

Ser Ser Gln Arg Met Pro Arg Gln Pro Ser Ala Thr Arg Leu Pro Lys
50 55 60

Gly Gly Gly Pro Gly Lys Ser Pro Thr Arg Gly Ser Thr
65 70 75







<210> 515 
<211> 103 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -36..-1

<400> 515

Met Ala Asn Leu Phe Ile Arg Lys Met Val Asn Pro Leu Leu Tyr Leu
-35 -30 -25

Ser His Thr Val Lys Pro Arg Ala Leu Ser Thr Phe Leu Phe Gly
-20 -15 -10 -5

Ser Ile Arg Gly Ala Ala Pro Val Ala Val Glu Pro Gly Ala Ala Val
1 5 10

Arg Ser Leu Leu Ser Pro Gly Leu Leu Pro His Leu Leu Pro Ala Leu
15 20 25

Gly Phe Lys Asn Lys Thr Val Leu Lys Lys Arg Cys Lys Cys Tyr
30 35 40

Leu Val Lys Arg Arg Gly Arg Trp Tyr Val Tyr Cys Lys Thr His Pro
45 50 55 60

Arg His Lys Gln Arg Gln Met
65



<210> 516 
<211> 86 
<212> PRT 

<213> Homo sapiens

<220>
<221> SIGNAL
<222> -36..-1

<400> 516

Met Gly Leu Arg Ser Leu Gln Val Lys Lys Gly Thr Glu His
-35 -30 -25

Ala Asp Pro Leu Pro Phe Pro Ser Val Ser Leu Ser Gly Phe Thr Val
-20 -15 -10 -5

Gly Thr Leu Ser Glu Thr Ser Thr Gly Gly Pro Ala Thr Pro Thr Trp
1 5 10

Lys Glu Cys Pro Ile Cys Lys Glu Arg Phe Ser Asp Lys
15 20 25

Asp Ala Leu Glu Asp His Met Asp Gly His Phe Phe Phe Ser Thr Gln
30 35 40

Asp Pro Phe Thr Phe Glu
45 50




















<210> 517
<211> 52
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -30..-1

<400> 517

Met Ser Ser Cys Gly Ile Val Gly Ser Ser Val Ser Phe Gln Leu Asp
-30 -25 -20 -15

Ala Val Lys Leu Leu Leu Lys Met Val Ser Ser Ala Thr Thr Glu Arg
-10 -5 1

Cys Cys Asn Gly Ser Ala Asn Phe His Lys Asn Leu Cys Ala Thr Gly
5 10 15

Ile Lys Asn Phe
20




























<210> 518
<211> 182
<212> PRT
<213> Homo sapiens

<400> 518

Met Ser Ala Thr Val Val Asp Ala Val Asn Ala Ala Pro Leu Ser Gly
1 5 10 15

Ser Lys Glu Met Ser Leu Glu Glu Pro Lys Lys Met Thr Arg Glu Asp
20 25 30

Trp Arg Lys Lys Lys Glu Leu Glu Glu Gln Arg Lys Leu Gly Asn Ala
35 40 45

Pro Ala Glu Val Asp Glu Glu Gly Lys Asp Ile Asn Pro His Ile Pro
50 55 60

Gln Tyr Ile Ser Ser Val Pro Trp Tyr Ile Asp Pro Ser Lys Arg Pro
65 70 75 80

Thr Leu Lys His Gln Arg Pro Gln Pro Glu Lys Gln Lys Gln Phe Ser
85 90 95

Ser Ser Gly Glu Trp Tyr Lys Arg Gly Val Lys Glu Asn Ser Ile Ile
100 105 110

Thr Lys Tyr Arg Gly Ala Cys Glu Asn Cys Gly Ala Met Thr His
115 120 125

Lys Lys Lys Asp Cys Phe Glu Arg Pro Arg Arg Val Gly Ala Lys Phe
130 135 140

Thr Gly Thr Asn Ile Ala Pro Asp Glu His Val Gln Pro Gln Leu Met
145 150 155 160

Phe Asp Tyr Asp Gly Lys Arg Asp Arg Trp Asn Gly Tyr Asn Pro Glu
165 170 175

Glu His Met Lys Ile Val
180



<210> 519 
<211> 147 
<212> PRT 

<213> Homo sapiens 

<400> 519 



Met Asp Val Leu Val Ser Glu Cys Ser Ala Arg Leu Gln Gln Glu
10 15

Glu Glu Ile Lys Ser Leu Thr Ala Glu Ile Asp Arg Leu Lys Asn Cys
20 25 30

Gly Cys Leu Gly Ala Ser Pro Asn Leu Glu Gln Leu Gln Glu Glu
35 40 45

Leu Lys Leu Lys Tyr Arg Leu Asn Ile Leu Arg Lys Ser Leu Gln Ala
50 55 60

Glu Arg Asn Lys Pro Thr Lys Asn Met Ile Asn Ile Ile Ser Arg Leu
65 70 75 80

Gln Glu Val Phe Gly His Ala Ile Lys Ala Ala Tyr Pro Asp Glu
85 90 95

Asn Pro Pro Leu Leu Val Thr Pro Ser Gln Gln Ala Lys Phe Gly Asp
100 105 110

Tyr Gln Cys Asn Ser Ala Met Gly Ile Ser Gln Val Met Tyr Cys His
115 120 125

Asp Ser Trp Leu Phe Asp Phe Phe Lys Tyr Tyr Tyr His His Cys His
130 135 140

Leu Gln Lys

























<210> 520 
<211> 65 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -28..-1

<400> 520

Met Asn Leu Phe Arg Phe Leu Gly Asp Leu Ser His Leu Leu Ala Ile
-25 -20 -15

Ile Leu Leu Leu Leu Lys Ile Trp Lys Ser Arg Ser Cys Ala Ala His
-10 -5 1

Pro Gln Leu Pro Leu Ser Phe Cys Leu Ser Val Cys Leu Ser Val Ser
5 10 15 20

Leu Ser Leu Ser Val Ser Leu Ser Leu Ser Phe Ser Val Ser Lys Lys
25 30 35

Lys

<210> 521 
<211> 140 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -40..-1

<400> 521

Met Val Ser Ser Phe Arg Val Ser Glu Leu Gln Val Leu Leu Gly Phe
-40 -35 -30 -25

Ala Gly Arg Asn Lys Ser Gly Arg Lys His Asp Leu Leu Met Arg Ala
-20 -15 -10

Leu His Leu Leu Lys Ser Gly Cys Ser Pro Ala Val Gln Ile Lys Ile
-5 1 5

Arg Glu Leu Tyr Arg Arg Arg Tyr Pro Arg Thr Leu Glu Gly Leu Ser
10 15 20

Asp Leu Ser Thr Ile Lys Ser Ser Val Phe Ser Leu Asp Gly Gly Ser
25 30 35 40

Ser Pro Val Glu Pro Asp Ala Gly Ile His Ser Leu Pro
45 50 55

Ser Thr Ser Val Thr Pro His Ser Pro Ser Ser Pro Val Gly Ser Val
60 65

Leu Leu Gln Asp Thr Lys Pro Thr Phe Glu Met Gln Gln Pro Ser Pro
75 80 85

Pro Ile Pro Pro Val His Pro Val Gln Leu Lys
90 95 100








<210> 522
<211> 154
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 135
<223> Xaa = His, Asn, Pro, Thr

<220>
<221> UNSURE
<222> 137
<223> Xaa = Ile, Leu

<220>
<221> UNSURE
<222> 139
<223> Xaa = Pro, Arg

<220>
<221> UNSURE
<222> 136
<223> Xaa = Pro, Gln

<400> 522

Met Ala Glu Val Glu Gln Lys Lys Lys Arg Thr Phe Arg Lys Phe Thr
1 5 10 15

Tyr Arg Gly Val Asp Leu Asp Gln Leu Leu Asp Met Ser Tyr Glu Gln
20 25 30

Leu Met Gln Leu Tyr Ser Ala Arg Gln Ala Ala Ala Glu Pro Gly Pro
35 40 45

Ala Ala Glu Ala Ala Leu Pro Ala Glu Ala Pro Ala Gln Gly Gln Glu
50 55 60

Gly Gly Ala Ala His Gly Glu Ala Gly Ser Gly Glu Asp Ala Pro Ala
65 70 75 80

Gly His Asp His Pro Thr Arg Asp Gly Gly Gln His Gly Gly Arg Leu
85 90 95

Gln Arg Gln Asp Leu Gln Pro Gly Gly Asp Gln Ala Arg Asp Asp Arg
100 105 110

Pro Leu Pro Gly Arg Val Leu His His Leu Gln Ala Arg Lys Ala Trp
115 120 125

Pro Ala Arg His Arg Gly Xaa Xaa Xaa Leu Xaa Leu His Pro Ser Gln
130 135 140

Val Met Ala Gln Leu Ile Lys Ala His Met

<210> 523
<211> 65
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 63
<223> Xaa = *, Ala, Glu, Gly, Ile, Lys, Leu, Pro, Gln, Arg, Ser, Thr, Val

<400> 523

Met Ala Glu Glu Gln Gly Arg Glu Arg Asp Ser Val Pro Lys Pro Ser
1 5 10 15

Val Leu Phe Leu His Pro Asp Leu Gly Val Gly Gly Ala Glu Arg Leu
20 25 30

Val Leu Leu Pro Phe Pro Thr Glu Arg Arg Leu Leu Phe Cys Thr Ala
35 40 45

Pro Arg Ser Leu Asn Ser Leu Pro Leu Lys Lys Lys Lys Ile Xaa Lys
50 55 60

Ser
65






















<210> 524
<211> 133
<212> PRT
<213> Homo sapiens

<400> 524

Met Glu Tyr Val Leu Glu Val Lys Asn Ser Pro Arg His Leu Leu Lys
1 5 10 15

Gln Phe Thr Val Cys Asp Val Pro Leu Tyr Asp Ile Cys Asp Tyr Asn
20 25 30

Val Ser Arg Asp Arg Cys Gln Glu Leu Gly Cys Cys Phe Tyr Glu Gly
35 40 45

Val Cys Tyr Lys Lys Ala Val Pro Ile Tyr Ile His Val Phe Ser Ala
50 55 60

Leu Ile Val Ile Ile Ala Gly Ala Phe Val Ile Thr Ile Ile Tyr Arg
65 70 75 80

Val Ile Gln Glu Ser Arg Lys Glu Lys Ala Ile Pro Val Tyr Val Ala
85 90 95

Leu Pro Gln Lys Ser Ser Glu Lys Ala Glu Leu Ala Ser Ser Ser Ser
100 105 110

Lys Leu Gly Leu Lys Leu Arg Val Leu Gly Leu Lys Val Leu Gly Leu
115 120 125

Asp Glu Ser Asp Glu
130






















<210> 525 






















<211> 149 






















<212> PRT 






















<213> Homo sapiens 




















<400> 525 






















Met Arg Tyr Asn Glu 


Lys 


Glu 


Leu 


Gin 


Ala 


Leu 


Ser 


Arg 


Gin Pro Ala 


1 


5 










10 








15 


Glu Met Ala Ala 


Glu 


Leu Gly Met 




Gly 


Pro 






Gly Ser Val 


20 










25^ 










30 


Leu Lys Arg Arg 


Leu 


Val 


Lys 


Leu 


Val 


Val 


Asn 


Phe 




Phe Tyr Phe 


35 








40 










45 




Arg Thr Asp Glu 


Ala 


Glu 


Pro 


Val 


Gly Ala 






Leu 


Glu Arg Cys 


50 






55 










60 
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Arg Val 


Val 


Arg 


Glu 


Glu Pro Gly 


Thr 


Phe 


Ser 


He 


Ser 


Phe 


He 


Glu 


65 










70 






75 










80 


Asp 


Pro 








Tyr His Phe 




Cys 


Ser 


Ser 


Glu 
















85 






90 










95 




Gln Glu Trp Met Glu Ala Leu Arg Arg Ala Ser Tyr Glu Phe Met Arg
100 105 110






Arg 


Ser 




He 


Phe 


Tyr Arg Asn 


Glu 


He 


Arg 


Lys 


Val 


Thr 


Gly 


Lys 






115 






120 










125 








Asp 


Pro 


Leu 


Glu 


Gin 


Phe Gly He 


Ser 


Glu 


Glu Ala Arg 


Phe 


Gin 





130 135 140 



Ser Gly Leu Gln Ala

145 



<210> 526 
<211> 92 
<212> PRT 

<213> Homo sapiens 



<400> 526 




















Met Pro Val 


Val Pro Ala Leu Gly 






Trp Ala 


Asp 


His 




1 


5 




10 










15 




Arg Ser Gly Val Arg Asp Gln Pro Gly Gln Pro Gly Glu Ala Pro Pro
20 25 30

Ser Leu Leu Lys Ile Gln Lys Leu Ala Gly Tyr Gly Gly Gly Cys Leu
35 40 45

Trp Ser Gln Leu Leu Gly Arg Leu Arg Arg Glu Asn His Leu Ser Pro
50 55 60

Gly Gly Gly Gly Cys Ser Glu Pro Arg Leu Cys His Cys Thr Pro Ala
65 70 75 80

Trp Val Thr Glu Gln Asp Ser Ile Ser Lys Ile Glu
85 90



<210> 527 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -21..-1



<220> 

<221> UNSURE 
<222> -5 

<223> Xaa = Ser, Thr 



<400> 527 

Met Asp Pro Asn Cys Ser Cys Ala Ala Gly Val Ser Cys Thr Cys Ala
-20 -15 -10

Xaa Pro Ala Ser Ala Lys Ser Ala Asn Ala Pro Pro Ala Arg Arg Ala
-5 1 5 10

Ala Ala Pro Gly Ala Leu Ala Pro Pro Ser Trp Thr Pro Ala Leu Pro
15 20 25

Cys His Leu Ser Val Cys Pro Lys Glu Val Leu Val
30 35



<210> 528 

<211> 120 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -35..-1
<400> 528 



Met 


Ser 


Lys 


His 


Leu 


His Cys Lys 


Val 


-35 










-30 




Glu 


Val 


Ser 


Ala 


Ser 


Cys Leu Ser 


Gin 










-15 






Thr 


His 


Ser 


Gin 


Thr 


Cys He Arg 


Asn 








1 




5 




Pro 




Thr 


Ser 


Thr 


Gly Cys Lys 


Gly 




15








20 




Glu 


Pro 


Val 


Glu 


Leu 


Glu Val Glu 


Gly 


30 










35 




Thr 


Cys 


Gly Ser 


Leu 


Gly Arg Gly Asn 










50 






Gly 


Phe 


Ser 


Met 


Asn 


Thr Thr Val 


Gly 








65 






70 


His 


His 


Thr 


Ser 


Thr 


Thr Ser Ser 








80 






85 





<210> 529 

<211> 65 

<212> PRT 

<213> Homo sapiens 






Leu 


Gly 


Leu 


Pro 


Leu Arg 


Leu 




-25 










-20 


Ser 


Leu 


Ala 


Met 


Ser 




Gin 












-5 




Leu 


Val 


Thr Cys 


He 


Asn 


Tyr 








10 








Thr 


Thr 


Thr 
25 


Gin 




He 


Met 


Thr 


Glu 


Gin 


Asp Asn 


Ala 


Lys 




40 










45 


Glu 


Asn 


Thr 


Met 




Arg 


Gly 


55 










60 




Gin 


Gly 


He 


Ser 


Lys 


Gin 


Thr 



75 



<220> 

<221> SIGNAL 
<222> -14..-1



<400> 529 

Met Leu Cys Trp Leu Lys Leu Val Ser Ile Thr Thr Val Ala Ile Ile
-10 -5 1

Leu Asn Trp Ala Gln His Ala Glu Asn Thr Thr Glu Cys Ala His Trp
5 10 15

Leu Ser Leu Ile Gln Val Thr Leu Thr Ser Leu Glu Ala Cys Gln Asn
20 25 30

Arg Leu Val Lys Ser Lys Pro Phe His Leu Gln Asn Phe Thr Cys Lys
35 40 45 50

Pro



<210> 530 
<211> 99 
<212> PRT 

<213> Homo sapiens 



<400> 530 



Met Lys Lys Lys Glu Glu Thr Thr Leu Ser Glu Met Glu Pro Val Glu
1 5 10 15

Pro Gln Tyr Gln Leu Val Asn Ala Glu Ser Thr Ser Pro Phe Leu His
20 25 30

Cys Leu Arg Glu Val Ile Gly Glu Tyr Ser Val His Glu Phe Ser Leu
35 40 45




Gly 
50 


Lys 


Thr 


Glu 


Ser 


Gin 
55 


Gly 


He 


Gly 




Trp 
60 


He 


Ala 




Val 


Val Phe Leu Ser Phe Leu Ile Phe Ser Thr Ser Phe Tyr Ile Ser Asn
65 70 75 80

Ala Glu Gln Pro Phe Phe Lys Glu Pro Pro Thr Glu Ala Ala Lys Glu
85 90 95

Leu Ser Leu



<210> 531 
<211> 128 
<212> PRT 
<213> Homo sapiens
<220> 

<221> UNSURE 
<222> 70 

<223> Xaa = Pro,Thr 



<400> 531 



Met Ala Gly Asn Leu Leu Ser Gly Ala Gly Arg Arg Leu Trp Asp Trp
1 5 10 15


Val 


Pro 




Ala 


Cys Arg 


Ser Phe Ser Leu Gly 


Val Pro 


Arg Leu He 








20 




25 




30 


Gly Ile Arg Leu Thr Leu Pro Pro Pro Lys Val Val Asp Arg Trp Asn
35 40 45

Glu Lys Arg Ala Met Phe Gly Val Tyr Asp Asn Ile Gly Ile Leu Gly
50 55 60






Phe 


Glu 


Lys 


His Xaa 


Lys Glu Leu lie Arg 


Gly Pro 


He Trp Leu 


65 








70 


75 




80 


Arg Gly Trp Lys Gly Asn Glu Leu Gln Arg Cys Ile Arg Lys Arg Lys
85 90 95

Met Val Gly Ser Arg Met Phe Ala Asp Asp Leu His Asn Leu Asn Lys
100 105 110

Arg Ile Arg Tyr Leu Tyr Lys His Phe Asn Arg His Gly Lys Phe Arg
115 120 125





<210> 532 
<211> 52 
<212> PRT 

<213> Homo sapiens 



<400> 532 

Met Phe Leu Thr Leu Ala Asp Thr Cys Lys Leu Arg Gly Met Ser Phe
1 5 10 15

Leu Leu Asn Val Tyr Glu Gly Glu Ala Thr Val Ser Ser Val Leu Glu
20 25 30

Leu Leu Glu Ser Trp Ile Ile Val Gly Asn Glu Arg Tyr Phe Asp Gly
35 40 45

Ile Ser Ser His
50



<210> 533 
<211> 84 
<212> PRT 

<213> Homo sapiens 



<400> 533 




























Met Lys Val Lys Ile Lys Cys Trp Asn Gly Val Ala Thr Trp Leu Trp
1 5 10 15




Val 


Ala 


Asn 


Asp 


Glu 


Asn 


Cys 


Gly 


He 


Cys 




Met 


Ala 


Phe Asn Gly 








20 










25 










30 






Cys Cys Pro Asp Cys Lys Val Pro Gly Asp Asp Cys Pro Leu Val Trp
35 40 45

Gly Gln Cys Ser His Cys Phe His Met His Cys Ile Leu Lys Trp Leu
50 55 60

His Ala Gln Gln Val Gln Gln His Cys Pro Met Cys Arg Gln Glu Trp
65 70 75 80

Lys Phe Lys Glu



























<210> 534 
<211> 80 
<212> PRT 

<213> Homo sapiens 
<400> 534

Met Ala Thr Pro Thr Gln Thr Pro Thr Lys Ala Pro Glu Glu Pro Asp
1 5 10 15

Pro Phe Tyr Tyr Asp Tyr Asn Thr Val Gln Thr Val Gly Met Thr Leu
20 25 30

Ala Thr Ile Leu Phe Leu Leu Gly Ile Leu Ile Val Ile Ser Lys Lys
35 40 45

Val Lys Cys Arg Lys Ala Asp Ser Arg Ser Glu Ser Pro Thr Cys Lys
50 55 60

Ser Cys Lys Ser Glu Leu Pro Ser Ser Ala Pro Gly Gly Gly Gly Val
65 70 75 80

<210> 535 
<211> 127 
<212> PRT 
<213> Homo sapiens 

<400> 535 



Met Ser Phe Ser Gly Lys Tyr Gln Leu Gln Ser Gln Glu Asn Phe Glu
1 5 10 15

Ala Phe Met Lys Ala Ile Gly Leu Pro Glu Glu Leu Ile Gln Lys Gly
20 25 30

Lys Asp Ile Lys Gly Val Ser Glu Ile Val Gln Asn Gly Lys His Phe
35 40 45

Lys Phe Thr Ile Thr Ala Gly Ser Lys Val Ile Gln Asn Glu Phe Thr
50 55 60

Val Gly Glu Glu Cys Glu Leu Glu Thr Met Thr Gly Glu Lys Val Lys
65 70 75 80

Thr Val Val Gln Leu Glu Gly Asp Asn Lys Leu Val Thr Thr Phe Lys
85 90 95

Asn Ile Lys Ser Val Thr Glu Leu Asn Gly Asp Ile Ile Thr Asn Thr
100 105 110

Met Thr Leu Gly Asp Ile Val Phe Lys Arg Ile Ser Lys Arg Ile
115 120 125









<210> 536 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<400> 536 

Met Ala Ala Gly Thr Ser Pro Ala Asp Ala Leu Cys Asp Gly Lys Lys
1 5 10 15

Ala Ala Gly Pro Thr Ser Gly Ala Phe Tyr Asn Glu Leu Arg Ala Leu
20 25 30

Glu Ser Leu Pro Gly Asn Tyr Ser Ser Ala Ser Thr Val Val Leu Leu
35 40 45

Trp Ile Ala Ser Arg Gln Lys Ser Gln Ser Ser Arg Asn Ser Gln Cys
50 55 60

Ser Arg Lys Asp Lys Gly Gly Lys Gln Glu Glu His Glu
65 70 75

<210> 537 

<211> 75 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 65 

<223> Xaa = Asp, Gly 
<400> 537 
Met Pro Gly Pro Thr Pro Ser Gly Thr Asn Val Gly Ser Ser Gly Arg
1 5 10 15

Ser Pro Ser Lys Ala Val Ala Ala Arg Ala Arg Asp Pro Leu Ser Gly
20 25 30

Arg Gly Lys Met Pro Ala Val Gly Gln Gly Val Gln Ala Ala Gln Pro
35 40 45

Arg Gln Ala Pro Gly Gly Cys Gly Asp Ser Thr Gln Lys Ile His Leu
50 55 60

Xaa Ser Lys Leu Ala Leu Phe Gln Tyr Trp Leu
65 70 75

<210> 538 
<211> 56 
<212> PRT 
<213> Homo sapiens 



<400> 538 

Met Ser Ser Trp Ser Leu Met Cys Met Gln Asp Met Gly Asn Thr Lys
1 5 10 15

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro
20 25 30

Gln Thr Asp Gln Ala Val Glu Phe Phe Ile Arg Asp Lys Tyr Glu Lys
35 40 45

Lys Lys Tyr Tyr Glu Lys Met Pro
50 55

<210> 539
<211> 122
<212> PRT
<213> Homo sapiens



<400> 539 








Met Glu Ala Ser Ala Leu Thr Ser Ser Ala Val Thr Ser Val Ala Lys
1 5 10 15

Val Val Arg Val Ala Ser Gly Ser Ala Val Val Leu Pro Leu Ala Arg
20 25 30

Ile Ala Thr Val Val Ile Gly Gly Val Val Ala Met Ala Ala Val Pro
35 40 45

Met Val Leu Ser Ala Met Gly Phe Thr Ala Ala Gly Ile Ala Ser Ser
50 55 60

Ser Ile Ala Ala Lys Met Met Ser Ala Ala Ala Ile Ala Asn Gly Gly
65 70 75 80

Gly Val Ala Ser Gly Ser Leu Val Ala Thr Leu Gln Ser Leu Gly Ala
85 90 95

Thr Gly Leu Ser Gly Leu Thr Lys Phe Ile Leu Gly Ser Ile Gly Ser
100 105 110

Ala Ile Ala Ala Val Ile Ala Arg Phe Tyr
115 120









<210> 540 

<211> 74 

<212> PRT 

<213> Homo sapiens 

<400> 540 

Met Ala Ser Ser Thr Val Pro Val Ser Ala Ala Gly Ser Ala Asn Glu
1 5 10 15

Thr Pro Glu Ile Pro Asp Asn Val Gly Asp Trp Leu Arg Gly Val Tyr
20 25 30

Arg Phe Ala Thr Asp Arg Asn Asp Phe Arg Arg Asn Leu Ile Leu Asn
35 40 45

Leu Gly Leu Phe Ala Ala Gly Val Trp Leu Ala Arg Asn Leu Ser Asp
50 55 60

Ile Asp Leu Met


Ala 








Gly Val 














65 




70 






















<210> 541
<211> 97
<212> PRT
<213> Homo sapiens

<400> 541


























Met Ser Ser Ala Pro Glu Pro Pro Thr Phe Lys Lys Glu Pro Pro Lys
1 5 10 15

Glu Lys Glu Phe Gln Ser Pro Gly Leu Arg Gly Val Arg Thr Thr Thr
20 25 30

Leu Phe Arg Ala Val Asn Pro Glu Leu Phe Ile Lys Pro Asn Lys Pro
35 40 45

Val Met Ala Phe Gly Leu Val Thr Leu Ser Leu Cys Val Ala Tyr Ile
50 55 60

Gly Tyr Leu His Ala Ile Gln Glu Asn Lys Lys Asp Leu Tyr Glu Ala
65 70 75 80

Ile Asp Ser Glu


Gly 




Ser 


Tyr 


Met 


Arg Arg Lys 


Thr 


Ser 


Lys 


Trp 




85 










90 










95 




Asp 


























<210> 542
<211> 98
<212> PRT
<213> Homo sapiens

<400> 542


























Met Ala Ala Glu Pro Leu Thr Glu Leu Glu Glu Ser Ile Glu Asn Val
1 5 10 15




Val Thr Thr Phe 


Phe 


Thr 


Phe 


Ala Arg Gin Glu Gly Arg 


Lys 




Ser 


20 










25 










30 






Leu Ser Val Asn Glu Phe Lys Glu Leu Val Thr Gln Gln Leu Pro His
35 40 45

Leu Leu Lys Asp Val Gly Ser Leu Asp Glu Lys Met Lys Ser Leu Asp
50 55 60










Val Asn Gin Asp 


Ser 


Glu 


Leu 


Lys 


Phe 




Glu 


Tyr 


Trp 


Arg 


Leu 


He 


65 




70 










75 










80 


Gly Glu Leu Ala Lys Glu Ile Arg Lys Lys Lys Asp Leu Lys Ile Arg
85 90 95

Lys Lys


























<210> 543
<211> 115
<212> PRT
<213> Homo sapiens

<400> 543


























Met Val Ala Ala Lys Lys Thr Lys Lys Ser Leu Glu Ser Ile Asn Ser
1 5 10 15

Arg Leu Gln Leu Val Met Lys Ser Gly Lys Tyr Val Leu Gly Tyr Lys
20 25 30

Gln Thr Leu Lys Met Ile Arg Gln Gly Lys Ala Lys Leu Val Ile Leu
35 40 45

Ala Asn Asn Cys Pro Ala Leu Arg Lys Ser Glu Ile Glu Tyr Tyr Ala
50 55 60

Met Leu Ala Lys Thr Gly Val His His Tyr Ser Gly Asn Asn Ile Glu
65 70 75 80

Leu Gly Thr Ala Cys Gly Lys Tyr Tyr Arg Val Cys Thr Leu Ala Ile
85 90 95

Ile Asp Pro Gly Asp Ser Asp Ile Ile Arg Ser Met Pro Glu Gln Thr
100 105 110

Gly Glu Lys
115

<210> 544 
<211> 102 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 85 

<223> Xaa = Asp,Asn 
<400> 544 



Met Ala Gly Gln Ala Phe Arg Lys Phe Leu Pro Leu Phe Asp Arg Val
1 5 10 15




Leu 


Val 


Glu 




Ser 


Ala Ala 


Glu 


Thr 


Val 


Thr Lys Gly Gly He 


Met 








20








25 




30 




Leu Pro Glu Lys Ser Gln Gly Lys Val Leu Gln Ala Thr Val Val Ala
35 40 45

Val Gly Ser Gly Ser Lys Gly Lys Gly Gly Glu Ile Gln Pro Val Ser
50 55 60

Val Lys Val Gly Asp Lys Val Leu Leu Pro Glu Tyr Gly Gly Thr Lys
65 70 75 80

Val Val Leu Asp Xaa Lys Asp Tyr Phe Leu Phe Arg Asp Gly Asp Ile
85 90 95

Leu Gly Lys Tyr Val Asp
100



<210> 545 
<211> 102 
<212> PRT 
<213> Homo sapiens 

<400> 545 



Met Ala Gly 


Gin 


Ala 


Phe 




Lys Phe 


Leu 


Pro 


Leu 


Phe Asp Arg 


Val 


1 




5 








10 






15 




Leu Val Glu Arg Ser Ala Ala Glu Thr Val Thr Lys Gly Gly Ile Met
20 25 30

Leu Pro Glu Lys Ser Gln Gly Lys Val Leu Gln Ala Thr Val Val Ala
35 40 45

Val Gly Ser Gly Ser Lys Gly Lys Gly Gly Glu Ile Gln Pro Val Ser
50 55 60

Val Lys Val Gly Asp Lys Val Leu Leu Pro Glu Tyr Gly Gly Thr Lys
65 70 75 80

Val Val Leu Asp Asp Lys Asp Tyr Phe Leu Phe Arg Asp Gly Asp Ile
85 90 95

Leu Gly Lys Tyr Val Asp
100




















<210> 546
<211> 116
<212> PRT
<213> Homo sapiens

<400> 546






















Met Ser Ala Thr Ala Ala Thr Ala Pro Pro Ala Ala Pro Ala Gly Glu
1 5 10 15

Gly Gly Pro Pro Ala Pro Pro Pro Asn Leu Thr Ser Asn Arg Arg Leu
20 25 30

Gln Gln Thr Gln Ala Gln Val Asp Glu Val Val Asp Ile Met Arg Val
35 40 45

Asn Val Asp Lys Val Leu Glu Arg Asp Gln Lys Leu Ser Glu Leu Asp
50 55 60

Asp Arg Ala Asp Ala Leu Gln Ala Gly Ala Ser Gln Phe Glu Thr Ser
65 70 75 80

Ala Ala Lys Leu Lys Arg Lys Tyr Trp Trp Lys Asn Leu Lys Met Met
85 90 95

Ile Ile Leu Gly Val Ile Cys Ala Ile Ile Leu Ile Ile Ile Ile Val
100 105 110

Tyr Phe Ser Thr
115






















<210> 547
<211> 97
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 54
<223> Xaa = Glu, Lys

<400> 547






















Met Ser Leu Ser Leu Val Phe Arg Ala Ala Ser Tyr Phe Lys Leu Val
1 5 10 15

Pro Phe His Ser Ser Ser Ser Asn Gln Phe Leu Gln Pro Pro Gly Trp
20 25 30

Val Val Leu Thr Gln Thr Leu Val Leu Leu His Phe Glu Arg Phe Ser
35 40 45

Tyr Gln Asn Val Pro Xaa Ser Ala Gln Gly Lys Gly Asn Leu Gln Pro
50 55 60

Glu Thr Asn Ile His Leu Phe His Phe Leu Thr Phe Pro Lys Gln Ile
65 70 75 80

Ser Arg Asn Leu Phe Asn Ser Leu Leu Cys Leu Met Cys Leu Thr Tyr
85 90 95

Phe






















<210> 548
<211> 113
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 110
<223> Xaa = Ser,Thr

<400> 548






















Met Ser Met Thr Asp Leu Leu Asn Ala Glu Asp Ile Lys Lys Ala Val
1 5 10 15

Gly Ala Phe Ser Ala Thr Asp Ser Phe Asp His Lys Lys Phe Phe Gln
20 25 30

Met Val Gly Leu Lys Lys Lys Ser Ala Asp Asp Val Lys Lys Val Phe
35 40 45

His Met Leu Asp Lys Asp Lys Ser Gly Phe Ile Glu Glu Asp Glu Leu
50 55 60










Gly Phe He 




Lys 


Gly 


Phe Ser 


Pro Asp 


Ala Arg Asp Leu 


Ser 


Ala 


65 






70 






75 








80 


Lys Glu Thr Lys Met Leu Met Ala Ala Gly Asp Lys Asp Gly Asp Gly
85 90 95

Lys Ile Gly Val Asp Glu Phe Ser Thr Leu Val Ala Glu Xaa Lys Lys
100 105 110

His



<210> 549 
<211> 112
<212> PRT 

<213> Homo sapiens 



<400> 549 



Met Asp Pro Arg Lys Val Asn Glu Leu Arg Ala Phe Val Lys Met Cys
1 5 10 15

Lys Gln Asp Pro Ser Val Leu His Thr Glu Glu Met Arg Phe Leu Arg
20 25 30

Glu Trp Val Glu Ser Met Gly Gly Lys Val Pro Pro Ala Thr Gln Lys
35 40 45








Ala 


Lys 


Ser 


Glu 


Glu 


Asn 


Thr Lys 


Glu 


Glu 


Lys 


Pro 




Ser 


Lys 


Lys 




50 










55 








60 










Val Glu Glu Asp Leu Lys Ala Asp Glu Pro Ser Ser Glu Glu Ser Asp
65 70 75 80

Leu Glu Ile Asp Lys Glu Gly Val Ile Glu Pro Asp Thr Asp Ala Pro
85 90 95

Gln Glu Met Gly Asp Glu Asn Ala Glu Ile Thr Ala Gln His Phe Leu
100 105 110







<210> 550 
<211> 118 
<212> PRT 

<213> Homo sapiens 



<400> 550 



Met Val Leu Leu Glu Ser Glu Gln Phe Leu Thr Glu Leu Thr Arg Leu
1 5 10 15

Phe Gln Lys Cys Arg Thr Ser Gly Ser Val Tyr Ile Thr Leu Lys Lys
20 25 30

Tyr Asp Gly Arg Thr Lys Pro Ile Pro Lys Lys Gly Thr Val Glu Gly
35 40 45


Phe 


Glu 


Pro 


Ala 


Asp 


Asn 




Cys 


Leu 


Leu Arg 


Ala Thr Asp Gly Lys 




50 










55 








60 




Lys Lys Ile Ser Thr Val Val Ser Ser Lys Glu Val Asn Lys Phe Gln
65 70 75 80

Met Ala Tyr Ser Asn Leu Leu Arg Ala Asn Met Asp Gly Leu Lys Lys
85 90 95

Arg Asp Lys Lys Asn Glu Thr Lys Lys Thr Lys Ala Ala Ala Ala Ala
100 105 110

Ala Ala Thr Ala Ala Gln
115















<210> 551 
<211> 109 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 40 

<223> Xaa = Gin, Arg 

<220> 

<221> UNSURE 
<222> 10 

<223> Xaa = His, Arg 

<220> 

<221> UNSURE 
<222> 38 

<223> Xaa = Phe, Ile
<220>

<221> UNSURE 
<222> 97 

<223> Xaa = Phe,Leu 
<400> 551 



Met 




Ser 




Pro 


Ser Gly 


Lys 


He 


Xaa 


Arg Lys 


Val 


Lys 


He 




1 








5 








10 








15 




Leu Gly Arg Asn Arg Lys Glu Asn Leu Glu Pro Asn Ala Glu Phe Asp
20 25 30

Lys Arg Thr Glu Phe Xaa Thr Xaa Glu Glu Asn Arg Ile Cys Ser Ser
35 40 45

Pro Val Gln Ser Leu Leu Asp Leu Phe Gln Thr Ser Glu Glu Lys Ser
50 55 60










Glu 


Phe 


Leu 


Gly 


Phe 


Thr Ser 


Tyr 


Thr 


Glu 


Lys Ser 


Gly 


He 


Cys 




65 










70 








75 








80 


Val Leu Asp Ile Trp Glu Glu Glu Asn Ser Asp Asn Leu Leu Thr Ala
85 90 95

Xaa Phe Ser Ser Pro Ser Thr Ser Thr Phe Thr Gly Phe
100 105



<210> 552 
<211> 103 
<212> PRT 
<213> Homo sapiens 

<400> 552 



Met Ser Gly Asp Gly Ala Thr Glu Gln Ala Ala Glu Tyr Val Pro Glu
1 5 10 15

Lys Val Lys Lys Ala Glu Lys Lys Leu Glu Glu Asn Pro Tyr Asp Leu
20 25 30

Asp Ala Trp Ser Ile Leu Ile Arg Glu Ala Gln Asn Gln Pro Ile Asp
35 40 45

Lys Ala Arg Lys Thr Tyr Glu Arg Leu Val Ala Gln Phe Pro Ser Ser
50 55 60








Gly 


Arg 


Phe 


Trp 


Lys 


Leu 


Tyr 


He Glu 


Ala 


Glu 


Val 


Thr He 




Phe 


65 










70 








75 








80 


Tyr Phe Phe Leu Tyr Gln Tyr Cys Ser Ile His Cys Ser Asp Arg Lys
85 90 95




Gin 


Val 


Arg 




He 


Ala 


Asn 

















100 



<210> 553 
<211> 106 
<212> PRT 
<213> Homo sapiens 

<400> 553 



Met Leu Lys Ser Asn Gly Glu Arg Arg Ser Arg Asn Ala Leu Pro Ala
1 5 10 15

Val Tyr Ala Arg Lys Met Ala Ala Ser Gln Gln Gln Ala Ser Ala Ala
20 25 30

Ser Ser Ala Ala Gly Val Ser Gly Pro Ser Ser Ala Gly Gly Pro Gly
35 40 45






Pro 


Gin Gin 


Gin 


Pro 


Gin 


Pro 


Pro 


Ala 


Gin 




Val 


Gly Pro 


Ala 


Gin 




50 








55 










60 








Ser Gly Leu Leu Gln Gln Gln Gln Gln Asp Phe Asp Pro Val Gln Arg
65 70 75 80

Tyr Lys Met Leu Ile Pro Gln Leu Lys Glu Ser Leu Gln Val Ile Gly
85 90 95




Leu 


Lys Gin 




Glu 


Ala 


Asn 


Trp 


He 


Trp 













100 105 



<210> 554
<211> 86 
<212> PRT 

<213> Homo sapiens 
<400> 554 



Met Ala Glu Asp Met Glu Thr Lys Ile Lys Asn Tyr Lys Thr Ala Pro
1 5 10 15




Phe Asp Ser Arg Phe Pro 




Gin 




Gin 


Thr 


Arg 


Asn 


Cys 


Trp Gin 




20 






25 










30 






Asn Tyr Leu Asp Phe His Arg Cys Gln Lys Ala Met Thr Ala Lys Gly
35 40 45

Gly Asp Ile Ser Val Cys Glu Trp Tyr Gln Arg Val Tyr Gln Ser Leu
50 55 60

Cys Pro Thr Ser Trp Val Thr Asp Trp Asp Glu Gln Arg Ala Glu Gly
65 70 75 80

Thr Phe Pro Gly Lys Ile
85






















<210> 555
<211> 105
<212> PRT
<213> Homo sapiens

<400> 555
























Met Cys Val Gly Leu Leu Met Val Ser Ile Leu Thr Asp Ile Asn Leu
1 5 10 15




Ser Asn Leu 


Val Ala Val 




Tyr 


Ser 




Asp 


Tyr Cys 




Phe Arg 




20 






25 










30 






Lys Arg Ser Asp Lys Asn Gln Asp Ala Ser Thr Phe Cys His Asn Cys
35 40 45

Asn Gln Phe His Leu Cys Leu Gln Tyr His His Lys Ile Val Leu Pro
50 55 60

Trp Ser Val Leu Ala Ile Leu Ser Gln Leu Phe Leu His Ile Thr Ser
65 70 75 80

Arg Ile Arg Pro Thr Ile Tyr Lys Ser Asn Lys Pro Lys Ser Ile Glu
85 90 95

Ile Leu Ile Gly Phe Val Leu Asn Leu
100 105
















<210> 556
<211> 101
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 95
<223> Xaa = Cys, Trp

<220>
<221> UNSURE
<222> 86
<223> Xaa = Leu, Val

<400> 556
























Met Glu Arg Pro Asp Lys Ala Ala Leu Asn Ala Leu Gln Pro Pro Glu
1 5 10 15




Phe Arg Asn 


Glu Ser Ser 


Leu 


Ala 


Ser 


Thr 






Thr 




Leu 


Phe 




20 






25 










30 






Phe Thr Ala Leu Met Ile Thr Val Pro Ile Gly Leu Tyr Phe Thr Thr
35 40 45








Lys Ser Tyr 


He Phe Glu 


Gly Ala 


Leu 


Gly 


Met 


Ser 




Arg Asp 


Ser 
50 55 60

Tyr Phe Tyr Ala Ala Ile Val Ala Val Val Ala Val His Val Val Leu
65 70 75 80

Ala Leu Phe Val Tyr Xaa Ala Trp Asn Glu Gly Ser Arg Gln Xaa Arg
85 90 95

Glu Gly Lys Gln Asp
100
90 95 



<210> 557 
<211> 92 
<212> PRT 

<213> Homo sapiens 



<400> 557 

Met Phe Arg Asp Phe Gly Arg Arg Leu Gln Arg Asp Leu Lys Arg Val
1 5 10 15

Val Asp Ala Arg Leu Arg Leu Ser Glu Glu Leu Ser Gly Gly Arg Ile
20 25 30

Lys Pro Lys Pro Val Glu Val Gln Val Val Thr His His Met Gln Arg
35 40 45

Tyr Ala Val Trp Phe Gly Gly Ser Met Leu Ala Ser Thr Pro Glu Phe
50 55 60

Phe Gln Val Cys His Thr Lys Lys Asp Tyr Glu Glu Tyr Gly Pro Ser
65 70 75 80

Ile Cys Arg His Asn Pro Val Phe Gly Val Met Ser
85 90



<210> 558 
<211> 98 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 86 

<223> Xaa = * ,Glu 



<220> 

<221> UNSURE 
<222> 51 

<223> Xaa = Asp, Tyr 



<400> 558 

Met Ser Val Ser Phe His Thr His Thr Lys Glu Leu Trp Thr Trp Met
1 5 10 15

Glu Asp Leu Gln Lys Glu Met Leu Glu Asp Val Cys Ala Asp Ser Val
20 25 30

Asp Ala Val Gln Glu Leu Ile Lys Gln Phe Gln Gln Gln Gln Thr Ala
35 40 45

Thr Leu Xaa Ala Thr Leu Asn Val Ile Lys Glu Gly Glu Asp Leu Ile
50 55 60

Gln Gln Leu Arg Ser Ala Pro Pro Ser Leu Gly Glu Pro Ser Glu Ala
65 70 75 80

Arg Ser Ala Trp Ala Xaa Leu Ser Ser Gly Lys Cys Leu Gly Leu Asp
85 90 95

Val Arg



<210> 559 
<211> 89 
<212> PRT 
<213> Homo sapiens 



<400> 559 
Met Val Ser Gly Asp Gly Phe Leu Val Ser Arg Pro Glu Ala Ile His
1 5 10 15

Leu Gly Pro Arg Gln Ala Val Arg Pro Ser Val Arg Ala Glu Ser Arg
20 25 30

Arg Val Asp Gly Gly Gly Arg Ser Pro Arg Glu Pro Asp Gly Arg Gly
35 40 45

Arg Ser Arg Gln Ala Arg Phe Ser Pro Tyr Pro Ile Pro Ala Val Glu
50 55 60

Pro Asp Leu Leu Arg Ser Val Leu Gln Gln Arg Leu Ile Ala Leu Gly
65 70 75 80

Gly Val Ile Ala Ala Arg Ile Ser Val
85























<210> 560 
<211> 172 
<212> PRT 

<213> Homo sapiens 



<400> 560 



Met 


Ala 


Gly Ala 


Ala 


Thr 


Gin 


Ala 


1 








5 








Met 


Arg 


Leu 


Val 


Ala 


Glu 


Cys 


Ser 








20 










Leu 


Trp 


Leu 


Pro 


His 


Gly 


Thr 


Val 






35 










40 


Gly 


Thr 


Gin 


Ala 


Thr 


Met 


Lys 


Gly 




50 










55 




Leu 


Gly 


Cys 


Arg 


He 


Cys 


Leu 


Gly 


65 










70 






Pro 


Gly 


Pro 


Glu 


Leu 
85 


He 


Gin 


Lys 


Asn 


Trp 


Pro 


His 






Leu 


Thr 






100 










Ser 


Leu 


Val 


Ser 


Leu 


Ser 


Glu 


Val 






115 










120 


Ser 


Pro 


Tyr Asp 


Gly 


Asn 


Glu 


Thr 




130 










135 




Gin 


He 


Gin 


Asn 


Ala 




Gly 


Ser 


145 










150 






Val 


Val 


Ser 


Ser 


Thr 


Val 


Thr 


Gly 



165 



Ser 


Leu 


Glu 


Ser 


Ala Pro Arg He 




10 






15 


Arg 


Ser 


Arg 


Ala Arg Ala Gly Glu 


25 








30 


Ala 


Thr 


Pro 


Val 


Phe Met Pro Val 










45 


He 


Thr 


Thr 


Glu 


Gin Leu Asp Ala 








60 




Asn 


Thr 


Tyr 


His 


Leu Gly Leu Arg 






75 




80 


Ala 


Asn 


Gly 


Leu 


His Gly Phe Met 




90 






95 


Asp 


Ser 


Gly 


Gly Phe Gin Met Val 


105 








110 


Thr 


Glu 


Glu 


Gly Val Arg Phe Arg 










125 


Leu 


Leu 


Ser 


Pro 


Glu Lys Ser Val 








140 




Asp 


He 


He 


Met 


Gln Leu Asp Asp






155 




160 


Pro Arg 


Val 


Glu 






170 









<210> 561 
<211> 528 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 57..527

<220>
<221> sig_peptide
<222> 57..143
<223> Von Heijne matrix
score 14.3000001907349
seq LLVCCLLTPGVQG/QE



<400> 561 

cttttttccc ctgcctgccc ttcgggcacc tcaggaaggc accttcctct gtcaga atg 59 

Met 

gct acc atg gta cca tcc gtg ttg tgg ccc agg gcc tgc tgg act ctg 107
Ala Thr Met Val Pro Ser Val Leu Trp Pro Arg Ala Cys Trp Thr Leu
-25 -20 -15

ctg gtc tgc tgt ctg ctg acc cca ggt gtc cag ggg cag gag ttc ctt 155
Leu Val Cys Cys Leu Leu Thr Pro Gly Val Gln Gly Gln Glu Phe Leu
-10 -5 1




ttg cgg gtg gag ccc cag aac cct gtg ctc tct gct gga ggg tcc ctg 203
Leu Arg Val Glu Pro Gln Asn Pro Val Leu Ser Ala Gly Gly Ser Leu
5 10 15 20




ttt gtg aac tgc agt act gat tgt ccc agc tct gag aaa atc gcc ttg 251
Phe Val Asn Cys Ser Thr Asp Cys Pro Ser Ser Glu Lys Ile Ala Leu
25 30 35




gag acg tcc cta tca aag gag ctg gtg gcc agt ggc atg ggc tgg gca 299
Glu Thr Ser Leu Ser Lys Glu Leu Val Ala Ser Gly Met Gly Trp Ala
40 45 50




gcc ttc aat ctc agc aac gtg act ggc aac agt cgg atc ctc tgc tca 347
Ala Phe Asn Leu Ser Asn Val Thr Gly Asn Ser Arg Ile Leu Cys Ser
55 60 65










gtg tac tgc aat ggc tcc cag ata aca ggc tcc tct aac atc acc gtg 395
Val Tyr Cys Asn Gly Ser Gln Ile Thr Gly Ser Ser Asn Ile Thr Val
70 75 80




tac 


agg 


etc 


ccg 


gag 


cgt 


gtg 


gag 


ctg 


gca 




ctg 




cct 


tgg 


cag 


443 










Glu Arg 


Val 


Glu 


Leu 




Pro 


Leu 


Pro 


Pro 








85 










90 










95 










100 




ccg gtg ggc cag aac ttc acc cct gcg ctg cca agt gga gga tgg gtc 491
Pro Val Gly Gln Asn Phe Thr Pro Ala Leu Pro Ser Gly Gly Trp Val
105 110 115






gcc ccg gac cak cct cac ggt ggt gct gct tcg ctg g 528
Ala Pro Asp Xaa Pro His Gly Gly Ala Ala Ser Leu
120 125



<210> 562 
<211> 682 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 230..682

<220>
<221> sig_peptide
<222> 230..277
<223> Von Heijne matrix
score 13
seq LLLLLSALAGLFG/AA



<400> 562 

gtgtttcgat acagactaag cttttaggcc aaccctcctg actggatggg ggcggcgggc 60
gtggcatgca tgaaaagtaa acatcagaga cctgaagaag cttataaaat agcttgggag 120
aggccagtca ccaagacagg catctcaaat cggctgattc tgcatctgga aactgccttc 180
atcttgaaag aaaagctcca ggtcccttct ccagccaccc agccccaag atg gtg atg 238
Met Val Met
-15

ctg ctg ctg ctg ctt tcc gca ctg gct ggc ctc ttc ggt gcg gca gag 286
Leu Leu Leu Leu Leu Ser Ala Leu Ala Gly Leu Phe Gly Ala Ala Glu
-10 -5 1

gga caa gca ttt cat ctt ggg aag tgc ccc aat cct ccg gtg cag gag 334
Gly Gln Ala Phe His Leu Gly Lys Cys Pro Asn Pro Pro Val Gln Glu
5 10 15

aat ttt gac gtg aat aag tat ctc gga aga tgg tac gaa att gag aag 382
Asn Phe Asp Val Asn Lys Tyr Leu Gly Arg Trp Tyr Glu Ile Glu Lys
20 25 30 35

atc cca aca acc ttt gag aat gga cgc tgc atc cag gcc aac tac tca 430
Ile Pro Thr Thr Phe Glu Asn Gly Arg Cys Ile Gln Ala Asn Tyr Ser
cta


atg 


gaa 


aac 


gga 


aag 


ate 




gtg 


tta 


aac 


eag 


gag 


ttg 


aga 


get 


478 


Leu 


Met 


Glu 


Asn 

55 


Gly 


Lys 


He 


Lys 


Val 
60 




Asn 


Gin 


Glu 


Leu 

65 


Arg 


Ala 




gat gga act gtg aat caa atc gaa ggt gaa gcc acc cca gtt aac ctc 526
Asp Gly Thr Val Asn Gln Ile Glu Gly Glu Ala Thr Pro Val Asn Leu
70 75 80










aca gag cct gcc aag ctg gaa gtt aag ttt tcc tgg ttt atg cca tcg 574
Thr Glu Pro Ala Lys Leu Glu Val Lys Phe Ser Trp Phe Met Pro Ser
85 90 95




gca ccg tac tgg atc ctg gcc acc gac tat gag aac tat gcc ctc gtg 622
Ala Pro Tyr Trp Ile Leu Ala Thr Asp Tyr Glu Asn Tyr Ala Leu Val
100 105 110 115




tat tcc tgt acc tgc atc atc caa ctt ttt cac gtg gat ttt gct tgg 670
Tyr Ser Cys Thr Cys Ile Ile Gln Leu Phe His Val Asp Phe Ala Trp
120 125 130




atc ttg gca aga 682
Ile Leu Ala Arg
135





























<210> 563 
<211> 500 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 72..500

<220>
<221> sig_peptide
<222> 72..122
<223> Von Heijne matrix
score 10.6999998092651
seq VLMLLAVLIWTGA/EN

<220>
<221> misc_feature
<222> 1..3
<223> n=a, g, c or t
<400> 563 

nnngaacatc taatcgaaaa cttgtttcct gagttgtcct gtgctggagg tctgctcaga 60
cgaaggtctc c atg gcg tta gaa gtc ttg atg ctc ctc gct gtc ttg att 110
Met Ala Leu Glu Val Leu Met Leu Leu Ala Val Leu Ile
-15 -10 -5




tgg 




ggt 


get 


gag 


aac 


etc 


cat 


gtg 


aaa 


ata 


agt 


tgc 


tct 


ctg 


gac 


158 


Trp 


Thr 


Gly 


Ala 


Glu 


Asn 




His 


Val 


Lys 


He 


Ser 


Cys 


Ser 


Leu 


Asp 










1 








5 










10 








tgg ttg atg gtc tca gtt atc cca gtt gca gaa agc aga aat ctg tat 206
Trp Leu Met Val Ser Val Ile Pro Val Ala Glu Ser Arg Asn Leu Tyr
15 20 25










ata ttt gcg gat gaa tta cat ctg gga atg ggc tgc cct gca aat cgg 254
Ile Phe Ala Asp Glu Leu His Leu Gly Met Gly Cys Pro Ala Asn Arg
30 35 40




ata cat aca tat gta tat gag ttt ata tat ctt gtt cgt gat tgt ggc 302
Ile His Thr Tyr Val Tyr Glu Phe Ile Tyr Leu Val Arg Asp Cys Gly
45 50 55 60




ate 


agg 




agg 


gta 


gtt 


tct 


gag 


gaa 


act 




ctt 


ttt 


caa 


acc 


gag 


350 


He 


Arg 


Thr 


Arg 


Val 
65 


Val 


Ser 


Glu 


Glu 


Thr 
70 


Leu 


Leu 


Phe 


Gin 


Thr 

75 


Glu 




ctg 


tac 


ttt 






agg 


aat 


ata 


gat 


cat 


gac 


cct 


cag 


gaa 


ate 




398 


Leu 


Tyr 


Phe 


Thr 


Pro 






He 




His 




Pro 


Gin 


Glu 


He 


His 
80 85 90
ttg gag tgt tcc acc tct agg aaa tca gtg tgg ctt aca cca gtt tct 446
Leu Glu Cys Ser Thr Ser Arg Lys Ser Val Trp Leu Thr Pro Val Ser
95 100 105
act gag aat gaa ata aaa ttg gat cct agt cct ttt att gct gac ttt 494
Thr Glu Asn Glu Ile Lys Leu Asp Pro Ser Pro Phe Ile Ala Asp Phe
110 115 120
cag aca 500
Gln Thr
125



<210> 564 
<211> 497 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 52..495

<220>
<221> sig_peptide
<222> 52..102
<223> Von Heijne matrix
score 10.6999998092651
seq VLMLLAVLIWTGA/EN



<400> 564 

cttgtttcct gagttgtcct gtgctggagg tctgctcaga cgaaggtctc c atg gcg 57 

Met Ala 



tta gaa gtc ttg atg ctc ctc gct gtc ttg att tgg acc ggt gct gag 105
Leu Glu Val Leu Met Leu Leu Ala Val Leu Ile Trp Thr Gly Ala Glu
-15 -10 -5 1




aac 


etc 


cat 


gtg 




ata 


agt 


tge 


tct 


ctg 


gae 


tgg 


ttg 


atg 


gtc 


tea 


153 


Asn 


Leu 


His 


Val 
5 


Lys 


He 


Ser 


Cys 


Ser 

10 


Leu 


Asp 


Trp 


Leu 


Met 

15 


Val 


Ser 




gtt 


ate 


eca 


gtt 




gaa 


age 


aga 


aat 


ctg 


tat 




ttt 


gcg 




gaa 


201 


Val 


He 


Pro 
20 


Val 


Ala 


Glu 


Ser 


Arg 
25 






Tyr 


He 


Phe 
30 


Ala 


Asp 


Glu 




tta cat ctg gga atg ggc tgc cct gca aat cgg ata cat aca tat gta 249
Leu His Leu Gly Met Gly Cys Pro Ala Asn Arg Ile His Thr Tyr Val
35 40 45












tat gag ttt ata tat ctt gtt cgt gat tgt ggc atc agg aca agg gta 297
Tyr Glu Phe Ile Tyr Leu Val Arg Asp Cys Gly Ile Arg Thr Arg Val
50 55 60 65




gtt tct gag gaa act ctc ctt ttt caa acc gag ctg tac ttt acc cca 345
Val Ser Glu Glu Thr Leu Leu Phe Gln Thr Glu Leu Tyr Phe Thr Pro
70 75 80




agg aat ata gat cat gac cct cag gaa atc cat ttg gag tgt tcc acc 393
Arg Asn Ile Asp His Asp Pro Gln Glu Ile His Leu Glu Cys Ser Thr
85 90 95








tct agg aaa tca gtg tgg ctt aca cca gtt tct act gag aat gaa ata 441
Ser Arg Lys Ser Val Trp Leu Thr Pro Val Ser Thr Glu Asn Glu Ile
100 105 110






ttg 


gat 




agt 


cct 


ttt 


att 


get 


gae 


ttt 


cag 


aca 




gca 


gaa 


489 


Lys 


Leu 


Asp 


Pro 


Ser 


Pro 


Phe 


He 


Ala 


Asp 


Phe 


Gin 


Thr 


Thr 


Ala 


Glu 





115 120 125 

gag tta gg 497

Glu Leu 

130 



<210> 565 
<211> 342 
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 118..342

<220>
<221> sig_peptide
<222> 118..189
<223> Von Heijne matrix
score 10.6000003814697
seq LLLLLLCPAHVGG/LW

<400> 565 

tccccggccg ccgccgttgc gctcgccgcg ctcgcactga agcccgggcc ctcgcgcgcc 60 
gcggttcgcc ccgcagcctc gccccctgcc cacccgggcg gccgtagggc ggtcacg 117 



atg


ctg


ccg 


ccc 


tta 




tec 


cgc 


etc 


ggg 


ctg 


ctg 


ctg 


ctg 


ctg 


etc 


165 


Met 


Leu 


Pro 


Pro 




Pro 


Ser 


Arg 


Leu 


Gly 


Leu 








Leu 


Leu 












-20 










-15 










-10 






ctg tgc ccg gcg cac gtc ggc gga ctg tgg tgg gct gtg ggc agc ccc 213
Leu Cys Pro Ala His Val Gly Gly Leu Trp Trp Ala Val Gly Ser Pro
-5 1 5




ttg gtt atg gac cct acc agc atc tgc agg aag gca cgg cgg ctg gcc 261
Leu Val Met Asp Pro Thr Ser Ile Cys Arg Lys Ala Arg Arg Leu Ala
10 15 20












ggg cgg cag gcc gag ttg tgc cag gct gag ccg gaa gtg gtg gca gag 309
Gly Arg Gln Ala Glu Leu Cys Gln Ala Glu Pro Glu Val Val Ala Glu
25 30 35 40




ctg 


get 


egg 


ggc 


gcc 


egg 


etc 


ggg 


gtg 




rag 












342 


Leu 


Ala 


Arg 


Gly Ala 


Arg 


Leu 


Gly 


Val 


Arg 


Xaa 















45 50 



<210> 566 
<211> 434 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 242..433

<220>
<221> sig_peptide
<222> 242..307
<223> Von Heijne matrix
score 10.1999998092651
seq LLALAALLQGAVS/LK

<400> 566 

acatttgccc cagggaaggt cacagctgcc tgaactbbtc aaaactccca gacacgcact 60
gcctgtgcag gatccggagc ccagcagcac tgccagggcc ttgaagtgct tattcagaga 120
cctttcttca tagactactt ttttttcttt aagcagcaaa aggagaaaat tgtcatcaaa 180
ggatattcca gattcttgac agcattctcg tcatctctga ggacatcacc atcatctcag 240
g atg agg ggc atg aag ctg ctg ggg gcg ctg ctg gca ctg gcg gcc cta 289
Met Arg Gly Met Lys Leu Leu Gly Ala Leu Leu Ala Leu Ala Ala Leu
-20 -15 -10

ctg cag ggg gcc gtg tcc ctg aag atc gca gcc ttc aac atc cag aca 337
Leu Gln Gly Ala Val Ser Leu Lys Ile Ala Ala Phe Asn Ile Gln Thr
-5 1 5 10

ttt ggg gag acc aag atg tcc aat gcc acc ctc gtc agc tac att gtg 385
Phe Gly Glu Thr Lys Met Ser Asn Ala Thr Leu Val Ser Tyr Ile Val
15 20 25
cag atc ctg agc cgc tat gac atc gcc ctg gtc cag gag gtc aga gac a 434
Gln Ile Leu Ser Arg Tyr Asp Ile Ala Leu Val Gln Glu Val Arg Asp
30 35 40 

<210> 567 
<211> 487 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 136..486

<220>
<221> sig_peptide
<222> 136..234
<223> Von Heijne matrix
score 8.5
seq FFFVLGSLIFCFG/IW
<400> 567 

ctcttttggg gttcttcctt tctctctcag ctctccgtct ctctttctct ctcagcctct 60 
ttctttctcc ctgtctcccc cactgtcagc acctcttctg tgtggtgagt ggaccgctta 120 
ccccactagg tgaag atg tca gcc cag gag agc tgc ctc agc ctc atc aag 171
Met Ser Ala Gln Glu Ser Cys Leu Ser Leu Ile Lys
-30 -25 



tac ttc ctc ttc gtt ttc aac ctc ttc ttc ttc gtc ctc ggc agc ctg 219
Tyr Phe Leu Phe Val Phe Asn Leu Phe Phe Phe Val Leu Gly Ser Leu
-20 -15 -10










ate 


ttc 


tgc 


ttc 


ggc 


ate 


tgg 


ate 




att 


gac 


aag 


ace age 


ttc 


gtg 


267 


He 


Phe 


Cys 


Phe 


Gly 


He 


Trp 


He 


Leu 


He 


Asp 


Lys 


Thr Ser 


Phe 


Val 




-5 










1 








5 








10 






tcc ttt gtg ggc ttg gcc ttc gtg cct ctg cag atc tgg tcc aaa gtc 315
Ser Phe Val Gly Leu Ala Phe Val Pro Leu Gln Ile Trp Ser Lys Val
15 20 25




ctg 


gcc 


ate 


tea 


gga 




ttc 


ace 


atg 


ggc 


ate 


gcc 


etc ctg 


ggt 


tgt 


363 


Leu 


Ala 


He 
30 


Ser 


Gly 


He 


Phe 


Thr 
35 


Met 


Gly 


He 


Ala 


Leu Leu 
40 


Gly 


Cys 




gtg ggg gcc tca agg agc tcc gct gcc tcc tgg gcc tgt att ttg gga 411
Val Gly Ala Ser Arg Ser Ser Ala Ala Ser Trp Ala Cys Ile Leu Gly
45 50 55










tgc tgc tgc tcc tgt ttg cca cac aga tca ccc tgg gaa tcc tca tct 459
Cys Cys Cys Ser Cys Leu Pro His Arg Ser Pro Trp Glu Ser Ser Ser
60 65 70 75




cca ctc agc ggg cca gct gra gcg aag c 487
Pro Leu Ser Gly Pro Ala Xaa Ala Lys
80



<210> 568 
<211> 478 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 233..478

<220>
<221> sig_peptide
<222> 233..328
<223> Von Heijne matrix
score 8.19999980926514
seq LVAAMVLLSVVFC/LY
<400> 568

atgaatgtca gaataggccc attctaccag ctctggctga gcctgagctt ccaaaagtga 60 
gctgagctgt tcaaccttgg atcttaatta ctcctagcag ggataattag gtccctcttt 120 
ctcagattac aggtttgaga agttccaaat atccgatgcc ctagaggttt ttttccctta 180 
attttattgt tacaattggc cttcatctag ctctggatgg ttgaactgta gc atg gca 238 

Met Ala 

aag atg ttt gat ctc agg acg aag atc atg atc ggc atc gga agc agc 286
Lys Met Phe Asp Leu Arg Thr Lys Ile Met Ile Gly Ile Gly Ser Ser
-30 -25 -20 -15

tta ctg gtt gcc gcg atg gtg ctc cta agt gtt gtg ttc tgt ctt tac 334
Leu Leu Val Ala Ala Met Val Leu Leu Ser Val Val Phe Cys Leu Tyr
-10 -5 1

ttc aaa gta gct aag gca cta aaa gct gca aag gac cct gat gct gtg 382
Phe Lys Val Ala Lys Ala Leu Lys Ala Ala Lys Asp Pro Asp Ala Val
5 10 15

gct gta aaa aat cac aac cca gac aag gtg tgt tgg gcc acg aac agc 430
Ala Val Lys Asn His Asn Pro Asp Lys Val Cys Trp Ala Thr Asn Ser
20 25 30

cak gcc aaa gcc acc acc atg gag tct tgt cca tct ctc cag tgc tgt 478
Xaa Ala Lys Ala Thr Thr Met Glu Ser Cys Pro Ser Leu Gln Cys Cys
35 40 45 50

<210> 569 
<211> 469 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 290..469

<220>
<221> sig_peptide
<222> 290..346
<223> Von Heijne matrix
score 7.5
seq LAVFFLVSTQLFA/EE
<400> 569 

agatgttatg ggcatcgacg agttaccgtc tcggaaactc tcaatcacgc aagcgaaagg 60 
agaggaggcg gctaattaaa tattgagcag aaagtcgcgt ggggagaatg tcacgtgggt 120
ctggaggctc aaggaggctg ggataaatac cgcaagcact gagcaggcga aagagcgcgc 180
tcggacctcc ttcccggcgg cagctaccga gagtgcggas gaccagcgtg cgctcggagg 240
aaccagagaa actcagcacc ccgcgggact gtccgtcgca aaatccaac atg aaa atc 298
Met Lys Ile

ctc gtg gcc ttg gca gtc ttt ttt ctt gtc tcc act cag ctg ttt gca 346
Leu Val Ala Leu Ala Val Phe Phe Leu Val Ser Thr Gln Leu Phe Ala
-15 -10 -5

gaa gaa ata gga gcc aat gat gat ctg aat tac tgg tcc gac tgg tac 394
Glu Glu Ile Gly Ala Asn Asp Asp Leu Asn Tyr Trp Ser Asp Trp Tyr
1 5 10 15

gac agc gac cag atc aag gag gaa ctg ccg gag ccc ttt gag cat ctt 442
Asp Ser Asp Gln Ile Lys Glu Glu Leu Pro Glu Pro Phe Glu His Leu
20 25 30

ctg cag aga atc gcc cgg aga ccc aag 469
Leu Gln Arg Ile Ala Arg Arg Pro Lys
35 40

<210> 570 
<211> 489 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS
<222> 253..489

<220>
<221> sig_peptide
<222> 253..336
<223> Von Heijne matrix
score 7.40000009536743
seq CLLLNLAPAPLNA/DS

<400> 570 

ctctttgctc taacagacag cagcgacttt aggctggata atagtcaaat tcttacctcg 60 
ctctttcact gctagtaaga tcagattgcg tttctttcag ttactcttca atcgccagtt 120 
tcttgatctg cttctaaaag aagaagtaga gaagataaat cctgtcttca atacctggaa 180 
ggaaaaacaa aataacctca actccgtttt gaaaaaaaca ttccaagaac tttcatcaga 240 
gattttactt ag atg att tac aca atg aag aaa gta cat gca ctt tgg gct 291
Met Ile Tyr Thr Met Lys Lys Val His Ala Leu Trp Ala
-25 -20
tct gta tgc ctg ctg ctt aat ctt gcc cct gcc cct ctt aat gct gat 339
Ser Val Cys Leu Leu Leu Asn Leu Ala Pro Ala Pro Leu Asn Ala Asp
-15 -10 -5 1

tct gag gaa gat gaa gaa cac aca att atc aca gat acg gag ttg cca 387
Ser Glu Glu Asp Glu Glu His Thr Ile Ile Thr Asp Thr Glu Leu Pro
5 10 15

cca ctg aaa ctt atg cat tca ttt tgt gca ttc aag gcg gat gat ggc 435
Pro Leu Lys Leu Met His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly
20 25 30

cca tgt aaa gca atc atg aaa aga ttt tct tca ata ttt tca ctc gac 483
Pro Cys Lys Ala Ile Met Lys Arg Phe Ser Ser Ile Phe Ser Leu Asp



50 

<210> 571 
<211> 484 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 256..483

<220>
<221> sig_peptide
<222> 256..327
<223> Von Heijne matrix
score 7.09999990463257
seq FLFLFNLIFFILG/AV

<400> 571 

ggggcggggc ctgccgagtc cgcggcgttc cccggctgca gccgggaggg ggccgaggag 60 
tgactgagcc ccgggctgtg cagtccgacg ccgactgagg cacgagcggg tgacgctggg 120 
cctgcagcgc ggagcagaaa gcagaacccg cagagtcctc cctgctgctg tgtggacgac 180 
acgtgggcac aggcagaagt gggccctgtg accagctgca ctggtttcgt ggaaggaagc 240 
tccaggactg gcggg atg ggc tca gcc tgt atc aaa gtc acc aaa tac ttt 291
Met Gly Ser Ala Cys Ile Lys Val Thr Lys Tyr Phe
-20 -15
ctc ttc ctc ttc aac ttg atc ttc ttt atc ctg ggc gca gtg atc ctg 339
Leu Phe Leu Phe Asn Leu Ile Phe Phe Ile Leu Gly Ala Val Ile Leu
-10 -5 1 
ggc ttc ggg gtg tgg atc ctg gcc gac aag agc agt ttc atc tct gtc 387
Gly Phe Gly Val Trp Ile Leu Ala Asp Lys Ser Ser Phe Ile Ser Val
5 10 15 20

ctg caa acc tcc tcc agc tcg ctt agg atg ggg gcc tat gtc ttc atc 435
Leu Gln Thr Ser Ser Ser Ser Leu Arg Met Gly Ala Tyr Val Phe Ile
25 30 35

ggc gtg ggg gca gtc act atg ctc atg ggc ttc ctg ggc tgc atc ggc g 484
Gly Val Gly Ala Val Thr Met Leu Met Gly Phe Leu Gly Cys Ile Gly
40 45 50



<210> 572 
<211> 482 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 29..481

<220>
<221> sig_peptide
<222> 29..82
<223> Von Heijne matrix
score 7
seq VLGLVLLSVTVQG/KV



<400> 572 

agcctagcac tctgacctag cagtcaac atg aag gct ctc att gtt ctg ggg 52
Met Lys Ala Leu Ile Val Leu Gly
-15 



ctt gtc ctc ctt tct gtt acg gtc cag ggc aag gtc ttt gaa agg tgt 100
Leu Val Leu Leu Ser Val Thr Val Gln Gly Lys Val Phe Glu Arg Cys
-10 -5 1 5






gag ttg gcc aga act ctg aaa aga ttg gga atg gat ggc tac agg gga 148
Glu Leu Ala Arg Thr Leu Lys Arg Leu Gly Met Asp Gly Tyr Arg Gly
10 15 20




ate 


age 


eta 






tgg 


atg 


tgt 


ttg 


gcc 




tgg 


gag 




ggt 


tac 


196 


He 


Ser 


Leu 


Ala 


Asn 


Trp Met 


Cys 


Leu 


Ala 


Lys 


Trp 


Glu 


Ser 


Gly 


Tyr 








25 










30 










35 










aac aca cga gct aca aac tac aat gct gga gac aga agc act gat tat 244
Asn Thr Arg Ala Thr Asn Tyr Asn Ala Gly Asp Arg Ser Thr Asp Tyr
40 45 50












ggg ata ttt cag atc aat agc cgc tac tgg tgt aat gat ggc aaa acc 292
Gly Ile Phe Gln Ile Asn Ser Arg Tyr Trp Cys Asn Asp Gly Lys Thr
55 60 65 70




cca gga gca gtt aat gcc tgt cat tta tcc tgc agt gct ttg ctg caa 340
Pro Gly Ala Val Asn Ala Cys His Leu Ser Cys Ser Ala Leu Leu Gln
75 80 85




gat aac atc gct gat gct gta gct tgt gca aag agg gtt gtc cgt gat 388
Asp Asn Ile Ala Asp Ala Val Ala Cys Ala Lys Arg Val Val Arg Asp
90 95 100








eca 


ecc 


acc 


teg 


gcc 


tec 


caa 


agt 


get 


ggg 


att 




ggc 


gtg 




aac 


436 


Pro 


Pro 


Thr 
105 


Ser 


Ala 


Ser 


Gin 


Ser 
110 


Ala 


Gly 


He 


Thr 


Gly 
115 


Val 


Xaa 


Asn 




tge 




egg 




cat 


tea 


gtt 


ctt 


ate 




gaa 


ata 


acc 


cag 


act 


t 


482 


Cys 


Ala 


Arg 


Pro 


His 


Ser 


Val 


Leu 


He 


Lys 


Glu 


He 


Thr 


Gin 


Thr 







120 125 130 



<210> 573 
<211> 541 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS
<222> 294..539

<220>
<221> sig_peptide
<222> 294..380
<223> Von Heijne matrix
score 7
seq LLMLFHTMAQIMA/EQ



<400> 573 

ataactatgt gacattactt agggcataat tctaaaattt ctgtcctagg agacactaga 60 

taactctcaa aaacttctga ttttcttgca tccctttttg gcagagtggt cacagtataa 120 

tgcgggaggc cctcctcccc cgaatcagca acaatttttg ctccaccgtt ttctttcggg 180 

aacagatgcg ttttcagaag acagtactcc agtgataagc ggcgactttg agggattccc 240 

tctctggcgg cctctgcagc agcacagccg gcctcattcg gggcactgcg agt atg 296

Met 



gat ctc caa gga aga ggg gtc ccc agc atc gac aga ctt cga gtt ctc 344
Asp Leu Gln Gly Arg Gly Val Pro Ser Ile Asp Arg Leu Arg Val Leu
-25 -20 -15






ctg atg ttg ttc cat aca atg gct caa atc atg gca gaa caa gaa gtg 392
Leu Met Leu Phe His Thr Met Ala Gln Ile Met Ala Glu Gln Glu Val
-10 -5 1






gaa aat ctc tca ggc ctt tcc act aac cct gaa aaa gat ata ttt gtg 440
Glu Asn Leu Ser Gly Leu Ser Thr Asn Pro Glu Lys Asp Ile Phe Val
5 10 15 20




gtg 


egg 


gaa 


aat ggg 


acq acc 


tgt 


etc 


atg 


gca 


gag 


ttt gca gcc 




488 


Val 


Arg 


Glu 


Asn Gly 


Thr Thr 


Cys 


Leu 


Met 


Ala 


Glu 


Phe Ala Ala 












25 








30 






35 






ttt att gta cct tat gat gtg tgg gcc agc aac tac gta gat ctg atc 536
Phe Ile Val Pro Tyr Asp Val Trp Ala Ser Asn Tyr Val Asp Leu Ile
40 45 50



aca ga 541 
Thr 



<210> 574 
<211> 453 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 274..453

<220>
<221> sig_peptide
<222> 274..345
<223> Von Heijne matrix
score 6.59999990463257
seq KMVHLLVLSGAWG/MQ



<400> 574 

attattggtt gggggaaacc acgaggggac gcggccgagg agggtcgctg tccacccggg 60 
ggcgtgggag tgaggyacca gattcagccc atttggcccc gacgcctctg ttctcggaat 120 
ccgggtgctg cggattgagg tcccggttcc taacgaatct ctgctggatt ggccgtaacc 180
ctgtccccga gcgggctcac agggtctgaa ggccacgcay gaggcaaagg yaaagyycyg 240
agccacccgg tgchtcsttc ccaggascgc aag atg gag gaa ggc ggg aac cta 294
Met Glu Glu Gly Gly Asn Leu
-20

gga ggc ctg att aag atg gtc cat cta ctg gtc ttg tca ggt gcc tgg 342
Gly Gly Leu Ile Lys Met Val His Leu Leu Val Leu Ser Gly Ala Trp
-15 -10 -5 
ggc


atg 


caa 


atg 


tgg 


gtg 








ggc 


ttc 


ctg 


ctt 


ttc 


cga 


390 


Gly 


Met 

1 


Gin 


Met 


Trp 


Val 
5 


Thr 


Phe Val 


Ser 


Gly 
10 


Phe 


Leu 


Leu 


Phe 


15 




agc ctt ccc cga cat acc ttc gga cta gtg cag agc aaa ctc ttc ccc 438
Ser Leu Pro Arg His Thr Phe Gly Leu Val Gln Ser Lys Leu Phe Pro
20 25 30




ttc tac ttc cac atc 453
Phe Tyr Phe His Ile
35

























<210> 575 
<211> 497 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 70..495

<220>
<221> sig_peptide
<222> 70..147
<223> Von Heijne matrix
score 6.09999990463257
seq IGLASLFTENVLA/HP

<400> 575 

ataggatgaa accatggaag tcttsctsag atctggggag ggagcctggc ccaagagccc 60 
gtggagcta atg aaa aca gaa cag ctg aat aga ttt gct gga ttt ggt att 111
Met Lys Thr Glu Gln Leu Asn Arg Phe Ala Gly Phe Gly Ile
-25 -20 -15 



gga ctt gca agt ctc ttt aca gaa aat gta ttg gca cat cct tgc att 159
Gly Leu Ala Ser Leu Phe Thr Glu Asn Val Leu Ala His Pro Cys Ile
-10 -5 1




gtt cta cgc cgc caa tgt cag gtt aat tac cat gct cag cat tac cat 207
Val Leu Arg Arg Gln Cys Gln Val Asn Tyr His Ala Gln His Tyr His
5 10 15 20




etc 


act 


cca 


ttt 




gtc 


ate 


aat 


att 


atg 


tac 


agt 


ttc 


aac 


aaa 


act 


255 


Leu 


Thr 


Pro 


Phe 


Thr 
25 


Val 


He 


Asn 


He 


Met 
30 


Tyr 


Ser 


Phe 


Asn 


Lys 
35 


Thr 




cag 


gga 


cct 


aga 




ctg 


tgg 


aaa 


gga 


atg 


rga 


agt 


aea 


ttt 


att 


gtc 


303 


Gin 


Gly 


Pro 


Arg 
40 


Ala 


Leu 


Trp 


Lys 


Gly 
45 


Met 


Xaa 


Ser 


Thr 


Phe 

50 


He 


Val 




cag 


gga 


gtc 


aca 


ctt 


gga 


gca 




ggc 


ata 


att 




gaa 


ttt 






351 


Gin 


Gly 


Val 
55 


Thr 


Leu 


Gly 


Ala 


Glu 
60 


Gly 


He 


He 


Ser 


Glu 
65 


Phe 


Thr 


Pro 




ttg 




agg 


gag 


gtt 


tta 


cat 


aaa 


tgg 


agt 


cct 


aaa 


caa 


ata 


gga 


gaa 


399 


Leu 


Pro 
70 


Arg 


Glu 


Val 


Leu 


His 
75 


Lys 


Trp 


Ser 


Pro 


80 


Gin 


He 


Gly 


Glu 




cac ctt cta ctg aaa tcc cta act tac gtg gtg gca atg ctt ttt tat 447
His Leu Leu Leu Lys Ser Leu Thr Tyr Val Val Ala Met Leu Phe Tyr
85 90 95 100




tca gca agt ctg att gaa aca gtg cag agt gag ata att cga gat aat 495
Ser Ala Ser Leu Ile Glu Thr Val Gln Ser Glu Ile Ile Arg Asp Asn
105 110 115

497 



<210> 576 
<211> 467 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 88..465

<220>
<221> sig_peptide
<222> 88..219
<223> Von Heijne matrix
score 6.09999990463257
seq ACCLVMVISCVFC/MG

<400> 576 

tcttacacca acctgctcca aaccacaaga ggagttactt gttccagcct cctgtgtgga 60 
ctgctttcct atcaaagcac cttagac atg cac gag gaa gaa ata tac acc tct 114 
Met His Glu Glu Glu Ile Tyr Thr Ser
-40 



ctt cag tgg gat agc cca gca cca gac act tac cag aaa tgt ctg tct 162
Leu Gln Trp Asp Ser Pro Ala Pro Asp Thr Tyr Gln Lys Cys Leu Ser
-35 -30 -25 -20




tcc aac aaa tgt tca gga gca tgc tgt ctt gtg atg gtg att tca tgt 210
Ser Asn Lys Cys Ser Gly Ala Cys Cys Leu Val Met Val Ile Ser Cys
-15 -10 -5






gtt ttc tgc atg gga tta tta acg gca tcc att ttc ttg ggc gtc aag 258
Val Phe Cys Met Gly Leu Leu Thr Ala Ser Ile Phe Leu Gly Val Lys
1 5 10










ttg ttg cag gtg tcc acc att gcg atg cag cag caa gaa aaa ctc atc 306
Leu Leu Gln Val Ser Thr Ile Ala Met Gln Gln Gln Glu Lys Leu Ile
15 20 25




caa caa gag agg gca ctg cta aac ttt aca gaa tgg aag aga agc tgt 354
Gln Gln Glu Arg Ala Leu Leu Asn Phe Thr Glu Trp Lys Arg Ser Cys
30 35 40 45




gcc ctt cag atg aaa tat tgc caa gcc ttc atg caa aac tca tta agt 402
Ala Leu Gln Met Lys Tyr Cys Gln Ala Phe Met Gln Asn Ser Leu Ser
50 55 60




tca gcc cat aac agc agt cct tgt cca aac aat tgg att cag aac aga 450
Ser Ala His Asn Ser Ser Pro Cys Pro Asn Asn Trp Ile Gln Asn Arg
65 70 75








gaa 


agt 


tgt 




tat 


gt 




















467 


Glu 


Ser 


Cys 


Tyr 


Tyr 

























80 



<210> 577 
<211> 485 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 328..483

<220>
<221> sig_peptide
<222> 328..387
<223> Von Heijne matrix
score 5.90000009536743
seq GWMAAGLMIGAGA/CY

<400> 577 

ctagagcaca agtcctgcta gtcgcctccg tctgggtacc agccccctat tactctgcag 60
gcgtgtgaag aaagaaggaa actagctcgg accgtgcagg tttgtaggtc tgttggcctg 120
taggtttcgg cacaagtttc agcgagagaa ggagaaaact gccttggttg gaaccttgca 180
gattcatcac aagagagcta caagagcctg gaagaagctg aagactgcta ccctccatcc 240
ttactcaccc tggacctgag agacctcttc aatcaggtgg agcaaggccc tctcctgtcc 300
tgccccaagg ctggcacaga cttgagc atg ggc cgg gct cgg gaa gtg ggt tgg 354
Met Gly Arg Ala Arg Glu Val Gly Trp
-20 -15
atg gcg gca gga ctg atg att ggg gct ggt gcc tgc tac tgc gtt tac 402
Met Ala Ala Gly Leu Met Ile Gly Ala Gly Ala Cys Tyr Cys Val Tyr
-10 -5 1 5

aaa ctg acc ata gga aga gat gac agt gag aag ctg gag gag gag ggg 450
Lys Leu Thr Ile Gly Arg Asp Asp Ser Glu Lys Leu Glu Glu Glu Gly
10 15 20

gaa gag gag tgg gac gat gac cag gag ctg gat ga 485
Glu Glu Glu Trp Asp Asp Asp Gln Glu Leu Asp



<210> 578
<211> 502
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 39..500

<220>
<221> sig_peptide
<222> 39..83
<223> Von Heijne matrix
      score 5.90000009536743
      seq ILMLTFIICGLLT/RV

<400> 578
attcctcagg acacagagct tcctctctcc caggagcc atg aat atc ctg atg ctg  56
                                          Met Asn Ile Leu Met Leu
                                          -15                 -10

acc ttc att atc tgt ggg ttg cta act cgg gtg acc aaa ggt agc ttt  104
Thr Phe Ile Ile Cys Gly Leu Leu Thr Arg Val Thr Lys Gly Ser Phe
                -5                  1               5

gaa             tgt tgg     aat aat gta gga cat     aga aga cga  152
Glu Pro Gln Lys Cys Trp Lys Asn Asn Val Gly His Cys Arg Arg Arg
        10                  15                  20

tgt tta gat act gaa agg tac ata ctt ctt tgt agg aac aag cta tca  200
Cys Leu Asp Thr Glu Arg Tyr Ile Leu Leu Cys Arg Asn Lys Leu Ser
    25                  30                  35

tgc tgc att tct ata ata tca cat gaa tat act cga     cca gca ttt  248
Cys Cys Ile Ser Ile Ile Ser His Glu Tyr Thr Arg Arg Pro Ala Phe
40                  45                  50                  55

cct gtg att cac cta gag     ata     ttg     tat         gtg gac  296
Pro Val Ile His Leu Glu Asp Ile Thr Leu Asp Tyr Ser Asp Val Asp
                60                  65                  70

tct ttt act ggt tcc cca gta tct atg ttg aat gat ctg ata aca ttt  344
Ser Phe Thr Gly Ser Pro Val Ser Met Leu Asn Asp Leu Ile Thr Phe
            75                  80                  85

gac aca act     ttt gga gaa acc atg     cct gag acc aat act cct  392
Asp Thr Thr Lys Phe Gly Glu Thr Met Thr Pro Glu Thr Asn Thr Pro
        90                  95                  100

gag act     atg cca cca tcc gag gcc act act ccc gag act act atg  440
Glu Thr Thr Met Pro Pro Ser Glu Ala Thr Thr Pro Glu Thr Thr Met
    105                 110                 115

cca cca tct gag act gct act tcc gag act atg cca cca cct tct cag  488
Pro Pro Ser Glu Thr Ala Thr Ser Glu Thr Met Pro Pro Pro Ser Gln
120                 125                 130                 135

aca gct ctt act ca  502
Thr Ala Leu Thr

<210> 579
<211> 754
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 259..753

<220>
<221> sig_peptide
<222> 259..375
<223> Von Heijne matrix
      score 5.80000019073486
      seq FCVCVIAIGVVQA/LI

<400> 579
aagctcccgg ccgggctgac tcaagcggag gcgcgcggaa cagtcgccga ggcgattccc  60
gcccagcagt tcgacagaag tgtacagagg cttctggcaa cacggattgc cgtctacctg  120
atgacctttc tcatcgtgac agtggcctgg gcagcacaca caaggttgtt ccaagttgtt  180
gggaaaacag acgacacact tgccctgctc aacctggccg catcatggct gtgatgccct  240
ccctccctcc aggcctgc atg atg     atc acc ttc ctg cct tac acg ttt  291
                    Met Met Thr Ile Thr Phe Leu Pro Tyr Thr Phe
                                    -35                 -30

tcg tta atg gtg acc ttc cct gat gtg cct ctg ggc atc ttc ttg ttc  339
Ser Leu Met Val Thr Phe Pro Asp Val Pro Leu Gly Ile Phe Leu Phe
            -25                 -20                 -15

tgt gtg tgt gtg atc gcc atc ggg gtc gtg cag gca ctg att gtr ggg  387
Cys Val Cys Val Ile Ala Ile Gly Val Val Gln Ala Leu Ile Val Gly
        -10                 -5                  1

tac gca ttc cac ttc ccg cac ctg ctg agc ccg cag atc cag cgc tct  435
Tyr Ala Phe His Phe Pro His Leu Leu Ser Pro Gln Ile Gln Arg Ser
5                   10                  15                  20

gcc cac agg gct ctg tac cga cga cac gtc ctg ggc atc gtc ctc caa  483
Ala His Arg Ala Leu Tyr Arg Arg His Val Leu Gly Ile Val Leu Gln
                25                  30                  35

ggc ccg gcc ctg tgc ttt gca gcg gcc atc ttc tct ctc ttc ttt gtc  531
Gly Pro Ala Leu Cys Phe Ala Ala Ala Ile Phe Ser Leu Phe Phe Val
            40                  45                  50

ccc ttg tct tac ctg ctg atg gtg act gtc atc ctc ctc ccc tat gtc  579
Pro Leu Ser Tyr Leu Leu Met Val Thr Val Ile Leu Leu Pro Tyr Val
        55                  60                  65

agc aag gtc acc ggc tgg     aga gac agg ctc ctg ggc cac agg gag  627
Ser Lys Val Thr Gly Trp Cys Arg Asp Arg Leu Leu Gly His Arg Glu
    70                  75                  80

                    gtg gaa gtc ttc tcg ttt gac ctc cac gag      675
Pro Ser Ala His Pro Val Glu Val Phe Ser Phe Asp Leu His Glu Pro
85                  90                  95                  100

ctc agc aag gag     gtg gaa gcc ttc agc gac gga gtc tac gcc atc  723
Leu Ser Lys Glu Arg Val Glu Ala Phe Ser Asp Gly Val Tyr Ala Ile
                105                 110                 115

gtg gcc acg ctt ctc atc ctg gac atc tgg t  754
Val Ala Thr Leu Leu Ile Leu Asp Ile Trp
            120                 125



<210> 580
<211> 488
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 54..488

<220>
<221> sig_peptide
<222> 54..119
<223> Von Heijne matrix
      score 5.80000019073486
      seq PVTLTSLLPMVTX/GP

<220>
<221> misc_feature
<222> 181,256,369
<223> n=a, g, c or t

<400> 580
gtctttttaa tctctgcagt cccctgagtg tcacccaacc tgcaggcagt gtc atg  56
                                                           Met

ggc cgt tgg agt ctg act gca tct cct gtt acc ctc aca tct ctc tta  104
Gly Arg Trp Ser Leu Thr Ala Ser Pro Val Thr Leu Thr Ser Leu Leu
    -20                 -15                 -10

cca atg gta act kca gga cca gct gca ggg aag agc agt tcc tca acc  152
Pro Met Val Thr Xaa Gly Pro Ala Ala Gly Lys Ser Ser Ser Ser Thr
-5                  1               5                   10

tcg tgg gat act gtg ctg act gcc atc wnc tgt gcc agc act gtg cag  200
Ser Trp Asp Thr Val Leu Thr Ala Ile Xaa Cys Ala Ser Thr Val Gln
            15                  20                  25

ctg atc tcc aca aca ctg gga gca tct gcc tca ggt gcc aga atg ccc  248
Leu Ile Ser Thr Thr Leu Gly Ala Ser Ala Ser Gly Ala Arg Met Pro
        30                  35                  40

act acc tnc tgc tcg ggg acc act gtg ttc ctg act gcc ctt cag gat  296
Thr Thr Xaa Cys Ser Gly Thr Thr Val Phe Leu Thr Ala Leu Gln Asp
    45                  50                  55

act atg cag aga gag gag ctt gta     aat gcc act cct cct gca gaa  344
Thr Met Gln Arg Glu Glu Leu Val Lys Asn Ala Thr Pro Pro Ala Glu
60                  65                  70                  75

cct gcc agg gca gag gac ctt tct nct gct cct cat gtg aca cca acc  392
Pro Ala Arg Ala Glu Asp Leu Ser Xaa Ala Pro His Val Thr Pro Thr
                80                  85                  90

tcg tgc tgt         ctg gca cct gca gca cca cct gct tcc ctg ggc  440
Ser Cys Cys Pro Thr Leu Ala Pro Ala Ala Pro Pro Ala Ser Leu Gly
            95                  100                 105

act atc ttg atg aca atc atg ttt gcc agc gta ggt ttt tcc aat gaa  488
Thr Ile Leu Met Thr Ile Met Phe Ala Ser Val Gly Phe Ser Asn Glu
        110                 115                 120



<210> 581
<211> 343
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 17..343

<220>
<221> sig_peptide
<222> 17..106
<223> Von Heijne matrix
      score 5.69999980926514
      seq LGVNLVVMPPTRA/RS

<400> 581
atagattaca acggtg atg gcg ggc aag cgg tcc ggc tgg agc cgg gcg gct  52
                  Met Ala Gly Lys Arg Ser Gly Trp Ser Arg Ala Ala
                  -30                 -25                 -20

ctc ctc cag ctc ctt ctc ggc gtg aac ctg gtg gtg atg ccg ccc acc  100
Leu Leu Gln Leu Leu Leu Gly Val Asn Leu Val Val Met Pro Pro Thr
            -15                 -10                 -5

cgg gcc cgg agt ctg cgc ttc gtt acc ttg gag ggg atg cta cag cac  148
Arg Ala Arg Ser Leu Arg Phe Val Thr Leu Glu Gly Met Leu Gln His
        1               5                   10

tgg gaa ctg ggc cag gcc ctg cgg cag cgc tat cac ggc ttc cta aac  196
Trp Glu Leu Gly Gln Ala Leu Arg Gln Arg Tyr His Gly Phe Leu Asn
15                  20                  25                  30

acc tct tat cac cgg caa gag gtt tat gtg cga arg tac ttt gac cgg  244
Thr Ser Tyr His Arg Gln Glu Val Tyr Val Arg Xaa Tyr Phe Asp Arg
                35                  40                  45

act ctc atg agt gct gag gcc aac ctg gct gga stc ttc cct ccc aac  292
Thr Leu Met Ser Ala Glu Ala Asn Leu Ala Gly Xaa Phe Pro Pro Asn
            50                  55                  60

ggg atk cag cgc ttc aac ccg aac atc tcg tgg cag cct att cct gtg  340
Gly Xaa Gln Arg Phe Asn Pro Asn Ile Ser Trp Gln Pro Ile Pro Val
        65                  70                  75

His



<210> 582
<211> 465
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 293..463

<220>
<221> sig_peptide
<222> 293..388
<223> Von Heijne matrix
      score 5.59999990463257
      seq WMGLACFRSLAAS/SP

<400> 582
aaaagattga agaggaactc tttcaaacct ttcaagtgcc ccctcctttc cttaaagtct  60
tttatagggg tccccttctt ggccatctcc atcctgtgag tcaggactga aagggcacag  120
acaggtcact gccagcattg ttggggcaag cctgcaagca cgcatcactg gggatctgac  180
atgacaatgg ccgcctgccc cctctgaggg ctacaggact taccccagtg ggaagcagct  240
aagcaggtct gaccagccga cctggacctg gccaagggtc ctgtcatccc tc atg gcc  298
                                                          Met Ala

acc ccg cca ttc cgg ctg ata agg aag atg ttt tcc ttc aag gtg agc  346
Thr Pro Pro Phe Arg Leu Ile Arg Lys Met Phe Ser Phe Lys Val Ser
-30                 -25                 -20                 -15

aga tgg atg ggg ctt gcc tgc ttc cgg tcc ctg gcg gca tcc tct ccc  394
Arg Trp Met Gly Leu Ala Cys Phe Arg Ser Leu Ala Ala Ser Ser Pro
                -10                 -5                  1

agt att cgc cag aag aaa cta atg cac aag ctg cag gag gaa aac gct  442
Ser Ile Arg Gln Lys Lys Leu Met His Lys Leu Gln Glu Glu Asn Ala
        5                   10                  15

ttt cgc gaa gag atg     att tt  465
Phe Arg Glu Glu Met     Ile
    20                  25

<210> 583 
<211> 539 
<212> DNA 
<213> Homo sapiens 

<220> 






WO 02/083898 PCT/IB01/00914



<220>
<221> sig_peptide
<222> 302..358
<223> Von Heijne matrix
      score 5.40000009536743
      seq LCCSGCVPSLCCS/SY

<400> 583
aaaaacttcc gccgccgcgt ccgccgcctc cggaactaaa cggggtgagg tcacattcgg  60
ttatctctaa cgttggaaaa cgatggagct aacacccatt atggagatta accacttttc  120
atcaggtttt taacttaagt cgtgaggaat acaacggtga acacaagatt cattttattt  180
tcatcaccat gggacgtatc ctgttgttga gtwctctggg tcagacctct gaagacttct  240
cagatggatc ctagtctctg ggcttgcctg aaattactcg ctgctcaggg agagagttga  300
a atg gtt ggc atc ctc cca ctc tgt tgc tcc ggc tgt gtc ccc tcg ctc  349
  Met Val Gly Ile Leu Pro Leu Cys Cys Ser Gly Cys Val Pro Ser Leu
                  -15                 -10                 -5

tgt tgt tcc agc tat gtc ccc tct gtt gct cca act gca gct cat tct  397
Cys Cys Ser Ser Tyr Val Pro Ser Val Ala Pro Thr Ala Ala His Ser
            1               5                   10

gtt aga gtt cct cat tca gct ggt cac tgt ggc cag agg gtg ttg gcc  445
Val Arg Val Pro His Ser Ala Gly His Cys Gly Gln Arg Val Leu Ala
    15                  20                  25

tgc tcc ctt cct caa gta ttc tta aag cca tgg att ttt gtg gag cat  493
Cys Ser Leu Pro Gln Val Phe Leu Lys Pro Trp Ile Phe Val Glu His
30                  35                  40                  45

ttt tct tcc tgg ctc tcc ctt gag tta ttt tcc ttt ctt cgc tat c  539
Phe Ser Ser Trp Leu Ser Leu Glu Leu Phe Ser Phe Leu Arg Tyr
                50                  55                  60



<210> 584
<211> 526
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 85..525

<220>
<221> sig_peptide
<222> 85..144
<223> Von Heijne matrix
      score 5.19999980926514
      seq SPVFLVFPPEITA/SE

<400> 584
cctartcgaa aaacaagaaa agaacatggt ctagactgaa gtaccaacta aatcatctcc  60
tttcaaatta tcaccgacac catc atg gat tca agc acc gca cac agt ccg  111
                           Met Asp Ser Ser Thr Ala His Ser Pro
                           -20                 -15

gtg ttt ctg gta ttt cct cca gaa atc act gct tca gaa tat gag tcc  159
Val Phe Leu Val Phe Pro Pro Glu Ile Thr Ala Ser Glu Tyr Glu Ser
    -10                 -5                  1               5

aca gaa ctt tca gcc acg acc ttt tca act caa agc ccc ttg caa aaa  207
Thr Glu Leu Ser Ala Thr Thr Phe Ser Thr Gln Ser Pro Leu Gln Lys
                10                  15                  20

tta ttt gct aga aaa atg aaa atc tta ggg act atc cag atc ctg ttt  255
Leu Phe Ala Arg Lys Met Lys Ile Leu Gly Thr Ile Gln Ile Leu Phe
            25                  30                  35

gga att atg acc ttt tct ttt gga gtt atc ttc ctt ttc act ttg tta  303
Gly Ile Met Thr Phe Ser Phe Gly Val Ile Phe Leu Phe Thr Leu Leu
        40                  45                  50

        tat cca agg ttt ccc ttt ata ttt ctt tca gga tat     ttc  351
Lys Pro Tyr Pro Arg Phe Pro Phe Ile Phe Leu Ser Gly Tyr Pro Phe
    55                  60                  65

tgg ggc tct gtt ttg ttc att aat tct gga gcc ttc cta att gca gtg  399
Trp Gly Ser Val Leu Phe Ile Asn Ser Gly Ala Phe Leu Ile Ala Val
70                  75                  80                  85

aaa aga     acc aca gaa act ctg ata ata ttg agc cga ata atg aat  447
Lys Arg     Thr Thr Glu Thr Leu Ile Ile Leu Ser Arg Ile Met Asn
                90                  95                  100

ttt ctt agt gcc ctg gga gca ata gct gga atc att ctc ctc aca ttt  495
Phe Leu Ser Ala Leu Gly Ala Ile Ala Gly Ile Ile Leu Leu Thr Phe
            105                 110                 115

ggg ttt cat cct aga tca aaa cta cat ttg t  526
Gly Phe His Pro Arg Ser Lys Leu His Leu
        120                 125



<210> 585
<211> 482
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 195..482

<220>
<221> sig_peptide
<222> 195..293
<223> Von Heijne matrix
      score 5.09999990463257
      seq LYALVQLGQPCDC/LP

<400> 585
gcacagatga ccggaagcgt gccctggcgg cgctgccagc cccagctggg agctggtagg  60
ggagtgcgag gaaggcggga cgcagggcgg agccgctgca ggggcggggc ctgcggacag  120
gagccggggt ttggggcggg aacccctcgt cccctgcaga ccccgcctgc tcgggcgcgg  180
gcggcggcgc ggcc atg     ctg aag ctg aag aac gtg ttt ctc gcc tac  230
                Met Lys Leu Lys Leu Lys Asn Val Phe Leu Ala Tyr
                            -30                 -25

ttc ctg gtg tcg atc gcc ggc ctc ctc tac gcg ctg gta cag ctc ggc  278
Phe Leu Val Ser Ile Ala Gly Leu Leu Tyr Ala Leu Val Gln Leu Gly
    -20                 -15                 -10

cag     tgt     tgc ctt     ccc ctg cgg gca gca gcs gag cag cta  326
Gln Pro Cys Asp Cys Leu Pro Pro Leu Arg Ala Ala Ala Glu Gln Leu
-5                  1               5                   10

cgg     aag gat ctg agg att tcc cag ctg caa gcg gaa ctc cga cgg  374
Arg Gln Lys Asp Leu Arg Ile Ser Gln Leu Gln Ala Glu Leu Arg Arg
            15                  20                  25

cca ccc cct gcc cct gcc cag ccc cct gaa ccc gag gcc ctg cct act  422
Pro Pro Pro Ala Pro Ala Gln Pro Pro Glu Pro Glu Ala Leu Pro Thr
        30                  35                  40

atc tat gtt gtt acc ccc acc tat gcc agg ctg gta cag aag gca gag  470
Ile Tyr Val Val Thr Pro Thr Tyr Ala Arg Leu Val Gln Lys Ala Glu
    45                  50                  55

ctg gta cga ctg  482
Leu Val Arg Leu
60



<210> 586
<211> 543
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 35..541

<220>
<221> sig_peptide
<222> 35..82
<223> Von Heijne matrix
      score 5
      seq CGLFLRTTAAARA/CR

<400> 586
agactgaggc tgaggctggg gaacatcggg cagc atg agc ggc tgc ggg ctc ttc  55
                                      Met Ser Gly Cys Gly Leu Phe
                                          -15                 -10

ctg cgc acc acg gct gcg gct cgt gcc tgc cgg ggt     gtg gtc tct  103
Leu Arg Thr Thr Ala Ala Ala Arg Ala Cys Arg Gly Leu Val Val Ser
                -5                  1               5

acc gcg aac cgg cgg cta ctg cgc acc agc ccg cct gta     gct ttc  151
Thr Ala Asn Arg Arg Leu Leu Arg Thr Ser Pro Pro Val Arg Ala Phe
        10                  15                  20

gcc aaa gag ctt ttc cta ggc aaa atc aag aag aaa gaa gtt ttc cca  199
Ala Lys Glu Leu Phe Leu Gly Lys Ile Lys Lys Lys Glu Val Phe Pro
    25                  30                  35

ttt cca gaa gtt agc caa gat gaa ctt aat gaa atc aat cag ttc ttg  247
Phe Pro Glu Val Ser Gln Asp Glu Leu Asn Glu Ile Asn Gln Phe Leu
40                  45                  50                  55

gga ccc gtg gaa aaa ttc ttc act gaa gag gtg gac tcc cga aaa att  295
Gly Pro Val Glu Lys Phe Phe Thr Glu Glu Val Asp Ser Arg Lys Ile
                60                  65                  70

gac cag gaa ggg aaa atc cca gat gaa act ttg gag aaa ttg aag agc  343
Asp Gln Glu Gly Lys Ile Pro Asp Glu Thr Leu Glu Lys Leu Lys Ser
            75                  80                  85

cta ggg ctt ttt ggg     caa gtc cca gaa gaa tat ggt ggc ctg ggc  391
Leu Gly Leu Phe Gly Leu Gln Val Pro Glu Glu Tyr Gly Gly Leu Gly
        90                  95                  100

ttc tcc aac acc atg tac tca cga cta ggg gag atc atc agc atg gat  439
Phe Ser Asn Thr Met Tyr Ser Arg Leu Gly Glu Ile Ile Ser Met Asp
    105                 110                 115

ggg tcc atc act gtg     ctg gca gcg cac cag gct att ggc tca agg  487
Gly Ser Ile Thr Val Thr Leu Ala Ala His Gln Ala Ile Gly Ser Arg
120                 125                 130                 135

gga tca tct tgg ctg gca ctg agg agc aga aag cca aat aac ttg cct  535
Gly Ser Ser Trp Leu Ala Leu Arg Ser Arg Lys Pro Asn Asn Leu Pro
                140                 145                 150

aaa ctg gc  543
Lys Leu

<210> 587
<211> 473
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 64..471

<220>
<221> sig_peptide
<222> 64..171
<223> Von Heijne matrix
      score 4.80000019073486
      seq LLLLRLNDAALRA/LQ

<400> 587
aggaggtggc ggtggtggcc ctcgcctgtg gcccccgtgc tgcttgcact cgaactcgtc  60
gcc atg gag gag ctc cag gag cct ctg aga gga gag     cgg     tgc  108
    Met Glu Glu Leu Gln Glu Pro Leu Arg Gly Glu Leu Arg     Cys
        -35                 -30                 -25

ttc acg caa gct gcc cgg act agc ctc tta ctg ctc agg ctc aac gac  156
Phe Thr Gln Ala Ala Arg Thr Ser Leu Leu Leu Leu Arg Leu Asn Asp
    -20                 -15                 -10

gct gcc ctg cgg gcg ctg caa gag tgt cag cgg caa     gta cgg      204
Ala Ala Leu Arg Ala Leu Gln Glu Cys Gln Arg Gln Gln Val Arg Pro
-5                  1               5                   10

gtg att gct ttc caa ggc cac cga ggg tat ctg aga ctc cca ggc cct  252
Val Ile Ala Phe Gln Gly His Arg Gly Tyr Leu Arg Leu Pro Gly Pro
            15                  20                  25

ggt tgg tcc tgc ctc ttc tcc ttc ata gtg tcc cag tgt tgt cag gag  300
Gly Trp Ser Cys Leu Phe Ser Phe Ile Val Ser Gln Cys Cys Gln Glu
        30                  35                  40

ggc gct ggt ggt agc ttg gac ctt gtg tgc caa cgc ttc ctc agg tct  348
Gly Ala Gly Gly Ser Leu Asp Leu Val Cys Gln Arg Phe Leu Arg Ser
    45                  50                  55

ggg cct aac agc ctc cac tgc ctg ggc tca ctc agg gag     ctc att  396
Gly Pro Asn Ser Leu His Cys Leu Gly Ser Leu Arg Glu Arg Leu Ile
60                  65                  70                  75

att tgg gca gcc atg gat tct atc cca gcc cca tca tca gtt cag gga  444
Ile Trp Ala Ala Met Asp Ser Ile Pro Ala Pro Ser Ser Val Gln Gly
                80                  85                  90

    aay ctg act gaa gat gcc aga cat cc  473
His Asn Leu Thr Glu Asp Ala Arg His
            95                  100

<210> 588
<211> 509
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 183..509

<220>
<221> sig_peptide
<222> 183..251
<223> Von Heijne matrix
      score 4.59999990463257
      seq GHSLVGLALLCIA/AN

<400> 588
aggaccaatg aaagcaaagt acctcatccc ctcagtgact aagaatcgca gtatttaaga  60
ggtagcagga atgggctgag agtggtgttt gctttctcca ccagaagggc acactttcat  120
ctaatttggg gtatcactga gctgaagaca aagagaaggg ggagaaaacc tagcagacca  180
cc atg tgc tat ggg aag tgt gca cga tgc atc gga cat tct ctg gtg  227
   Met Cys Tyr Gly Lys Cys Ala Arg Cys Ile Gly His Ser Leu Val
               -20                 -15                 -10

ggg ctc gcc ctc ctg tgc atc gcg gct aat att ttg ctt tac ttt ccc  275
Gly Leu Ala Leu Leu Cys Ile Ala Ala Asn Ile Leu Leu Tyr Phe Pro
            -5                  1               5

aat ggg gaa aca aag tat gcc tcc gaa aac cac ctc agc cgc ttc gtg  323
Asn Gly Glu Thr Lys Tyr Ala Ser Glu Asn His Leu Ser Arg Phe Val
    10                  15                  20

tgg ttc ttt tct ggc atc gta gga ggt ggc ctg ctg atg ctc ctg cca  371
Trp Phe Phe Ser Gly Ile Val Gly Gly Gly Leu Leu Met Leu Leu Pro
25                  30                  35                  40

gca ttt gtc ttc att ggg ctg gaa cag gat gac tgc tgt ggc tgc tgt  419
Ala Phe Val Phe Ile Gly Leu Glu Gln Asp Asp Cys Cys Gly Cys Cys
                45                  50                  55

ggc cat gaa aac tgt ggc aaa cga tgt gcg atg ctt tct tct gta ttg  467
Gly His Glu Asn Cys Gly Lys Arg Cys Ala Met Leu Ser Ser Val Leu
            60                  65                  70

gct gct ctc att gga att gca gga yct ggc tac tgt gtc att  509
Ala Ala Leu Ile Gly Ile Ala Gly Xaa Gly Tyr Cys Val Ile
        75                  80                  85

<210> 589
<211> 523
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 302..523

<220>
<221> sig_peptide
<222> 302..352
<223> Von Heijne matrix
      score 4.59999990463257
      seq LLPLCQRIHCIIG/RF

<400> 589
agatgagtgt tcagctctca gcagagaggt tagctcctct ctgcagcttg tcctgttgtc  60
tcctcaagtc tggctgagtc cggagttttt atgagcctca gaggggagga agtgcatgct  120
gattaatcca tgggcaggcc tggaaaagtt cccactccag tctgcgggac ccacagcctg  180
gccctcaggc tcaggcttcc cwggcttgaa gattgggctt cacctgggac ctaccccttc  240
tgcctaggag catgtctgcc tcctgctgcc tttcatggtg cccagccaag gctaagtcga  300
a atg tgg ccc aac ctt ctt ccc ctg tgc cag cgg atc cat tgc atc att  349
  Met Trp Pro Asn Leu Leu Pro Leu Cys Gln Arg Ile His Cys Ile Ile
          -15                 -10                 -5

ggt cgc ttc cgg     aat ggg ttt gag gac tgt ccc gat ggc agc gat  397
Gly Arg Phe Arg Cys Asn Gly Phe Glu Asp Cys Pro Asp Gly Ser Asp
    1               5                   10                  15

gaa gag     tgc aca gca aac cct ctg ctt tgc tcc acc gcc cgc tac  445
Glu Glu Asn Cys Thr Ala Asn Pro Leu Leu Cys Ser Thr Ala Arg Tyr
                20                  25                  30

    tgc aag     ggc     tgt att gac aag agc ttc atc tgc gat gga  493
His Cys Lys     Gly     Cys Ile Asp Lys Ser Phe Ile Cys Asp Gly
            35                  40                  45

cag aat aac tgt caa gac aac agt gat gag  523
Gln Asn Asn Cys Gln Asp Asn Ser Asp Glu
        50                  55

<210> 590
<211> 513
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 103..513

<220>
<221> sig_peptide
<222> 103..243
<223> Von Heijne matrix
      score 4.40000009536743
      seq MATMLQLVPLLKA/SI

<400> 590
aatcattaca gaaggatttg gcctctaagg gcctctggaa ttctttggaa gaaggcattg  60
gctgtgtagc tgacactcta ctcactgtgg acatggacgt cc atg acc ctg cag  114
                                               Met Thr Leu Gln
                                                       -45

tgg cac ttg tat gag atg gca cgc aac ctg aag gtg cag gat atg ctg  162
Trp His Leu Tyr Glu Met Ala Arg Asn Leu Lys Val Gln Asp Met Leu
            -40                 -35                 -30

cgg gca gag gtc ttg gct gcg cgg cac cag gcc cag gga gac atg gcc  210
Arg Ala Glu Val Leu Ala Ala Arg His Gln Ala Gln Gly Asp Met Ala
        -25                 -20                 -15

acg atg cta cag ctg gtc ccc ctc ctc aaa gcc agc atc aag gag aca  258
Thr Met Leu Gln Leu Val Pro Leu Leu Lys Ala Ser Ile Lys Glu Thr
    -10                 -5                  1               5

cta aga ctt cac ccc atc tcc gtg acc ctg cag aga tat ctt gta aat  306
Leu Arg Leu His Pro Ile Ser Val Thr Leu Gln Arg Tyr Leu Val Asn
                10                  15                  20

gac ttg gtt ctt cga gat tac atg att cct gcc aag aca ctg gtg      354
Asp Leu Val Leu Arg Asp Tyr Met Ile Pro Ala Lys Thr Leu Val Gln
            25                  30                  35

gtg gcc atc tat gct ctg ggc cga gag ccc acc ttc ttc ttc gac ccg  402
Val Ala Ile Tyr Ala Leu Gly Arg Glu Pro Thr Phe Phe Phe Asp Pro
        40                  45                  50

gaa aat ttt gac cca acc     tgg ctg agc     gac aag aac atc acc  450
Glu Asn Phe Asp Pro Thr Arg Trp Leu Ser Lys Asp Lys Asn Ile Thr
    55                  60                  65

tac ttc cgg aac ttg ggc ttt ggc tgg ggt gtg cgg cag tgt ctg gga  498
Tyr Phe Arg Asn Leu Gly Phe Gly Trp Gly Val Arg Gln Cys Leu Gly
70                  75                  80                  85

cgg csg atc gct gag  513
Arg Xaa Ile Ala Glu

<210> 591
<211> 490
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 196..489

<220>
<221> sig_peptide
<222> 196..255
<223> Von Heijne matrix
      score 4.30000019073486
      seq LLDSSLMASGTAS/RS

<400> 591
agacgtacct cacggaagcc ggctttggcc ctgcggctgc taccgtcgcc gcggagaaat  60
tgttggatct ggcagtctag gaatgaatct cctctcagcc tttaagctca cctggtcaga  120
atccttggat gagcctgtgg gaccgttcct cctagcccgg tggtttggaa ccagtggctt  180
tgggactgta agagg atg gac aaa gat tct cag ggg ctg cta gat tca tcc  231
                 Met Asp Lys Asp Ser Gln Gly Leu Leu Asp Ser Ser
                 -20                 -15                 -10

ctg atg gca tca ggc act gcc agc cgc tca gag gat gag gag tca ctg  279
Leu Met Ala Ser Gly Thr Ala Ser Arg Ser Glu Asp Glu Glu Ser Leu
            -5                  1               5

gca ggg cag aag cga gcc tcc tcc cag gcc ttg ggc acc atc cct aaa  327
Ala Gly Gln Lys Arg Ala Ser Ser Gln Ala Leu Gly Thr Ile Pro Lys
    10                  15                  20

cgg aga agc tcc tcc agg ttc atc aag agg aag aag ttc     gat gag  375
Arg Arg Ser Ser Ser Arg Phe Ile Lys Arg Lys Lys Phe Asp Asp Glu
25                  30                  35                  40

ctg gtg gag agc agc ctg gca     tct tct acc cgg     aag ggg gcc  423
Leu Val Glu Ser Ser Leu Ala Lys Ser Ser Thr Arg Ala Lys Gly Ala
                45                  50                  55

agt ggg gtg gaa cag ggc gct gtt cgg gta gtg aac act cct cca gtg  471
Ser Gly Val Glu Gln Gly Ala Val Arg Val Val Asn Pro Pro Pro Val
            60                  65                  70

aga aga aga agg tat caa a  490
Arg Arg Arg Arg Tyr Gln
        75



<210> 592 
<211> 537 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 197. .535 

<220> 

<221> sig_peptide 

<222> 197. .304 

<223> Von Heijne matrix 

score 3.79999995231628 

seq WLGCIVFPFSALL/QQ 

<400> 592 

cctttttaaa caatatccat tctcttgatg tctacaatat cacctttctt atagattcgc 60 

atatacatgg ccaaaggaac aactccatgt tttctaaaag gcctagagaa catacatcag 120 

ctcaccagga catgattata gcttctcaag cttatctaaa ataaaagata acatatatat 180 

caacattttt gatgaa atg atg act gaa aaa cat gag gat cac tgt ctc aag 232
Met Met Thr Glu Lys His Glu Asp His Cys Leu Lys
-35 -30 -25




agc tgt agt ggt cac tca tat ata aga aag aat tgg ctt gga tgc att 280
Ser Cys Ser Gly His Ser Tyr Ile Arg Lys Asn Trp Leu Gly Cys Ile
-20 -15 -10






gtc ttc cct ttt tct gct ctt ctg caa caa tct gag att agt gga ... 328
Val Phe Pro Phe Ser Ala Leu Leu Gln Gln Ser Glu Ile Ser Gly Thr
-5 1 5

ttt caa gta act att cca cca gtt ttg ctc ggg tat act tgg agt aat 376
Phe Gln Val Thr Ile Pro Pro Val Leu Leu Gly Tyr Thr Trp Ser Asn
10 15 20












act tat gta ttt cct ... ... gat tcc aat gag ... ... tta aaa gaa 424
Thr Tyr Val Phe Pro ... Glu Asp Ser Asn Glu Gln Asn Leu Lys Glu
25 30 35 40




tgt aca ttc tta aat att ttt gct acc att gaa cct ... ata tca tat 472
Cys Thr Phe Leu Asn Ile Phe Ala Thr Ile Glu Pro Gln Ile Ser Tyr
45 50 55






gtc acc tgt aat cca aca cta gat aag ttt tta gat caa ... gag gtc 520
Val Thr Cys Asn Pro Thr Leu Asp Lys Phe Leu Asp Gln Thr Glu Val
60 65 70








ttg cag aga gca cag at 537
Leu Gln Arg Ala Gln
75



<210> 593
<211> 488
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 79..486

<220>
<221> sig_peptide
<222> 79..201
<223> Von Heijne matrix
score 3.59999990463257
seq LHTSVTLFLLSYC/DC

<400> 593 

agatgattcc ctgattctcc agagagatta cacacttcgt ttgtggctaa ggttactgtg 60 



acccaatgaa agaagaaa atg aaa gcc ata aag aaa agt ctt aca gaa gaa 111
Met Lys Ala Ile Lys Lys Ser Leu Thr Glu Glu
-40 -35




gaa tac ctg tac ctg gac ttt tct cac caa ... gaa gga tgc atc ttt 159
Glu Tyr Leu Tyr Leu Asp Phe Ser His Gln Thr Glu Gly Cys Ile Phe
-30 -25 -20 -15




cct ctt cat aca tct gta act tta ttt ctg tta tct tac tgt gac tgt 207
Pro Leu His Thr Ser Val Thr Leu Phe Leu Leu Ser Tyr Cys Asp Cys
-10 -5 1






aaa atc ttt ... att tgc tta gtt gtc acc ... gag gtg agt aga ... 255
Lys Ile Phe Lys Ile Cys Leu Val Val Thr Lys Glu Val Ser Arg Asp
5 10 15




agt tca cta cta aga gat gac ctg atc ... gat gtt ... ata cag att 303
Ser Ser Leu Leu Arg Asp Asp Leu Ile Gln Asp Val Glu Ile Gln Ile
20 25 30












att tca agg cag gag ctc cca cca ata gtc caa aat tgc tgt ttg cct 351
Ile Ser Arg Gln Glu Leu Pro Pro Ile Val Gln Asn Cys Cys Leu Pro
35 40 45 50




gca gta gta gaa cga tca gac aat ttt tgt aga gca gga ctt gct gtt 399
Ala Val Val Glu Arg Ser Asp Asn Phe Cys Arg Ala Gly Leu Ala Val
55 60 65




gta ttg aga cac ata atc cag aaa tcc tat gaa gca gac ccc tta aag 447
Val Leu Arg His Ile Ile Gln Lys Ser Tyr Glu Ala Asp Pro Leu Lys
70 75 80




aag gaa ctt ttg gaa ctt ctg ggc ttw aaa aga ctt gct tg 488
Lys Glu Leu Leu Glu Leu Leu Gly Xaa Lys Arg Leu Ala
85 90 95











<210> 594
<211> 490
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 80..490

<220>
<221> sig_peptide
<222> 80..127
<223> Von Heijne matrix
score 3.59999990463257
seq FSLLSISGPPISS/SA

<400> 594 

gaatgtttat cctctggaca aaccagccag cctctccaga gcaggcgtgt gatctctgta 60 
cccccgcagt ggtcagaat atg gag aac ttc tca ctc ctc agc atc tct gga 112
Met Glu Asn Phe Ser Leu Leu Ser Ile Ser Gly
cct cca atc tct tcc tcc gcc ctg agt gct ttt ccc gac att atg ttc 160
Pro Pro Ile Ser Ser Ser Ala Leu Ser Ala Phe Pro Asp Ile Met Phe
1
























tct cgt gcc acc agc ctg cca gac att gca aag aca gca gta ccc act 208
Ser Arg Ala Thr Ser Leu Pro Asp Ile Ala Lys Thr Ala Val Pro Thr
15 20 25








gag gca tcc agc cca gct cag gcc ctg cca ccc cag tac caa agc atc 256
Glu Ala Ser Ser Pro Ala Gln Ala Leu Pro Pro Gln Tyr Gln Ser Ile
30 35 40




att gtc agg caa ggg ata cag aac aca gtg ctc tca cca grc wgc agc 304
Ile Val Arg Gln Gly Ile Gln Asn Thr Val Leu Ser Pro Xaa Xaa Ser
45 50 55




ttg ggg gac acc cag cac gga gag aag ctg agg cgg aac tgc act atc 352
Leu Gly Asp Thr Gln His Gly Glu Lys Leu Arg Arg Asn Cys Thr Ile
60 65 70 75




tac cgg ccc tgg ttc tcc ccc tac agc tac ttc gtg tgt gca gac aaa 400
Tyr Arg Pro Trp Phe Ser Pro Tyr Ser Tyr Phe Val Cys Ala Asp Lys
80 85 90




gag agc cag ctg gag gcc tat gac ttc cca gag gtg cag cag gat gag 448
Glu Ser Gln Leu Glu Ala Tyr Asp Phe Pro Glu Val Gln Gln Asp Glu
95 100 105








ggc aag tgg gac aac tgc ctt tct gag gac atg gct gag aac 490
Gly Lys Trp Asp Asn Cys Leu Ser Glu Asp Met Ala Glu Asn
110 115 120











<210> 595 
<211> 495 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 157..495

<400> 595 

attctttcga caggagttga aaccagcacc ttccctttct ctgagtcctg cctccttctg 60 

cagaagggag ctcaaaagaa ctttgttgtt ttgcctttta ctctggggtg aaagcggcag 120

gaggtatgtg agatggtgaa atgatttgct tctgcc atg ctg ggg tca cgg gtg 174
Met Leu Gly Ser Arg Val
1 5



gat cgc cct ... ctc tcg gtg gcc ... tca gta gtt ttg gaa gag gac 222
Asp Arg Pro ... Leu Ser Val Ala Pro Ser Val Val Leu Glu Glu Asp
10 15 20








... ... ctt gtc tct cca ... gtg gac ctg gaa ... gga tgc cgg ctc 270
Gln Val Leu Val Ser Pro Ala Val Asp Leu Glu Xaa Gly Cys Arg Leu
25 30 35




agg gac ttc act gag aaa ata atg aat gtc aaa gga aaa gta att ctg 318
Arg Asp Phe Thr Glu Lys Ile Met Asn Val Lys Gly Lys Val Ile Leu
40 45 50




tca atg ctg gtt gtc tca act gtg atc att gtg ttt tgg gaa ttt atc 366
Ser Met Leu Val Val Ser Thr Val Ile Ile Val Phe Trp Glu Phe Ile
55 60 65 70




aac agc ... gaa ggc tct ttc ttg tgg ata tat cac tca aaa aac cca 414
Asn Ser Thr Glu Gly Ser Phe Leu Trp Ile Tyr His Ser Lys Asn Pro
75 80 85




gaa gtt gat gac agc agt gct cag aag ggc tgg tgg ttt ctg agc tgg 462
Glu Val Asp Asp Ser Ser Ala Gln Lys Gly Trp Trp Phe Leu Ser Trp
90 95 100




ttt aac aat ggg atc cac aat tat ... caa ggg 495
Phe Asn Asn Gly Ile His Asn Tyr Gln Gln Gly
105 110
<210> 596
<211> 939
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 416..937

<400> 596 

aaaaggagta gctattagcc aattcggcag ggcccgcttt ttagaagctt gatttcyttt 60 

gaagatgaaa gactagcgga agctctgcct ctttccccag tggcgaggga actcggggcg 120 

attggctggg aactgtatcc acccaaatgt caccgatttc ttcctatgca ggaaatgagc 180 

agacccatca ataagaaatt tctcagcctg gccgaaaatg gttggcccca cgaagccacg 240 

acaactggag gcaaagaggg ttgctcaacg ccccgcctca ttggaaaacc aaatcagatc 300 

tgggacctat atagcgtggc ggaggcgggg cgatgattgt cgcgctcgca cccactgcag 360 

ctgcgcacag tcgcatttct ttccccgccc ctgagaccct gcagcaccat ctgtc atg 418 

Met 



gcg gct ggg ctg ttt ggt ttg agc gct cgc cgt ctt ttg gcg gca gcg 466
Ala Ala Gly Leu Phe Gly Leu Ser Ala Arg Arg Leu Leu Ala Ala Ala
5 10 15




gcg acg cga ggg ctc ccg gcc gcc cgc gtc cgc tgg gaa tct agc ttc 514
Ala Thr Arg Gly Leu Pro Ala Ala Arg Val Arg Trp Glu Ser Ser Phe
20 25 30












... agg ... gtg gtc gcc ccg tcc gct gtg gcg gga ... cgg ... ... 562
Ser Arg Thr Val Val Ala Pro Ser Ala Val Ala Gly Lys Arg Pro Pro
35 40 45




gaa ccg acc aca ccg tgg caa gag gac cca gaa ccc gag gac gaa aac 610
Glu Pro Thr Thr Pro Trp Gln Glu Asp Pro Glu Pro Glu Asp Glu Asn
50 55 60 65




ttg tat gag aag aac cca gac tcc cat ggt tat gac aag gac ccc gtt 658
Leu Tyr Glu Lys Asn Pro Asp Ser His Gly Tyr Asp Lys Asp Pro Val
70 75 80






ttg gac gtc tgg aac atg cga ctt gtc ttc ttc ttt ggc gtc tcc atc 706
Leu Asp Val Trp Asn Met Arg Leu Val Phe Phe Phe Gly Val Ser Ile
85 90 95




atc ctg gtc ctt ggc agc acc ttt gtg gcc tat ctg cct gac tac agg 754
Ile Leu Val Leu Gly Ser Thr Phe Val Ala Tyr Leu Pro Asp Tyr Arg
100 105 110




atg aaa gag tgg tcc cgc cgc gaa gct gag agg ctt gtg aaa tac cga 802
Met Lys Glu Trp Ser Arg Arg Glu Ala Glu Arg Leu Val Lys Tyr Arg
115 120 125




gag gcc aat ggc ttc cca tca tgg aat cca act gct tcg acc cca gca 850
Glu Ala Asn Gly Phe Pro Ser Trp Asn Pro Thr Ala Ser Thr Pro Ala
130 135 140 145




aga tcc agc tgc cag agg atg agt gac cag ttg cta agt ggg gct caa 898
Arg Ser Ser Cys Gln Arg Met Ser Asp Gln Leu Leu Ser Gly Ala Gln
150 155 160






gaa gca ccg cct tcc cca ccc cct gcc tgc cat tct gac ct 939
Glu Ala Pro Pro Ser Pro Pro Pro Ala Cys His Ser Asp
165 170



<210> 597 
<211> 528 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 51..527



<400> 597 
aagctggggg agctcgtcgc cgccgccggc ggctagcggg cgtccgcgcc atg gag 56
Met Glu
1

cgc tac gcg gcc gcc ttg gag gag gtg gcg gac ggt gcc cgg cag cag 104
Arg Tyr Ala Ala Ala Leu Glu Glu Val Ala Asp Gly Ala Arg Gln Gln
5 10 15

gag cga cac tac cag ttg ctg tcg gcg cta cag agc ctg gtg aag gag 152
Glu Arg His Tyr Gln Leu Leu Ser Ala Leu Gln Ser Leu Val Lys Glu
20 25 30

ttg ccc agc tct ttc cag cag cgc ctg tcc tac acc acg ctc agc gac 200
Leu Pro Ser Ser Phe Gln Gln Arg Leu Ser Tyr Thr Thr Leu Ser Asp
35 40 45 50

ctg gcc ctg gcg ctt ctc gac ggc act gtg ttc gaa atc gtg cag ggg 248
Leu Ala Leu Ala Leu Leu Asp Gly Thr Val Phe Glu Ile Val Gln Gly
55 60 65

cta ctg gag atc cag cac ctc acc gaa aag agc ctg tac aac cag cgc 296
Leu Leu Glu Ile Gln His Leu Thr Glu Lys Ser Leu Tyr Asn Gln Arg
70 75 80

ctg cgc cta cag aac gag cac cga gtg ctc agg cag gcg ctg cgg cag 344
Leu Arg Leu Gln Asn Glu His Arg Val Leu Arg Gln Ala Leu Arg Gln
85 90 95

aag cac cag gaa gcc cag cag gcc tgc cgg ccc cac aac ctg cct gtg 392
Lys His Gln Glu Ala Gln Gln Ala Cys Arg Pro His Asn Leu Pro Val
100 105 110

gtt cag gcg gct cag cag cga gaa cta gag gcc gtg gaa cac cgg atc 440
Val Gln Ala Ala Gln Gln Arg Glu Leu Glu Ala Val Glu His Arg Ile
115 120 125 130

cgt gag gag cag cgg gcg atg gac cat aag atc atc ctg gag ctg gac 488
Arg Glu Glu Gln Arg Ala Met Asp His Lys Ile Ile Leu Glu Leu Asp
135 140 145

cgg aag gtg gct gac cag cag agc aca ctg gag aag gcg g 528
Arg Lys Val Ala Asp Gln Gln Ser Thr Leu Glu Lys Ala
150 155

<210> 598 
<211> 556 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 128..556

<400> 598 

cttctttggc cctgtgacac gtagcaacgg ggctggttca gggtctgaaa cagagtttgg 60
gggttgtttg ggattagtga agctactgcc tttgccgcca gcgcasctca gagtttgatt 120
atttgca atg tca ggc ttt gaa aac tta ... acg gat ttc tac cag aca 169
Met Ser Gly Phe Glu Asn Leu Asn Thr Asp Phe Tyr Gln Thr
1 5 10












agt tac agc atc gat gat cag tca cag cag tcc tat gat tat gga gga 217
Ser Tyr Ser Ile Asp Asp Gln Ser Gln Gln Ser Tyr Asp Tyr Gly Gly
15 20 25 30




agt gga gga ccc tat agc aaa cag tat gct ggc tat gac tat tcg cag 265
Ser Gly Gly Pro Tyr Ser Lys Gln Tyr Ala Gly Tyr Asp Tyr Ser Gln
35 40 45




caa ggc aga ttt gtc cct cca gac atg atg cag cca caa cag cca tac 313
Gln Gly Arg Phe Val Pro Pro Asp Met Met Gln Pro Gln Gln Pro Tyr
50 55 60




acc ggg cag att tac cag cca act cag gca tat act ... gct tca cct 361
Thr Gly Gln Ile Tyr Gln Pro Thr Gln Ala Tyr Thr ... Ala Ser Pro
65 70 75








... ... ttc tat gga ... ... ttt gag ... gag ... ... tta tta gaa 409
Gln Pro Phe Tyr Gly ... ... Phe Glu Asp Glu Pro Pro Leu Leu Glu
80 85 90










gag tta ggt atc aat ttt ... cac atc tgg ... aaa aca cta ... gta 457
Glu Leu Gly Ile Asn Phe Asp His Ile Trp Gln Lys Thr Leu Thr Val
95 100 105 110




tta cat ccg tta aaa gta gca gat ggc agc atc atg aat gaa act gat 505
Leu His Pro Leu Lys Val Ala Asp Gly Ser Ile Met Asn Glu Thr Asp
115 120 125




ttg gca ggt cca atg gtt ttt tgc ctt gct ttt tgg agc aca ttg cta 553
Leu Ala Gly Pro Met Val Phe Cys Leu Ala Phe Trp Ser Thr Leu Leu
130 135 140





ctg 556 



<210> 599 
<211> 564 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 151..564



<400> 599 

ataacaaaac aatccaacta tactgaataa tacagaatca aactgggatc caaaaacaat 60
ttaaaacaaa gttggaaact caaccgaaac aagaacaaat cgaatccgac atgaaacaag 120
tctcacaagt gttcttaagc tgacagaaag atg gtc aaa acc aaa ctt aaa acc 174
Met Val Lys Thr Lys Leu Lys Thr
1 5

gta gta atc gac ata acg gtc ata cca ttc gcc gac gat ttt ttc tgc 222
Val Val Ile Asp Ile Thr Val Ile Pro Phe Ala Asp Asp Phe Phe Cys
10 15 20

cct gga gct cca gcc ggc gta gag ata tca cca gcg gct gat gga cca 270
Pro Gly Ala Pro Ala Gly Val Glu Ile Ser Pro Ala Ala Asp Gly Pro
25 30 35 40

tct cct gaa gga ccc atc gcc gag act gac gga gaa tct gac ggt gaa 318
Ser Pro Glu Gly Pro Ile Ala Glu Pro Asp Gly Glu Ser Asp Gly Glu
45 50 55

gaa gaa gta gag cct ccc gga gcc aac gat gga gaa acg gtg ggt gag 366
Glu Glu Val Glu Pro Pro Gly Ala Asn Asp Gly Glu Thr Val Gly Glu
60 65 70

ttt gaa aca ggg gat gaa gat tta ggt gcc att gat cca ggt ggt gaa 414
Phe Glu Thr Gly Asp Glu Asp Leu Gly Ala Ile Asp Pro Gly Gly Glu
75 80 85

gtc atc gga gca gaa gac gga gga ata gtt gaa gat gat tta ggt gcc 462
Val Ile Gly Ala Glu Asp Gly Gly Ile Val Glu Asp Asp Leu Gly Ala
90 95 100

att ggt gcc ggc ggt gaa gtc atc gga gca gag gat ggg gaa aca ggg 510
Ile Gly Ala Gly Gly Glu Val Ile Gly Ala Glu Asp Gly Glu Thr Gly
105 110 115 120

gac gtg gat tta ggt ggt gcc ggt ggt gaa gtc gtc gga gaa aca ggg 558
Asp Val Asp Leu Gly Gly Ala Gly Gly Glu Val Val Gly Glu Thr Gly
125 130 135

gag ctc 564
Glu Leu



<210> 600
<211> 421
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 198..419

<220>
<221> misc_feature
<222> 275..278
<223> n=a, g, c or t

<400> 600 

cttttacttc aggcgaacct gaactcagtt aatgcaattt tgaagcatgc cctgaataga 60 
ttcagtcatt atcaagtcaa actaaaaacg gtgaaaggtt gcgactatta ccaaatagga 120 
aaaacctgaa gacataagaa ctacacatga ggaatatgtc atttagcact ttcacttttt 180 
gatctccaca gaagaca atg aga agt cat acc ata aca atg acg aca act 230
Met Arg Ser His Thr Ile Thr Met Thr Thr Thr
1 5 10

tca gtc agc agc tgg cct tac tcc tcc cac aga atg cgc ttt awn nnn 278
Ser Val Ser Ser Trp Pro Tyr Ser Ser His Arg Met Arg Phe Xaa Xaa
15 20 25

hat cat agc gac caa ccg cca caa aac ttc tca gca aca cca aat gtt 326
Xaa His Ser Asp Gln Pro Pro Gln Asn Phe Ser Ala Thr Pro Asn Val
30 35 40

act acc tgt ccc atg gat gaa aaa ttg khs act act gtg tta acc aca 374
Thr Thr Cys Pro Met Asp Glu Lys Leu Xaa Thr Thr Val Leu Thr Thr
45 50 55

tcc tac tct gtt att ttc atc gtg gga ctg gtt ggg aac ata atc gc 421
Ser Tyr Ser Val Ile Phe Ile Val Gly Leu Val Gly Asn Ile Ile
60 65 70

<210> 601 
<211> 529 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 124..528

<400> 601 

ggctgcgcct agtgggccgt tgccttacag ttgctgagag gaggcgagag gcgggggcgc 60 
tagggccgag atcatgtctg actgggagag gtttccttgg cagcagagga cgctaggttt 120 
ggg atg aaa gaa gct ggg cag atg saa aat ctg gag agc gcg agg gcc 168
Met Lys Glu Ala Gly Gln Met Xaa Asn Leu Glu Ser Ala Arg Ala
1 5 10 15

ggg cgg tca gtc agc acc cag act ggc agc atg acc ggt cag ata cca 216
Gly Arg Ser Val Ser Thr Gln Thr Gly Ser Met Thr Gly Gln Ile Pro
20 25 30

agg ctt tct aaa gtc aac ctt ttc act ctg ctc agc ctc tgg atg gag 264
Arg Leu Ser Lys Val Asn Leu Phe Thr Leu Leu Ser Leu Trp Met Glu



ctc ttt ... gca gaa gcc ... cgg ... ... tct cag aaa aat ... grg 312
Leu Phe Pro Ala Glu Ala Gln Arg Gln Lys Ser Gln Lys Asn Glu Xaa
50 55 60




gga aag cat gga ccc tta gga gat aat gaa gag agg acc aga gta tct 360
Gly Lys His Gly Pro Leu Gly Asp Asn Glu Glu Arg Thr Arg Val Ser
65 70 75




act gac aaa aga cag gta aag aga act ggt ctt gtg gtg gtg ... aac 408
Thr Asp Lys Arg Gln Val Lys Arg Thr Gly Leu Val Val Val Lys Asn
80 85 90 95




atg aaa att gtt ggt ctc cac tgt tct agt gaa gat tta cat gcc ggg 456
Met Lys Ile Val Gly Leu His Cys Ser Ser Glu Asp Leu His Ala Gly
100 105 110




cag att gct ctt att aaa cat ggg tca agg ctg aaa aac tgt gat cct 504
Gln Ile Ala Leu Ile Lys His Gly Ser Arg Leu Lys Asn Cys Asp Pro
115 120 125




tat ttt tcc aga ... cmt gtt ... 529
Tyr Phe Ser Arg Lys Xaa Val Leu
130 135



<210> 602 
<211> 208 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 24..206



<400> 602 

ccttccctgc ggccgcgcag agc atg ccg gga gcc gcc ggc cgc cgg tgt ccg 53
Met Pro Gly Ala Ala Gly Arg Arg Cys Pro
1 5 10



gac cgc tcg ccc ccg ttt gga ccc ... ttc ggt tct tct ggg gtg ttg 101
Asp Arg Ser Pro Pro Phe Gly Pro Asp Phe Gly Ser Ser Gly Val Leu
15 20 25




atg ctc cta aag ccc gag agc acg tgt ... ... ... agc ... ... ... 149
Met Leu Leu Lys Pro Glu Ser Thr Cys Pro Asp Pro Ser Leu Tyr Asp
30 35 40




gcd baa ctc tgc ccg gtc cca gaa cca aag cca tgc cgg ggt gtg gcc 197
Ala Xaa Leu Cys Pro Val Pro Glu Pro Lys Pro Cys Arg Gly Val Ala
45 50 55




tct gac ccc ac 208
Ser Asp Pro
60



























<210> 603 
<211> 587 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 198..587



<400> 603 

cttttacttc aggcgaacct gaactcagtt aatgcaattt tgaagcatgc cctgaataga 60 
ttcagtcatt atcaagtcaa actaaaaacg gtgaaaggtt gcgactatta ccaaatagga 120 
aaaacctgaa gacataagaa ctacacatga ggaatatgtc atttagcact ttcacttttt 180 
gatctccaca gaagaca atg aga agt cat acc ata aca atg acg aca act 230
Met Arg Ser His Thr Ile Thr Met Thr Thr Thr
1 5 10



tca gtc agc agc tgg cct tac tcc tcc cac aga atg cgc ttt ata acc 278
Ser Val Ser Ser Trp Pro Tyr Ser Ser His Arg Met Arg Phe Ile Thr
15 20 25




aat ... ... ... ... ... cca caa aac ttc tca gca aca cca aat gtt 326
Asn His Ser Asp Gln Pro Pro Gln Asn Phe Ser Ala Thr Pro Asn Val
30 35 40




act acc tgt ... atg ... ... ... ttg cta tct act gtg tta ... ... 374
Thr Thr Cys Pro Met Asp Glu Lys Leu Leu Ser Thr Val Leu Thr Thr
45 50 55




tcc tac tct gtt att ttc atc gtg gga ctg gtt ggg aac ata atc gcc 422
Ser Tyr Ser Val Ile Phe Ile Val Gly Leu Val Gly Asn Ile Ile Ala
60 65 70 75




ctc tat gta ttt ctg ggt att cac cgt aaa aga aat tcc att ... att 470
Leu Tyr Val Phe Leu Gly Ile His Arg Lys Arg Asn Ser Ile Gln Ile
80 85 90




tat cta ctt aac gta gcc att gca gac ctc cta ctc atc ... tgc ctc 518
Tyr Leu Leu Asn Val Ala Ile Ala Asp Leu Leu Leu Ile Phe Cys Leu
95 100 105
cct ttc cga ata atg tat cat att aac caa aac aag tgg aca cta ggt 566
Pro Phe Arg Ile Met Tyr His Ile Asn Gln Asn Lys Trp Thr Leu Gly
110 115 120

gtg att ytg tgc aag gtt gtg 587
Val Ile Leu Cys Lys Val Val
125 130



<210> 604 
<211> 528 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 327..527



<400> 604 

acaagaaaag aacatggtct agactgaagt accaactaaa tcatctcctt tcaaattatc 60 

accgacacca tcatggattc aagcaccgca cacagtccgg tgtttctggt atttcctcca 120 

gaaatcactg cttcagaata tgagtccaca gaactttcag ccacgacctt ttcaactcaa 180 

agccccttgc aaaaattatt tgctagaaaa atgaaaatct taggggatat ccattctggg 240 

gctctgtttt gttcattaat tctggagcct tcctaattgc agtgaaaaga aaaaccacag 300 

aaactctgat aatattgagc cgaata atg aat ttt ctt agt gcc ctg gga gca 353 

Met Asn Phe Leu Ser Ala Leu Gly Ala 

1 5 



ata gct gga atc att ctc ctc aca ttt ggt ttc atc cta gat caa aac 401
Ile Ala Gly Ile Ile Leu Leu Thr Phe Gly Phe Ile Leu Asp Gln Asn
10 15 20 25




tac att tgt ggt tat tct cac caa aat agt cag tgt aag gct gtt act 449
Tyr Ile Cys Gly Tyr Ser His Gln Asn Ser Gln Cys Lys Ala Val Thr
30 35 40






gtc ctg ttc ttg gga att ttg att aca ttg atg act ttc agc att att 497
Val Leu Phe Leu Gly Ile Leu Ile Thr Leu Met Thr Phe Ser Ile Ile
45 50 55




gaa tta ttc att tct ctg ctt tct caa ttt t 528
Glu Leu Phe Ile Ser Leu Leu Ser Gln Phe
60 65



<210> 605 
<211> 227 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 62..226



<400> 605 

atttccggtt ccggtgtcag ttcgaggcgc cgccgccgcc gccgcagccg ccggagccgc 60 
a atg cct aaa gga gga aga aag gga ggc cac aaa ggc cgg gcg agg cag 109
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg Gln
1 5 10 15

tat aca agc cct gag gag atc gac gcg cas tgc agg ctg aga agc aga 157
Tyr Thr Ser Pro Glu Glu Ile Asp Ala Xaa Cys Arg Leu Arg Ser Arg
20 25 30

agg cca ggg aag aag agg agc aaa aaa ggt gga gat ggg gct gca ggt 205
Arg Pro Gly Lys Lys Arg Ser Lys Lys Gly Gly Asp Gly Ala Ala Gly
35 40 45

gac ccc aaa aaa aaa aaa aaa a 227
Asp Pro Lys Lys Lys Lys Lys
50 55



<210> 606 
<211> 228
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 63..227



<400> 606 

agaacatcct aatcrwaaaa gttcgaggcg ccgccgccgc cgccgcagcc gccggagccg 60 
ca atg cct aaa gga gga aga aag gga ggc cac aaa ggc cgg gcg agg 107
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg
1 5 10 15

cag tat aca agc cct gag gag atc gac gcg cas tgc agg ctg aga agc 155
Gln Tyr Thr Ser Pro Glu Glu Ile Asp Ala Xaa Cys Arg Leu Arg Ser
20 25 30

aga agg cca ggg aag aag agg agc aaa aaa ggt gga gat ggg gct gca 203
Arg Arg Pro Gly Lys Lys Arg Ser Lys Lys Gly Gly Asp Gly Ala Ala
35 40 45

ggt gac ccc aaa aaa aaa aaa aaa a 228
Gly Asp Pro Lys Lys Lys Lys Lys
50 55



<210> 607 
<211> 378 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 119..376

<220>
<221> misc_feature
<222> 143,318,332
<223> n=a, g, c or t



<400> 607 

gaaagacgga gagaggaaaa gggacaaggg caaggcaaga gagaggcgtc gtcgtttgat
ctgctgccgg agtgcgaaga ggaagaggaa gaggaagagg agcgccggag ttgtaaag
atg tts ctg gtg gac tgg ttc tat ngg gtg ctg tca tcg ctt ggg ctg
Met Xaa Leu Val Asp Trp Phe Tyr Xaa Val Leu Ser Ser Leu Gly Leu
1 5 10 15

tgg cag aag gag gct aag atc ctc ttc ctt ggc ctc gac aac gcc ggc
Trp Gln Lys Glu Ala Lys Ile Leu Phe Leu Gly Leu Asp Asn Ala Gly
20 25 30

aag acc acc ctc ctc cac atg ctg aag gac gag cgg ctc gta cag cac
Lys Thr Thr Leu Leu His Met Leu Lys Asp Glu Arg Leu Val Gln His
35 40 45

cag cca acg cag tac ccc acg tca gaa gag ttg agc atc ggc agg atc
Gln Pro Thr Gln Tyr Pro Thr Ser Glu Glu Leu Ser Ile Gly Arg Ile
50 55 60

aag ttc ang gcg ttc gac ctt ngg ggc cac cag atc gcc cgc cgc gtc
Lys Phe Xaa Ala Phe Asp Leu Xaa Gly His Gln Ile Ala Arg Arg Val
65 70 75 80

tgg aag gac tac tmc gcc aa
Trp Lys Asp Tyr Xaa Ala



<210> 608 
<211> 455 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 63..455

<400> 608 

attsccacct gtgatgctca gagagaaacc atgagccctg aaaaacacca tattgttttg 60 

tg atg tca cct ctg gaa tgt tct gag tgt ttt ggt gac caa ctt ctg 107
Met Ser Pro Leu Glu Cys Ser Glu Cys Phe Gly Asp Gln Leu Leu
1 5 10 15




cat agg acc tat acc tgg caa ctc aca ttg cac tca agg cca aat tat 155
His Arg Thr Tyr Thr Trp Gln Leu Thr Leu His Ser Arg Pro Asn Tyr
20 25 30




aca aga aaa aga gat acc aga tct gaa agc cta gaa att cca atc agt 203
Thr Arg Lys Arg Asp Thr Arg Ser Glu Ser Leu Glu Ile Pro Ile Ser
35 40 45




gtg gtt cta cct cag agg ggc aca gcc gaa ccc ttc ccg agg ctc cac 251
Val Val Leu Pro Gln Arg Gly Thr Ala Glu Pro Phe Pro Arg Leu His
50 55 60




aac ttg tac agc acc cct cgc tgc gcg cag cag gcc gcc ctg ccc cgg 299
Asn Leu Tyr Ser Thr Pro Arg Cys Ala Gln Gln Ala Ala Leu Pro Arg
65 70 75










ctg agc cgc agg atg gcg agc cag cac tcc tat cca ctg aac cgc ttc 347
Leu Ser Arg Arg Met Ala Ser Gln His Ser Tyr Pro Leu Asn Arg Phe
80 85 90 95




tcc tcc gtg cct tta gac ccc atg gag cgc ccc atg tcc cag gcc gac 395
Ser Ser Val Pro Leu Asp Pro Met Glu Arg Pro Met Ser Gln Ala Asp
100 105 110




ctg gag ctg gac tac aac ccg ccg cgg gtg cag ctc agc gac gag atg 443
Leu Glu Leu Asp Tyr Asn Pro Pro Arg Val Gln Leu Ser Asp Glu Met
115 120 125






ttc ktg ttc car 455
Phe Xaa Phe Gln
130

























<210> 609 
<211> 366 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 68..364

<400> 609 

gtacagaaag attggtggac aggagcagcg gccggtgggiti gagggcgctc ggcggcggcc 60
tgcggcc atg gcc acc gtg atg gca gcg acg gcg gcg gag cgg gcg gtg 109
Met Ala Thr Val Met Ala Ala Thr Ala Ala Glu Arg Ala Val
1 5 10












ctg gag gag gag ttc cgc tgg ctg ctg cac gac gak ktg cac gct gtg 157
Leu Glu Glu Glu Phe Arg Trp Leu Leu His Asp Xaa Xaa His Ala Val
15 20 25 30




ttg wwg cag ctg cag gac atc ctc aag gag gcc tct ctg cgc ttc act 205
Leu Xaa Gln Leu Gln Asp Ile Leu Lys Glu Ala Ser Leu Arg Phe Thr
35 40 45




ctg ccg ggc tcc ggc act gag ggg ccc gcc aag caa gag aac ttc atc 253
Leu Pro Gly Ser Gly Thr Glu Gly Pro Ala Lys Gln Glu Asn Phe Ile
50 55 60




eta 


ggc 


age 


tgt 


ggc 


aca 


gac 


eak 


gtg 


aag 


ggt 


gtg 




act 


ctg 


eak 


301 


Leu 


Gly 


Ser 
65 


Cys 


Gly 


Thr 


Asp 


Xaa 
70 


Val 


Lys 


Gly 


val 


Leu 
75 


Thr 


Leu 


Xaa 




ggg 












geg 


gat 


gtg 


aac 


ctg 


aag 


atg 




egg 




349 


Gly 


Asp 


Ala 


Leu 


Ser 


Gin 


Ala 


Asp 


Val 






Lys 


Met 


Pro 
<210> 610
<211> 498 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 209..496

<400> 610 

atgatgccag tgaatgcgta gtcccagcct cagcacaggg gagccacctt gaagctctca 60 

aatatcactg ttgtgaatac agagagggaa aaccaactgt aacgtgccac ccaaataaag 120 

ctcattacta gtcctgtcca gcaacgtgcc tctcctggcc ctagagttct tggaaatagc 180 

ccaggccaaa gagaaggcct ttctcccc atg gtc agc cac acg ttc cac atg 232
Met Val Ser His Thr Phe His Met

cgc aca gag gag tct gat gcc tca cag gag ggc gat gac cta ccc aag 280
Arg Thr Glu Glu Ser Asp Ala Ser Gln Glu Gly Asp Asp Leu Pro Lys
10 15 20

tcc tca gca aac acc agc cat ccc aag cag gat gac agc ccc aag tcc 328
Ser Ser Ala Asn Thr Ser His Pro Lys Gln Asp Asp Ser Pro Lys Ser
25 30 35 40

tca gaa gaa acc atc cag ccc aag gag ggt gac atc ccc aag gcc cca 376
Ser Glu Glu Thr Ile Gln Pro Lys Glu Gly Asp Ile Pro Lys Ala Pro
45 50 55

gaa gaa acc atc caa tcc aag aag gag gac ctc ccc aag tcc tca gaa 424
Glu Glu Thr Ile Gln Ser Lys Lys Glu Asp Leu Pro Lys Ser Ser Glu
60 65 70

aaa gcc atc cag ccc aaa gag agt aac atc ccc aag tcc tca gca aaa 472
Lys Ala Ile Gln Pro Lys Glu Ser Asn Ile Pro Lys Ser Ser Ala Lys
75 80 85






ate 






aag 


ctg 




ata 


tt 












498 


Pro 


He 
90 


Gin 


Pro 


Lys 


Leu 


Ala 
95 


He 

















<210> 611 
<211> 504 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 214..504

<400> 611 

aagacagaca cggaagtgct gggaggcgcc gggagcccgt tcggttgcgg gtgtctctgg 60
ccctgcggtc agccctggga acgtcccgga gagctagatt cctagaggcc cgattccgct 120
agcccggaac agacaaagcc agcgctcccg cccgctcccc gacttaggat ccgatgccgg 180
cagcgtcctg gggcccccgt agcgggctgg acc atg agc ctg ctg gac ggc ctc 234
Met Ser Leu Leu Asp Gly Leu
1 5

gct tcc tcg ccg cgg gct ccg ctg cag tcc agc aag gcc agg atg aaa 282
Ala Ser Ser Pro Arg Ala Pro Leu Gln Ser Ser Lys Ala Arg Met Lys
10 15 20

aag ctc ccg aag aag agc cag aat gag aag tac cgg ctg aag tac ctg 330
Lys Leu Pro Lys Lys Ser Gln Asn Glu Lys Tyr Arg Leu Lys Tyr Leu
25 30 35

cgg ctg cgc aaa gcg gcc aag gcc acg gtg ttt gaa aat gct gct att 378
Arg Leu Arg Lys Ala Ala Lys Ala Thr Val Phe Glu Asn Ala Ala Ile
40 45 50 55

tgt gat gaa att gct cgt ctt gag gaa aaa ttt ctt aaa gca aaa gaa 426
Cys Asp Glu Ile Ala Arg Leu Glu Glu Lys Phe Leu Lys Ala Lys Glu
60 65 70

gaa aga agg tac ttg cta aag aag ctc ctc cag ctt cag gct cta act 474
Glu Arg Arg Tyr Leu Leu Lys Lys Leu Leu Gln Leu Gln Ala Leu Thr
75 80 85

gaa ggg gta agt aca ggc mtg cag ctc ctt 504
Glu Gly Val Ser Thr Gly Xaa Gln Leu Leu
90 95



<210> 612 
<211> 483 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 161..481



<400> 612 

gtgggcggtt cggcggctct gggcgctgtt gtttggtctt taggcctgcg gaggggcgtt 60
atctggaggg ccgcgggtgc aggccgcagt gacagggccg ctcgccccgc tagtcctgcc 120
tgtctcccgg tgcagctgtg ttcgcggcct gcaggccaac atg gcg cag gag gtg 175
Met Ala Gln Glu Val
1 5

tcg gag tac ctg agc cag aac ccg cgg gtg gca gcc tgg gtg gag gcg 223
Ser Glu Tyr Leu Ser Gln Asn Pro Arg Val Ala Ala Trp Val Glu Ala
10 15 20

ctg cgc tgc gac ggc gag act gac aaa cac tgg cgc cac cgc cgg gat 271
Leu Arg Cys Asp Gly Glu Thr Asp Lys His Trp Arg His Arg Arg Asp
25 30 35

ttt ttg ctt cgc aac gcc ggg gac ctg gcc ccc gct ggc ggc gct gcc 319
Phe Leu Leu Arg Asn Ala Gly Asp Leu Ala Pro Ala Gly Gly Ala Ala
40 45 50






tee 


get 


age 


acg 


gat 


gaa 






gcc gag 


age ggg ace cga 


aac 


367 


Ser 


Ala 


Ser 


Thr 


Asp 


Glu 


Ala Ala 


Asp 


Ala Glu 


Ser Gly Thr Arg 


Asn 






55 










60 






65 






cgg cag ctg cag cag ctc atc tcc ttt tcc atg gcc tgg gcg aac cac 415
Arg Gln Leu Gln Gln Leu Ile Ser Phe Ser Met Ala Trp Ala Asn His
70 75 80 85

gtc ttc ctc ggg tgc cga tac cct caa aaa gtt atg gat aaa ata ctt 463
Val Phe Leu Gly Cys Arg Tyr Pro Gln Lys Val Met Asp Lys Ile Leu
90 95 100

agt atg gct gaa ggc atc aa 483
Ser Met Ala Glu Gly Ile
105



<210> 613 
<211> 473 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 277..471



<400> 613 

aaactgtcct tgccaccaca attatttaat ggacccaaca gaaagtaacc ccggaaatta
ggacacctca tcccaaaaga cctttaaata ggggaagtcc acttgtgcac ggctgctcct
tgctatagaa gacctgggac agaggactgc tgtctgccct ctctggtcac cctgcctagc
tagaggatct gtgaccccag ccatgaggac cctcgccatc cttgctgcsa ttctcctggt
ggccctgcag gccagagcca ctccaggcaa gagctg atg agg ttg ctg cag ccc 294
Met Arg Leu Leu Gln Pro
1 5

cgg agc aga ttg cag cgg aca tcc cag aag tgg ttg ttt csc ttg cat 342
Arg Ser Arg Leu Gln Arg Thr Ser Gln Lys Trp Leu Phe Xaa Leu His
10 15 20

ggc gac gaa agc ttg gct cca aag cat cca ggc tca agg aaa aac atg 390
Gly Asp Glu Ser Leu Ala Pro Lys His Pro Gly Ser Arg Lys Asn Met
25 30 35

gac tgc tat tgg cag aat acc agc gtg cat tgc agg aga acg tcg cta 438
Asp Cys Tyr Trp Gln Asn Thr Ser Val His Cys Arg Arg Thr Ser Leu
40 45 50

tgg aac ctg cat cta cca ggg aag act ctg ggc at 473
Trp Asn Leu His Leu Pro Gly Lys Thr Leu Gly
55 60 65



<210> 614 
<211> 522 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 279..521



<400> 614 

attctyaatt gsgccmagct tccggcctgg ccacccgcca gcaggtctca cccggttcgc 60
atcagtcccg ggcccccggc gcccctcagt agtagcaaat tatcgcatcc ctgagcttag 120
gatccccctt tagtttcaca tatgccccca cccccgtttc cactttgatg gcggcttcca 180
aggagcatcc cttagtaaga accgtcgcaa atatcgtgat ttggatccac caaggcttgg 240
ggcgagccgt cgaatgtgtg tggtagggga gcgggctg atg tta gca cgt ggt ctt 296
Met Leu Ala Arg Gly Leu
1 5

gtt arg scy tgr aat kgg ggr aat agg ags cck gtg ttc acg att agt 344
Val Xaa Xaa Xaa Asn Xaa Gly Asn Arg Xaa Pro Val Phe Thr Ile Ser
10 15 20

ttc cac cac cta cca act tgg aac ctt tac atc tct cca agc ctg agt 392
Phe His His Leu Pro Thr Trp Asn Leu Tyr Ile Ser Pro Ser Leu Ser
25 30 35

ttc ctt atc tgt gag aca gct gcg aat acc ggt ttc cag aga gta gtt 440
Phe Leu Ile Cys Glu Thr Ala Ala Asn Thr Gly Phe Gln Arg Val Val
40 45 50

aag aac cct tgt gaa cac ctt ctt cag ctt cct tcg acg caa aac aga 488
Lys Asn Pro Cys Glu His Leu Leu Gln Leu Pro Ser Thr Gln Asn Arg
55 60 65 70

ctt ttt cat tgg agg aga aga agg gat ggc aga g 522
Leu Phe His Trp Arg Arg Arg Arg Asp Gly Arg
75 80



<210> 615 
<211> 322 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 171..320



<400> 615 

agtggttcgg ccgcggcgac cccactccgg cggcgcgtcc cccgagcttg gtacggctca
gcccgtctcc cccgaagccg cgcgcccgcg cccgcgcccc tcagtcggtg gagcccgcag
ccccccttgt ggcccgcggc agctccccgc ccgctcggcc cgcgcccgcc atg gtc

Met Val
cgt ccg cgc cgt gcc
Arg Pro Arg Arg Ala
5

ggt cgc ggc cgy cct
Gly Arg Gly Arg Pro
20

cgc tcc tgg cct gcc
Arg Ser Trp Pro Ala 

35 

gg 

<210> 616 
<211> 524 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 269..523

<400> 616 

attttatact aatggagaga gagagaaaga catagaagtc tcactgactt gcacagttta 60 
cttcccactg actagatctt ggataaggac ttttttgtct tattcataaa aggcgattgt 120 
gttcaattat gttatgctca gaggtggaat atataatact gacttctatt gcttaaaaaa 180 
ggtgattatg aatggaggaa gaagtctgtg ggacaggaaa tttaaaataa aattttagag 240 
ttagatagat tatgagaaga aaaaatca atg cca agt ttt gtt gac cgc atc 292
Met Pro Ser Phe Val Asp Arg Ile
1 5



ttt 


ggt 


ggt gaa 






agt 


atg 


ate 


atg 


tat gat caa 


tge 


aga 


act 


340 


Phe 


Gly 
10 


Gly Glu 


Leu 


Thr 


Ser 
15 


Met 


He 


Met 


Tyr Asp Gin 
20 


Cys 


Arg 


Thr 




gtc tcc ttg gtt cat gaa tct ttc ctt gat ttg tcc ctc cca gtt tta 388
Val Ser Leu Val His Glu Ser Phe Leu Asp Leu Ser Leu Pro Val Leu
25 30 35 40

gat gat cag agt ggt aag aaa agt gta aat gat aaa aat ctg aaa aag 436
Asp Asp Gln Ser Gly Lys Lys Ser Val Asn Asp Lys Asn Leu Lys Lys
45 50 55

aca gtg gag gat gaa gat caa gat agt gag gaa gaa aaa gat aac gac 484
Thr Val Glu Asp Glu Asp Gln Asp Ser Glu Glu Glu Lys Asp Asn Asp
60 65 70

agt tac ata aaa gag aga agt gat att cct tct gga aca a 524
Ser Tyr Ile Lys Glu Arg Ser Asp Ile Pro Ser Gly Thr
75 80 85











<210> 617 
<211> 274 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 75..272

<400> 617 

atacacttct tggctgtgtg cgctcagcag gacgtgggag gctccggctt caagacactc 60
atccaagagg aagg atg gcc agt atc ttt tct aag ttg cta act ggc cgc 110
Met Ala Ser Ile Phe Ser Lys Leu Leu Thr Gly Arg
1 5 10

aat gct tct ctg ctg ttt gct acc atg ggc acc agt gtc ctg acc acc 158
Asn Ala Ser Leu Leu Phe Ala Thr Met Gly Thr Ser Val Leu Thr Thr
15 20 25

ggg tac ctg ctg aac cgg cag aaa gtg tgt gcc gag gtc cgg gag cag 206



1

ccg tac cgc tcc ggc gcc ggg ggc ccc ctm ggg 224
Pro Tyr Arg Ser Gly Ala Gly Gly Pro Leu Gly

10 15
ccg cgg ccc ctc gtg gtg cgc gcc gtc cgc tcg 272
Pro Arg Pro Leu Val Val Arg Ala Val Arg Ser

25 30
agc ccy ccg agg ccc gca gcc tyc gcg gat ccg 320
Ser Pro Pro Arg Pro Ala Ala Xaa Ala Asp Pro
40 45 50

322
Gly Tyr Leu Leu Asn Arg Gln Lys Val Cys Ala Glu Val Arg Glu Gln
30 35 40




cct 


agg 


eta 


ttt 


cct 




age 


gca 


gac 


tac cca 


gac 


ctg 


cgc 


aas 


aca 


254 


Pro 


Arg 


Leu 


Phe 


Pro 


Pro 


Ser 


Ala 


Asp 


Tyr Pro 


Asp 


Leu 


Arg 


Xaa 


Thr 




45 










50 








55 










60 




aca 


act 




tgg 




agt 


gc 


















274 


Thr 


Thr 


Ala 


Trp 


Pro 
65 


Ser 























<210> 618 
<211> 243 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 21..242

<400> 618 

agggaagagg ctgtcgggaa atg gcg gcc gcg gcc ggg ctg ggg ctt cag cgg 53
Met Ala Ala Ala Ala Gly Leu Gly Leu Gln Arg
1 5 10

gag gca gca gag ggg aag tgg tca gcg tgg cga atg acg gaa gaa act 101
Glu Ala Ala Glu Gly Lys Trp Ser Ala Trp Arg Met Thr Glu Glu Thr
15 20 25

cgc att gtc tac tgg atc aag gac aga cag ctc acc aac cgt gac agc 149
Arg Ile Val Tyr Trp Ile Lys Asp Arg Gln Leu Thr Asn Arg Asp Ser
30 35 40

acc ata ctg gaa ctt caa aaa gtt ctg aaa aca tgt tgg ttc gtt tgt 197
Thr Ile Leu Glu Leu Gln Lys Val Leu Lys Thr Cys Trp Phe Val Cys
45 50 55

tgg agt att ttt caa ctt aaa ttc ctt ttc ctt ttt ttt ttt ttt k 243
Trp Ser Ile Phe Gln Leu Lys Phe Leu Phe Leu Phe Phe Phe Phe
60 65 70













<210> 619 
<211> 499 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 295..498

<400> 619 

aggaagttga tcaatcccga ggctgctgag agacggtggc gcgattggga cagtcgccag 60
ggatggctga gcgtgaagat gcagcgggtg tccgggctgc tctcctggac gctgagcaga 120
gtcctgtggc tctccggcct ctctgagccg ggagctgccc ggcagccccg gatcatggaa 180
gagaaagcct agaggtttat gatttgatta gaactatccg ggacccagaa aagcccaata 240
ctttagaaga actggaagtg gtctcggaaa gttgtgtgga agttcaggag ataa atg 297
Met
1

aag aag aat atc tgg tta tta tca ggt tca cgc caa cag tac ctc att 345
Lys Lys Asn Ile Trp Leu Leu Ser Gly Ser Arg Gln Gln Tyr Leu Ile
5 10 15

gct ctt tgg cga ctc tta ttg ttg gaa atc tac att tct gaa gga acc 393
Ala Leu Trp Arg Leu Leu Leu Leu Glu Ile Tyr Ile Ser Glu Gly Thr
20 25 30












tea 


aca 


gaa 


gaa 


gac 


ate 


aat 


aag 




aat 


gac 


aaa 


gag 




441 


His 


Ser 
35 


Thr 


Glu 


Glu 


Asp 


He 
40 


Asn 


Lys 


Gin He 


Asn 
45 


Asp 


Lys 


Glu 


Arg 




gtg 


gca 




gca 


atg gaa 




ccc 


aac 


tta egg 


gaa 


att 


gtg 


gac 


agt 


489 


Val 


Ala 


Ala 


Ala 


Met 


Glu 




Pro 




Leu Arg 


Glu 


He 


Val 




Ser 
50 55 60 65

gtg tcc ttg a 499

Val Ser Leu

<210> 620
<211> 771 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 
<222> 613..771



<400> 620 

caggttcagg gcacagaacc caggcttgta ccatggtggt gggagaaaat gaccactggc 60 
caagaggact gctgacctgt gcaccaggct agtacttatg actacaaatt cttactgctt 120 
ctctaatcaa ctctgaggga agagggcatc tgatcattac aaaagggagg gcttataagt 180 
gatctcccaa gaaggcagtg atctgctagt gcctttggct ctgtacctct gctgggcatc 240 
tctccaaggt ctaaggtaac atattaaatg tttttgtcag ctaatgcagg ctcagtgact 300 
ttaagtctgt aagttaccca ggaagaagga ttataggaaa aataactcag taagtttaaa 360 
accaaacaca tttccattta gtgacaggaa tttaagcaag gacctgaagt agaatcaact 420 
gattcacaca gtagtaaata caaagtagaa caatgatctt ggcttcgctg tctggttcag 480 
tggtctgctg gaatgcaata cacaagttaa gtcacactgc agactgtttt ctagctgtgg 540 
ccgctggatg ccacttctag catagtagaa ctatgttagg aggaatggga aaagtgagca 600 
ccacttctca cc atg ttc ccc cct cct gct gcc agt ctc tgc tcc cat gtt 651
Met Phe Pro Pro Pro Ala Ala Ser Leu Cys Ser His Val
1 5 10

gga tgc agc aga gat cac cca cca gtt ggc cca gga cag acc aat agg 699
Gly Cys Ser Arg Asp His Pro Pro Val Gly Pro Gly Gln Thr Asn Arg
15 20 25

aag ggt cca atc act cta act aca gcc gaa ctc acc tcc aca aca gtc 747
Lys Gly Pro Ile Thr Leu Thr Thr Ala Glu Leu Thr Ser Thr Thr Val
30 35 40 45

tct gtg gct cta gcc tgg act cct 771
Ser Val Ala Leu Ala Trp Thr Pro



<210> 621 
<211> 658 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 440..658



<400> 621 

gctttgtaat tggtgcagct ctgagaacac acgcatgcat tccaggcggt gctgctgggg 60
ctcggcgggg ccgtgtctgg tctgcaaggc gcgcgggctc gtggggggtt ggctggggag 120
cccacgctgc ctggcgactc gggccaccga atgtgagacc gagtcccttt atgtcaccag 180
cgcacacgct gatttgaacc ctgcttcgac gtgtgtgtca tggcttaaaa atagctgcta 240
atctgtcaac ctgtcttggg cagaaacagc ggcggcgaca gagcaggagc gtcatggccg 300
tggcgctgtc tgcgcsggcg atccgccttt cggactgagg cccagcgcag cgcttgcaaa 360
gagcagcagc tacctggcaa ctgaacccat catcaccaca gccactcctg cagctgccac 420
ggtttctgcc acctctaag atg tgc cct ggt aac tgg ctt tgg gct tct atg 472
Met Cys Pro Gly Asn Trp Leu Trp Ala Ser Met
1 5 10

act ttt atg gcc cgc ttc tcc cgg agt agc tca agg tct cct gtt cga 520
Thr Phe Met Ala Arg Phe Ser Arg Ser Ser Ser Arg Ser Pro Val Arg
15 20 25

act cga ggg acc ctg gag gag atg cca acc gtt caa cat cct ttc ctc 568
Thr Arg Gly Thr Leu Glu Glu Met Pro Thr Val Gln His Pro Phe Leu
aat gtc ttc gag ttg
Asn Val Phe Glu Leu
45 

aac cat gcc gac gag 
Asn His Ala Asp Glu 
60 

<210> 622 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 32..460

<400> 622 

aatttttttg tgacttcttc agaccacttt c atg act tct gga agc aaa tgt 52
Met Thr Ser Gly Ser Lys Cys
1 5

cct agt aca gac tca gga aaa gaa gaa tat att gcc acg ttc aaa gga 100
Pro Ser Thr Asp Ser Gly Lys Glu Glu Tyr Ile Ala Thr Phe Lys Gly
10 15 20

tct gaa tac ttc tgc tac gac ttg tct caa aac ccc att caa agc agc 148
Ser Glu Tyr Phe Cys Tyr Asp Leu Ser Gln Asn Pro Ile Gln Ser Ser
25 30 35

agt gat gaa ata act ctg tca ttt aaa acc ctt cag agg aat gga ctg 196
Ser Asp Glu Ile Thr Leu Ser Phe Lys Thr Leu Gln Arg Asn Gly Leu
40 45 50 55






ctt 


cac 


act 


ggg 


aaa 


teg 


get 


gat 


tat 


gtc 


aat 


ctt 




ctg 


aaa 


244 


Met 


Leu 


His 


Thr 


Gly 
60 


Lys 


Ser 


Ala 


Asp 


Tyr 
65 


Val 


Asn 


Leu 


Ala 


Leu 
70 


Lys 




aat gga gct gtc tct ctg gtc att aat ttg gga tca ggg gcc ttt gaa 292
Asn Gly Ala Val Ser Leu Val Ile Asn Leu Gly Ser Gly Ala Phe Glu
75 80 85




gca 


eta 


gtg 


gag 


cct 


gtg 


aat 


gga 


aag 


ttt 






aat 


gcc 


tgg 


cat 


340 


Ala 


Leu 


Val 
90 


Glu 


Pro 


Val 


Asn 


Gly 
95 


Lys 


Phe 


Asn 


Asp 


Asn 
100 


Ala 


Trp 


His 




gat gtg aaa gtc acc arg aat ctg cgt cag gtg aca ata tca gtg gat 388
Asp Val Lys Val Thr Xaa Asn Leu Arg Gln Val Thr Ile Ser Val Asp
105 110 115

ggg att ctt acc aca acg ggc tac acg caa gaa gat tat acc atg ctg 436
Gly Ile Leu Thr Thr Thr Gly Tyr Thr Gln Glu Asp Tyr Thr Met Leu
120 125 130 135

ggg tct gat gac ttt ttc tat gtt 460
Gly Ser Asp Asp Phe Phe Tyr Val
140



























<210> 623 
<211> 481 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 145..480

<400> 623 

tttttcacct tcttctcctg gacagccggg gccaccattc cactggcttc attcttgctg 60
ctcagctgaa tacaaggcct cagtggtttg ctgggctttg gtgagatctt aaaggcaaac 120
ctctcttggc gcaaggaagg agcc atg cag ttg tat ctg tac tcg atg tcc 171
Met Gln Leu Tyr Leu Tyr Ser Met Ser
1 5



gag cgg ctc ctc tac aca ggc
Glu Arg Leu Leu Tyr Thr Gly

50 55
gtc tgg cca ggc ttc tat ctc
Val Trp Pro Gly Phe Tyr Leu
65 70

aag aca gcc tgt 616
Lys Thr Ala Cys

gga gac 658
Gly Asp
aca gcc atc atg ggg cta gat gaa cag gct aga acc ctg gaa agg gaa 219
Thr Ala Ile Met Gly Leu Asp Glu Gln Ala Arg Thr Leu Glu Arg Glu
10 15 20 25

gag aag aaa gaa aac agt gga gca atc aga tca gac aga tgt gta aag 267
Glu Lys Lys Glu Asn Ser Gly Ala Ile Arg Ser Asp Arg Cys Val Lys
30 35 40

gca tca ttc cag cag gga ctg gca aac aat att tcc ttt tat cta tgc 315
Ala Ser Phe Gln Gln Gly Leu Ala Asn Asn Ile Ser Phe Tyr Leu Cys
45 50 55

cca cct ggc tac agt cca cgt atg ctc aga ttc tct tct gcc tct gcc 363
Pro Pro Gly Tyr Ser Pro Arg Met Leu Arg Phe Ser Ser Ala Ser Ala
60 65 70

gcc ctg aga cag caa gac caa GCC ctt ctc ctc ctc ttc ctc ctc ctc 411
Ala Leu Arg Gln Gln Asp Gln Pro Leu Leu Leu Leu Phe Leu Leu Leu
75 80 85

agc cta ctc aac gtg aag agg atg aga atg aag acc ttt atg atg acc 459
Ser Leu Leu Asn Val Lys Arg Met Arg Met Lys Thr Phe Met Met Thr
90 95 100 105

cgc atc cac tta atg aat agt a 481
Arg Ile His Leu Met Asn Ser
110



<210> 624 
<211> 456 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 295..456



<400> 624 

aggcggtgct gcctgtgtcg tcgtccctgt ttgttcttct tcaaacctca ggatttctcc 60
gtgtttccca gaccgtaacg ttccaaaggt agaaacacag atgacagatt tttggtcttt 120
tcgtcatttc ctatatatgt gttcaatggt ttggaatagt tcaccagtga agtcatctca 180
accagcacag ggccttgact cctaacaaca atcacacgaa cttggaagag gatccttccc 240
cagctgaaac ttcacttgag accttctcat tggccatcat cgacatctgg attc atg 297
Met

ata cag aga aac ttg agt aat gat gac aat agc agt cca gcc tgc atg 345
Ile Gln Arg Asn Leu Ser Asn Asp Asp Asn Ser Ser Pro Ala Cys Met
5 10 15

gat gct gac atg gat gct gtc tcc cat tct tac cac tgg atg tca aga 393
Asp Ala Asp Met Asp Ala Val Ser His Ser Tyr His Trp Met Ser Arg
20 25 30

aat gct aga tgc gat cca gcc agt ctg tac tcg act ctc agt tcc atc 441
Asn Ala Arg Cys Asp Pro Ala Ser Leu Tyr Ser Thr Leu Ser Ser Ile
35 40 45

ttc tca act cac agt 456
Phe Ser Thr His Ser
50































<210> 625 
<211> 551 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 137..550



<400> 625 

aaaaaaaacc agctgagtct ttgtgccagg aagactgcgt gcagaaggtg actgtctcag 60
tggagctggg tcatctcagg ccttggctcc ttgaactttt ggccgccatg tgcttcccga 120
agtcctctct gatgac atg aag aag ctg aag gcc cga atg cac cag gcc ata 172
Met Lys Lys Leu Lys Ala Arg Met His Gln Ala Ile
1 5 10

gaa aga ttt tat gat aaa atg caa aat gca gaa tca gga cgt gga cag 220
Glu Arg Phe Tyr Asp Lys Met Gln Asn Ala Glu Ser Gly Arg Gly Gln
15 20 25

gtg atg tcg agc ctg gca gag ctg gag gac gac ttc aaa gag ggc tac 268
Val Met Ser Ser Leu Ala Glu Leu Glu Asp Asp Phe Lys Glu Gly Tyr
30 35 40

ctg gag aca gtg gcg gct tat tat gag gag cag cac cca gag ctc act 316
Leu Glu Thr Val Ala Ala Tyr Tyr Glu Glu Gln His Pro Glu Leu Thr
45 50 55 60

cct cta ctt gaa aaa gaa aga gat gga tta cgg tgc cga ggc aac aga 364
Pro Leu Leu Glu Lys Glu Arg Asp Gly Leu Arg Cys Arg Gly Asn Arg
65 70 75

tcc cct gtc ccg gat gtt gag gat ccc gca acc gag gag cct ggg gag 412
Ser Pro Val Pro Asp Val Glu Asp Pro Ala Thr Glu Glu Pro Gly Glu
80 85 90




age 


ttt 


tgt 


gac 


aag 


gte 


atg 


aga 


tgg 




cag 


gcc 


atg 


ctg 


cag egg 


460 


Ser 


Phe 


Cys 
95 


Asp 


Lys 


Val 


Met 


Arg 
100 


Trp 


Phe 


Gin 


Ala 


Met 
105 




Gin Arg 




ctg cag acc tgg tgg cac ggg gtt ctg gcc tgg gtg aag gag aag gtg 508
Leu Gln Thr Trp Trp His Gly Val Leu Ala Trp Val Lys Glu Lys Val
110 115 120

gtg gcc ctg gtc cat gca gtg cag gcc ctc tgg aaa cag ttc c 551
Val Ala Leu Val His Ala Val Gln Ala Leu Trp Lys Gln Phe
125 130 135













<210> 626 
<211> 494 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 283..492



<400> 626 

gactttgcga aggcatggcg gggacactgt gaatgtcagc ccagaaggtg atcagagcct 60
gttaattaaa atggaaagaa gacagaaggg aaggtagaca tcaggttctc cctggagact 120
tttcgttttc atttacgctg cggaaactga cgtttttgcc taacacccca tgtaatgtaa 180
acgtataggc ttgagtacgt gtccggccgc atgtgtagtg aaccctaaag ctttcctaat 240
tgtagttagc atcgtcccta agcggaacga ttttccgtga ac atg att tgt act 294
Met Ile Cys Thr
1

ttt cta cga gcc gta cag tat acg gag aag ctg cac agg tcc tcg gca 342
Phe Leu Arg Ala Val Gln Tyr Thr Glu Lys Leu His Arg Ser Ser Ala
5 10 15 20

aag cga ttg ctt ttg cca tac atc gtg ctt aac aaa gcg tgc ttg aag 390
Lys Arg Leu Leu Leu Pro Tyr Ile Val Leu Asn Lys Ala Cys Leu Lys
25 30 35

act gag ccc agt ttg aga tgt ggg ctt caa tat caa aag aaa acg ctg 438
Thr Glu Pro Ser Leu Arg Cys Gly Leu Gln Tyr Gln Lys Lys Thr Leu
40 45 50

cga cct aga tgt att ctt gga gtc acc cag aaa acc atc tgg acg cag 486
Arg Pro Arg Cys Ile Leu Gly Val Thr Gln Lys Thr Ile Trp Thr Gln
55 60 65

gga ccg ag 494
Gly Pro



<210> 627 
<211> 585
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 128..583

<400> 627 

attttccggg aagatggcgg cgcacaagtc aggtccggca catgtttccg cggagcccca 60 
gcaatgacgg atgatatcac ctcttcttct ctggtgagag tctgaggata gagacttttt 120 



tctcacc atg aat gtc acc cca gag gtc aag agt cgt ggg atg aag ttt 169
Met Asn Val Thr Pro Glu Val Lys Ser Arg Gly Met Lys Phe
1 5 10

gct gag gag cag ctg cta aag cat gga tgg act caa ggc aaa ggc ctc 217
Ala Glu Glu Gln Leu Leu Lys His Gly Trp Thr Gln Gly Lys Gly Leu
15 20 25 30




ggc 


egg 


aag 


gag 


aat 


ggt 


ate 


act 


cag 


get 




agg 


gtg 


aca 


ctg 


aag 


265 


Gly 


Arg 


Lys 


Glu 


Asn 

35 


Gly 


He 


Thr 


Gin 


Ala 
40 


Leu 


Arg 


Val 


Thr 


Leu 
45 


Lys 




caa gac act cat ggg gta gga cat gac cct gcc aag gag ttc aca aac 313
Gln Asp Thr His Gly Val Gly His Asp Pro Ala Lys Glu Phe Thr Asn
60




cac 


tgg 


tgg 


aat 


gag 


etc 


ttc 


aac 


aag 


act 


gcg 






ttg 


gta 


gtg 


361 


His 


Trp 


Trp 
65 


Asn 


Glu 


Leu 


Phe 


Asn 
70 


Lys 


Thr 


Ala 


Ala 


75 




Val 


Val 




gaa act ggg cag gat gga gta cag ata agg agc ctt tct aag gag acc 409
Glu Thr Gly Gln Asp Gly Val Gln Ile Arg Ser Leu Ser Lys Glu Thr
80 85 90




acc 


cgt 


tat 


aat 


cat 




aag 






ttg 






cag 


aag 


ttt 


gtg 


457 


Thr 


Arg 


Tyr 


Asn 


His 


Pro 


Lys 


Pro 


Asn 


Leu 


Leu 


Tyr 


Gin 


Lys 


Phe 


Val 




95 










100 










105 










110 




aag atg gct aca ttg act tca ggt gga gag aag cca aac aaa gac ttg 505
Lys Met Ala Thr Leu Thr Ser Gly Gly Glu Lys Pro Asn Lys Asp Leu
115 120 125

gag agc tgc agt gat gac gac aac cag ggg tca agt ccc caa aga ttc 553
Glu Ser Cys Ser Asp Asp Asp Asn Gln Gly Ser Ser Pro Gln Arg Phe
130 135 140

tta ctg atg aga tgc tgc tcc aag cct gtg ar 585
Leu Leu Met Arg Cys Cys Ser Lys Pro Val
145 150



<210> 628 
<211> 560 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 201..560

<400> 628 

atcytawtcg aaaagctctt tccttcttcc tcttgttcct cctcctgcct ctcttcgctt 60
cgcctgcaaa cgcggtgggg gctgctcggc ggtcaggagc agcaagagac agagcgacat 120
gagagattgg accgcgggct gcactggaga atttactggt aggataattc atccctaaag 180
agattgaagt gagcttcaga atg gca aaa gag gag ccc cag agt atc tca agg 233
Met Ala Lys Glu Glu Pro Gln Ser Ile Ser Arg
1 5 10

gac ttg cag gaa ctg cag aag aag ctg tct ctg ctg ata gac tcc ttc 281
Asp Leu Gln Glu Leu Gln Lys Lys Leu Ser Leu Leu Ile Asp Ser Phe
15 20 25

cag aat aac tca aag gtg gtg gcc ttt atg aag tct cca gtg ggt cag 329
Gln Asn Asn Ser Lys Val Val Ala Phe Met Lys Ser Pro Val Gly Gln
30 35 40

tac ttg gac agc cat ccg ttt ctg gcc ttc acc ttg ctg gtg ttc att 377
Tyr Leu Asp Ser His Pro Phe Leu Ala Phe Thr Leu Leu Val Phe Ile
45 50 55

gtc atg tcg gcc gtt cct gtt gga ttc ttc ctg ctc atc gtg gtg ctt 425
Val Met Ser Ala Val Pro Val Gly Phe Phe Leu Leu Ile Val Val Leu
60 65 70 75

acc acc ctg gct gct ctg ctg ggg gtc ata ata ttg gaa gga ttg gtc 473
Thr Thr Leu Ala Ala Leu Leu Gly Val Ile Ile Leu Glu Gly Leu Val
80 85 90

atc tct gtg ggt ggc ttc tca ctg ctc tgc atc ctc tgt ggk ttg agc 521
Ile Ser Val Gly Gly Phe Ser Leu Leu Cys Ile Leu Cys Gly Leu Ser
95 100 105

ttc gta tca ctc gcc atg tcg ggg atg atg ata gca tct 560
Phe Val Ser Leu Ala Met Ser Gly Met Met Ile Ala Ser
110 115 120



<210> 629 
<211> 728 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 420..728

<400> 629 

tttttggagg gctcttagca acggccctgg ttgagccccc tcagccatga gaaaatcaaa 60 

tcaatgtgcc attcttccag gcgcgaggca gcagcggctg cagttcaaca tgaaaggagg 120 

cttcctccct gcctgctaat tacctgctct tcccgatctc atcgtttctg cctttgcaaa 180 

gtgctactga gaagsgggaa gaaaegtccg ceacceatce ecettgetge ctgggggtte 240 

agacttgatt agatggctaa caggggcccg agctatggct taagccgaga ggtgcaggag 300 

aagatcgagc agaagtatga tgcggacctg gagaacaagc tggtggactg gatcatcctg 360 

eagtgcgceg aggaeataga geaeeegeee cceggcagge ccattttcag aaatggtta 419 



atg 


gac 


ggg 


acg 


gtc 


ctg 


tgc 


aag 


ctg 


ata 


aat 


agt 


tta tac 


cca 


cca 


467 


Met 


Asp 


Gly 


Thr 


Val 


Leu 


Cys 


Lys 


Leu 


He 


Asn 


Ser 


Leu Tyr 


Pro 


Pro 




1 








5 










10 








15 






gga caa gag ccc ata ccc aag atc tca gag tca aag atg gct ttt aag 515
Gly Gln Glu Pro Ile Pro Lys Ile Ser Glu Ser Lys Met Ala Phe Lys
20 25 30




cag atg gag caa atc tcc cag ttc cta aaa gct gcg gag acc tat ggt 563
Gln Met Glu Gln Ile Ser Gln Phe Leu Lys Ala Ala Glu Thr Tyr Gly
35 40 45




gtc aga acc acc gac atc ttt cag acg gtg gat cta tgg gaa ggg aag 611
Val Arg Thr Thr Asp Ile Phe Gln Thr Val Asp Leu Trp Glu Gly Lys
50 55 60




gac atg gca gct gtg cag agg acc ctg atg gct tta ggc agc gtt gca 659
Asp Met Ala Ala Val Gln Arg Thr Leu Met Ala Leu Gly Ser Val Ala
65 70 75 80




gtc acc aag gat gat ggc tgc tat cgg gga gag cca tcc tgg ttt cac 707
Val Thr Lys Asp Asp Gly Cys Tyr Arg Gly Glu Pro Ser Trp Phe His
85 90 95






agg aaa gcc cag cag aat cgg 728
Arg Lys Ala Gln Gln Asn Arg
100



<210> 630 
<211> 496 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 261. .494
<400> 630 

ttcgcggtta gggattccat agttcacggg ttacggcaca cacgcgttac gtatatatcg 60 
aggttaatct gcgcaastct tcacttttgt ctgaatgtat tttagcggtg aattactcct 120 
tctggctaag agcttcacat cttcagtgtt aatggtggtg gttttcgcat gtcttgcaca 180 
catttctaga tcttgcactg tcgaaaagtc agctccgaaa ttgctgcaat gataagtttg 240 
ctgaactgca tctctgccca atg cca ctt cct cgc aaa gac aac cca cag tgt 293 
Met Pro Leu Pro Arg Lys Asp Asn Pro Gln Cys
1 5 10




agt cca ctg ctg cct cct ttg gcc tct gtt ggt aag aaa atg gct aag 341
Ser Pro Leu Leu Pro Pro Leu Ala Ser Val Gly Lys Lys Met Ala Lys
15 20 25




aaa 


ate 


get 


get 


get 


cct 


cca 


cct 




cct 


cct 


cct 


cca 


tea 


gaa egg 


389 


Lys 


He 


Ala 
30 


Ala 


Ala 


Pro 


Pro 


Pro 
35 


Pro 


Pro 


Pro 


Pro 


Pro 
40 


Ser 


Glu Arg 




cga ggc aaa cgc gca gaa cag aaa ggg gct gtg ggg cag cgg cca gat 437
Arg Gly Lys Arg Ala Glu Gln Lys Gly Ala Val Gly Gln Arg Pro Asp
45 50 55










tgg 


att 


ttc 


tot 


cog 


cag 




caa 


rgg 


atg 


ttt 


eta 


atg 


acc 


caa gga 


485 


Trp 


He 


Phe 


Ser 


Pro 


Gin 


Pro 


Gin 


Xaa 


Met 


Phe 


Leu 


Met 


Thr Gin Gly 




60 
a aw 


tta 


gtg 


gt 




65 










70 








75 


496 


Xaa 


Leu 


Val 
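
Several codons in this listing are written with IUPAC ambiguity codes, and the corresponding residue is then given as `Xaa` (for example `rgg` in SEQ ID NO: 630). The following is a minimal sketch, not part of the original listing, of how such a codon can be resolved under the standard genetic code. The names `IUPAC`, `CODON_TABLE` and `translate_codon` are illustrative assumptions, and the one-letter code `X` stands in for `Xaa`.

```python
from itertools import product

# IUPAC nucleotide codes, lower case as printed in the listing.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ag", "y": "ct", "s": "cg", "w": "at", "k": "gt", "m": "ac",
    "b": "cgt", "d": "agt", "h": "act", "v": "acg", "n": "acgt",
}

# Standard genetic code, with bases ordered t, c, a, g at each position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_codon(codon):
    """Return the one-letter residue, or 'X' (Xaa) if the codon is ambiguous."""
    # Expand every ambiguity code into the concrete bases it may stand for.
    options = [IUPAC[base] for base in codon.lower()]
    residues = {CODON_TABLE["".join(bases)] for bases in product(*options)}
    # A single possible residue means the ambiguity is silent.
    return residues.pop() if len(residues) == 1 else "X"
```

Under these assumptions, `translate_codon("rgg")` is ambiguous between Arg and Gly and so maps to `X`, whereas a codon such as `gtb` still resolves to Val because every expansion encodes the same residue.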





























<210> 631 
<211> 500 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 91. .498 

<400> 631 

wgaacatcct aatcgaaaaa agattctaag acaagacttc taatacggca tctcagttta 60
ttgttctttg gggtttgttt tcagatcctc atg tca gaa cca ggc aaa ctc agc 114
Met Ser Glu Pro Gly Lys Leu Ser
1 5

cag aaa atc aaa gtt tgg ctt cag gag tac tgg aac atc aca gat ctc 162
Gln Lys Ile Lys Val Trp Leu Gln Glu Tyr Trp Asn Ile Thr Asp Leu
10 15 20












gtg gcc att tcc aca ttc atg att gga gca att ctt cgc cta cag aac 210
Val Ala Ile Ser Thr Phe Met Ile Gly Ala Ile Leu Arg Leu Gln Asn
25 30 35 40




cag ccc tac atg ggc tat ggc cgg gtg atc tac tgt gtg gat atc atc 258
Gln Pro Tyr Met Gly Tyr Gly Arg Val Ile Tyr Cys Val Asp Ile Ile
45 50 55






ttc tgg tac atc cgt gtc ctg gac atc ttt ggt gtc aac aag tat ctg 306
Phe Trp Tyr Ile Arg Val Leu Asp Ile Phe Gly Val Asn Lys Tyr Leu
60 65 70








ggg 




tac 


gtg 




atg 


att 


gga 


aag 


atg 


atg 


ate 


gac 


atg 


ctg 


tac 


354 


Gly 


Pro 


Tyr 

75 


Val 


Met 


Met 


He 


Gly 
80 


Lys 


Met 


Met 


He 


Asp 
85 


Met 


Leu 


Tyr 




ttt 


gtg 


gtc 


ate 




ctg 


gtc 


gtg 




atg 


agt 


ttc 


gga 


gta 


gcc 


cgt 


402 


Phe 


Val 


Val 


He 


Met 


Leu 


Val 


Val 


Leu 


Met 


Ser 


Phe 


Gly 


Val 


Ala 


Arg 






90 










95 










100 












caa gcc att ctg cat cca gag gag aag ccc tct tgg aaa ctg gcc cga 450
Gln Ala Ile Leu His Pro Glu Glu Lys Pro Ser Trp Lys Leu Ala Arg
105 110 115 120




aac atc ttc tac atg ccc tac tgg atg atc tat gga gag gtg ttg cag 498
Asn Ile Phe Tyr Met Pro Tyr Trp Met Ile Tyr Gly Glu Val Leu Gln
125 130 135
500



<210> 632 
<211> 524 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 170. .523 



<400> 632 

tttttccggt cttactcacg ttgcggcctt cctcgcgtca cagccgggat gaagccgatc 60 
ctactgcagg gccatgagcg gtccattacg cagattaagt ataaccgcga aggagacctc 120 
ctctttactg tggccaagga ccctatcgtc aatgtatggt actctgtga atg gtg aga 178 

Met Val Arg 



ggc tgg gca cct aca tgg gcc ata ccg gag ctg tgt ggt gtg tgg acg 226
Gly Trp Ala Pro Thr Trp Ala Ile Pro Glu Leu Cys Gly Val Trp Thr
5 10 15




ctg act ggg aca cca agc atg tcc tca ctg gct cag ctg aca aca gct 274
Leu Thr Gly Thr Pro Ser Met Ser Ser Leu Ala Gln Leu Thr Thr Ala
20 25 30 35




gtc 


gtc 


tct 


ggg 




gtg 




cag 


tct 


gga 


gag 


gtg 


ttg 


gtg 


aat 


gtt 


322 


Val 


Val 


Ser 


Gly 


Thr 
40 


Val 


Lys 


Gin 


Ser 


Gly 
45 


Glu 


Val 


Leu 


Val 


Asn 
50 


Val 




aag gag cac tcc cgg cag atc aac gac atc cag tta tcc agg gac atg 370
Lys Glu His Ser Arg Gln Ile Asn Asp Ile Gln Leu Ser Arg Asp Met
55 60 65




acc atg ttt gtg acc gcg tcc aag gac aac aca gcc aag ctt ttt gac 418
Thr Met Phe Val Thr Ala Ser Lys Asp Asn Thr Ala Lys Leu Phe Asp
70 75 80




tcc aca act ctt gaa cat cag aag act ttc cgg aca gaa cgt cct gtc 466
Ser Thr Thr Leu Glu His Gln Lys Thr Phe Arg Thr Glu Arg Pro Val
85 90 95












aac tca gct gcc ctc tcc ccc aac tat gac cat gtg gtc ctg ggc ggt 514
Asn Ser Ala Ala Leu Ser Pro Asn Tyr Asp His Val Val Leu Gly Gly
100 105 110 115




ggt cag gaa g 524
Gly Gln Glu



<210> 633 
<211> 508 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 141. .506 



<400> 633 

attggcttct ctcccagttt ccaccacact ccccatctgg gtaacctaag caggttactc 60
gcccctgtga ggaggttctc ttctctgtac aatggaaatg gttatccaca cctatctctc 120
agggaggctc tgatatcatc atg aga ccc agc tgt gct cct gga tgg ttt tac 173
Met Arg Pro Ser Cys Ala Pro Gly Trp Phe Tyr
1 5 10
cac aag tcc aat tgc tat ggt tac ttc agg aag ctg agg aac tgg tct 221
His Lys Ser Asn Cys Tyr Gly Tyr Phe Arg Lys Leu Arg Asn Trp Ser

15 20 25

gat gcc gag ctc gag tgt cag tct tac gga aac gga gcc cac ctg gca 269
Asp Ala Glu Leu Glu Cys Gln Ser Tyr Gly Asn Gly Ala His Leu Ala

30 35 40

tct atc ctg agt tta aag gaa gcc agc acc ata gca gag tac ata agt 317
Ser Ile Leu Ser Leu Lys Glu Ala Ser Thr Ile Ala Glu Tyr Ile Ser
45 50 55




ggc 


tat 


cag 


aga 


age 


eag 


ecg 




tgg 


att 


ggc 


etg 


cac 


gae 


eca 


eag 


365 


Gly 


Tyr 


Gin 


Arg 


Ser 


Gin 


Pro 


He 


Trp 


He 


Gly 


Leu 


His 


Asp 


Pro 


Gin 




60 










65 










70 










75 




aag 


agg 






tgg 


cag 


tgg 


att 


gat 


ggg 




atg 




ctg 


tac 


aga 


413 


Lys 


Arg 


Gin 


Gin 


Trp 
80 


Gin 


Trp 


He 


Asp 


Gly 
85 


Ala 


Met 


Tyr 


Leu 


Tyr 
90 


Arg 




tcc tgg tct ggc aag tcc atg ggt ggg aac aag cac tgt gct gag atg 461
Ser Trp Ser Gly Lys Ser Met Gly Gly Asn Lys His Cys Ala Glu Met
95 100 105




age 


tec 




aac 


aac 


ttt 




act 


tgg 


age 


age 




gaa 


tge 


aac 


aa 


508 


Ser 


Ser 


Asn 
110 


Asn 


Asn 


Phe 


Leu 


Thr 
115 


Trp 


Ser 


Ser 


Asn 


Glu 
120 


Cys 









<210> 634 
<211> 464 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 92. .412 

<220> 

<221> sig_peptide 
<222> 92. .160 
<223> Von Heijne matrix 
score 12 

seq RLLLLSLVSGALG/LG 

<220> 

<221> misc_feature 
<222> 325 

<223> n=a, g, c or t 
<400> 634 

tgctgccttc tctgatggct tctccagcgg cctcagtacc accttagcgg tcttctgcca 60
tgagctgccc cactaactgg gtgactttgc c atg ctg ctc cag tca ggg ctg 112
Met Leu Leu Gln Ser Gly Leu
-20 



tec 


ttt 


egg 


egg 


ctg 




ctg 


ctg 




etc 


gtg 


tct 


gga 


gee 


ctg 


gga 


160 


Ser 


Phe 
-15 


Arg 


Arg 


Leu 


Leu 


Leu 
-10 


Leu 


Ser 


Leu 


Val


Ser 
-5 


Gly 


Ala 


Leu 


Gly 




ttg 


ggg 


ggt 


gca 


gtc 


ctg 


ggg 


gtg 


ggg 


etc 


age 


etg 


ggc 


ect 


gtc 




208 


Leu 


Gly 


Gly 


Ala 


Val 


Leu 


Gly 


Val 


Gly 




Ser 


Leu 


Gly 


Pro 


Val 


Pro 




1 








5 










10 










15 






etc 


act 


ccc 


tgg 


gtg 


ttt 


ggg 


gtc 


act 




ggg 


gtc 


ttc 


etc 


tat 


gtg 


256 


Leu 


Thr 


Pro 


Trp 
20 


Val 


Phe 


Gly 


Val 


Thr 
25 


Ala 


Gly 


Val 


Phe 


Leu 
30 


Tyr 


Val 




gcc ctt gtg gac atg cta cca gcc ctg ctt cgt cct ccg gag ccc ctg 304
Ala Leu Val Asp Met Leu Pro Ala Leu Leu Arg Pro Pro Glu Pro Leu
35 40 45






eet 


aeg 


ecc 


eat 


gtg 


etc 


etn 


cag 


ggg 


ctg 


ggg 


ctg 


ctg 


ctg 


ggg 


ggc 


352 


Pro 


Thr 
50 


Pro 


His 


Val 




55 


Gin 


Gly 


Leu 


Gly 


Leu 
60 


Leu 


Leu 


Gly 


Gly 




ggc ctc atg ctt gcc ata acc ctg ctg gag gag cgg cta ctg ccc gtg 400
Gly Leu Met Leu Ala Ile Thr Leu Leu Glu Glu Arg Leu Leu Pro Val
65 70 75 80





acc act gag ggc tgatggggcc aattggaaag gggtcgggtt gcccttcctt 452

Thr Thr Glu Gly

ccccccaacc ac 464
<210> 635
<211> 506 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 193. .468 

<220> 

<221> sig_peptide 

<222> 193. .264 

<223> Von Heijne matrix 

score 11.6999998092651 

seq FVLGLGLTPPTLA/QD 

<400> 635 

acacacactc acacaaggac gcsaacccca cctagatgca aagcaggatt caaaagaaca 60 
tctttgcgtt ttctaccggc tccccatcat cgtactaggg aggaagaagc gggtgagaaa 120 
caaaacttct ttccattgtc ctgcccgttt ctgcggactt gttctgaggc cgaggagcct 180 
gtgttggaag ag atg gtg atg ggc ctg ggc gtt ttg ttg ttg gtc ttc gtg 231 
Met Val Met Gly Leu Gly Val Leu Leu Leu Val Phe Val 
-20 -15 



ctg 


ggt 


ctg 


ggt 


ctg 


acc 


cca 


ccg 


acc 


ctg 




cag 


gat aac 


tec agg 


279 


Leu 


Gly 


Leu 


Gly 


Leu 


Thr 


Pro 


Pro 


Thr 




Ala 


Gin 


Asp Asn 


Ser Arg 






-10 










-5 










1 




5 




tac aca cac ttc ctg acc cag cac tat gat gcc aaa cca cag ggc cgg 327
Tyr Thr His Phe Leu Thr Gln His Tyr Asp Ala Lys Pro Gln Gly Arg
10 15 20




gat gac aga tac tgt gaa agc atc atg agg aga cgg ggc ctg acc tca 375
Asp Asp Arg Tyr Cys Glu Ser Ile Met Arg Arg Arg Gly Leu Thr Ser
25 30 35






GGC 


tgc 


aaa 


gac 


ate 


aac 


aca 


ttt 


att 


cat 


ggc 


aae 


aag ege 


asa tea 


423 


Pro 


Cys 


Lys 


Asp 


lie 


Asn 


Thr 


Phe 


He 


His 


Gly 


Asn 


Lys Arg 


Xaa Ser 








40 










45 










50 






agg cca tct gtg aaa aca aga atg gaa acc ctc aca gag aaa acc 468
Arg Pro Ser Val Lys Thr Arg Met Glu Thr Leu Thr Glu Lys Thr
55 60 65









taagaataag caagtcttct ttccaggtca ccacttgc 506

<210> 636 
<211> 566 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 268. .507 

<220> 

<221> sig_peptide 

<222> 268. .363 

<223> Von Heijne matrix 

score 8.89999961853027 

seq LGLFFGFLLVIFA/IE 

<400> 636 

actttagctc gcctaggatt tgagctgggt gtatgtctgc tcccagctca agtccctccg 60

agtgccagag aggaaggcag ggagaagcgg asacccctct ttgggccaag gccaaggagg 120 

actgtgcttg ccgggattgc tgtccttgcc attggactat ggctccgatt cgactctcag 180 

accaagagca tcttcgagca agaaactaat aataataatt ccagcttcta cacaggagtc 240 

tatattctga tcggagccgg cgccctc atg atg ctg gtg ggc ttc ctg ggc tgc 294
Met Met Leu Val Gly Phe Leu Gly Cys
-30 -25



tgc ggg gct gtg cag gag tcc cag tgc atg ctg gga ctg ttc ttc ggc 342
Cys Gly Ala Val Gln Glu Ser Gln Cys Met Leu Gly Leu Phe Phe Gly
-20 -15 -10






ttc ctc ttg gtg ata ttc gcc att gaa ata gct gcg gcc atc tgg gga 390
Phe Leu Leu Val Ile Phe Ala Ile Glu Ile Ala Ala Ala Ile Trp Gly
-5 1 5




tat 












gtg 


att 


aag 


gaa gte 


eag gag 


ttt 


tac aag 


438 


Tyr 


Ser 


His 


Lys 


Asp 


Glu 


Val 


He 


Lys 


Glu Val 


Gin Glu 


Phe 


Tyr Lys 




10 










15 








20 






25 




gac acc tac aac aag ctg aaa acc aag gat gag ccc cag cgg gaa acg 486
Asp Thr Tyr Asn Lys Leu Lys Thr Lys Asp Glu Pro Gln Arg Glu Thr
30 35 40




ctg aaa sca tcc act atg cgt tgaactgctg tggtttggct gggggcgtgg 537
Leu Lys Xaa Ser Thr Met Arg
45



aacagtttat ctcagacatc tgcccaaga 566

<210> 637 
<211> 568 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 30. .527 

<220> 

<221> sig_peptide 

<222> 30. .74 

<223> Von Heijne matrix 

score 8.60000038146973 

seq PLLIICLLPAIEG/KN 



<400> 637 



actggggcac agtaggagga acccagaag atg ctg cct ctc ctg atc atc tgt 53
Met Leu Pro Leu Leu Ile Ile Cys
-15 -10








etc 


ctg 


cet 


gcc 


att 


gaa 


ggg 


aag 


aac 


tgc 


etc 


cgc 


tgc 


tgg 




gaa 


101 


Leu 


Leu 


Pro 
-5 


Ala 


He 


Glu 


Gly 


Lys 
1 


Asn 


Cys 


Leu 


5 


Cys 


Trp 


Pro 


Glu 




ctg 


tct 


gcc 


ttg 


ata 


gac 


tat 


gac 


ctg 


cag 


ate 


etc 


tgg 


gtg 


acc 




149 


Leu 


Ser 


Ala 


Leu 


He 


Asp 


Tyr 


Asp 


Leu 


Gin 


He 




Trp 


Val 


Thr 


Pro 




10 










15 










20 










25 




ggg cca ccc aca gaa ctt tct caa aat cgt gac cat ttg gaa gaa gaa 197
Gly Pro Pro Thr Glu Leu Ser Gln Asn Arg Asp His Leu Glu Glu Glu
30 35 40






aca 


gee 


aaa 


tte 


ttc 


aet 


caa 


gta 




caa 


gee 


att 


aaa 


acg 


tta 


cga 


245 


Thr 


Ala 


Lys 


Phe 


Phe 


Thr 


Gin 


Val 


His 


Gin 


Ala 


He 


Lys 


Thr 




Arg 










45 










50 










55 








gat 


gat 


aaa 




gta 


ctt 


ctg 


gaa 


gag 


ate 


tac 


acg 


cac 


aag 


aat 


etc 


293 


Asp 


Asp 


Lys 


Thr 


Val 


Leu 


Leu 


Glu 


Glu 


He 


Tyr 


Thr 


His 


Lys 


Asn 


Leu 








60 










65 










70 










ttt act gag agg ctg aat aag ata tct gat ggg ctg aag gag aag gac 341
Phe Thr Glu Arg Leu Asn Lys Ile Ser Asp Gly Leu Lys Glu Lys Asp
75 80 85












ata cag tcc aca ctg aag gtc acc agc tgt gct gac tgc agg act cac 389
Ile Gln Ser Thr Leu Lys Val Thr Ser Cys Ala Asp Cys Arg Thr His
90 95 100 105




ttc ctc tcc tgc aat gac ccc act ttc tgc cca gcc agg aac cgg cgg 437
Phe Leu Ser Cys Asn Asp Pro Thr Phe Cys Pro Ala Arg Asn Arg Arg
110 115 120






acc tcc ctg tgg gct gtg agt ctc agc agt gct cta ctc ctg gcc ata 485
Thr Ser Leu Trp Ala Val Ser Leu Ser Ser Ala Leu Leu Leu Ala Ile

125 130 135

gct gga gat gtb wct ttt act ggc aaa gga aga agg agg cag 527
Ala Gly Asp Val Xaa Phe Thr Gly Lys Gly Arg Arg Arg Gln

140 145 150 

taaagcagga acagggcagc ccgcatgtct tccagaagtg a 568 



<210> 638 
<211> 420 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 56. .217 



<220> 

<221> sig_peptide 

<222> 56..133

<223> Von Heijne matrix 

score 8.39999961853027 
seq AAILLLTVPVAEG/VT 



<400> 638 

aaaactgaaa acagaacaaa cacaaacaca aacagagaat caacgaagaa gaaaa atg 58 

Met 

gtg aag gtg atg tgg gtt tcc gtt tta gct ctg gcg gcg gcg att ctc 106

Val Lys Val Met Trp Val Ser Val Leu Ala Leu Ala Ala Ala Ile Leu

-25 -20 -15 -10

ctt ttg acg gtc ccg gtg gca gaa ggg gtg acg tgc tcg cct atg cag 154

Leu Leu Thr Val Pro Val Ala Glu Gly Val Thr Cys Ser Pro Met Gln

-5 1 5
ctg gct tca tgt gcg gcg gcg atg acg tca tct tcg ccg cca tcg gag 202
Leu Ala Ser Cys Ala Ala Ala Met Thr Ser Ser Ser Pro Pro Ser Glu

10 15 20

gct gtt gca caa agc tgagagagca gcagccatgc ctttgtgggt acatgaggaa 257
Ala Val Ala Gln Ser
25

ccctaccctc cgccaatacg ttagctcccc taacgcaagg aaagtctcca acagttgcaa 317
gattccttcc ccaagctgtt aaggaaatgt taatcatgat taattagtga caagtttcgc 377
tgattatagt ggttaatgct ggtcttatct tcgtcagtac tac 420
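
The feature coordinates in each record determine the residue numbering printed under the translation: residues of the signal peptide carry negative numbers and the first residue after the predicted cleavage site is numbered 1. The following is a minimal sketch of that arithmetic, not part of the original listing. The function names `codon_count` and `residue_range` are illustrative assumptions, and the sketch assumes, as in the records here, that the `sig_peptide` feature starts at the first base of the `CDS` feature.

```python
def codon_count(start, end):
    """Number of codons in a feature given 1-based inclusive coordinates."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("feature length is not a whole number of codons")
    return length // 3


def residue_range(cds, sig_peptide):
    """Return (first, last) residue numbers as printed under the translation.

    The initiator Met is numbered -(signal peptide length); there is no
    residue 0, so the mature protein runs from 1 to (total - signal).
    """
    total = codon_count(*cds)
    signal = codon_count(*sig_peptide)
    return -signal, total - signal
```

For SEQ ID NO: 638 (CDS 56..217, sig_peptide 56..133) this gives 54 codons in total, 26 of them in the signal peptide, so the numbering runs from -26 at the initiator Met to 28 at the final Ser.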



<210> 639 
<211> 526 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 276. .440 



<220> 

<221> sig_peptide 

<222> 276. .353 

<223> Von Heijne matrix 

score 8.30000019073486 
seq SVLLLAIVLISWG/CI 



<400> 639 

tatctcacgc gcggcagttc aagaacatca gacacgctca acaaacactt gttgcttgac 60
tgatacatta tggcgggcgg gcaaggcccc gatcacgtga taccgccccc caactccggg 120
cggaagaggc cagcgacagc ttgcacgaag cgagtgggcg ggcgctcgtg cccggctaaa 180
agatggctgc tggcgcctgg cagccaccgc ctgggaggtt actgtaaggc ccgcagctcc 240
cgccagctcc cgcggactgc tgccgcctcc ttacc atg aag cca gta agt cgt 293

Met Lys Pro Val Ser Arg
-25

cgc acg ctg gac tgg att tat tca gtg ttg ctg ctt gcc atc gtt tta 341

Arg Thr Leu Asp Trp Ile Tyr Ser Val Leu Leu Leu Ala Ile Val Leu

-20 -15 -10 -5

atc tcc tgg ggc tgc atc atc tat gct tcg atg gtg tct gca aga cga 389

Ile Ser Trp Gly Cys Ile Ile Tyr Ala Ser Met Val Ser Ala Arg Arg

1 5 10
cag cta agg aag aaa tac cca gac aaa atc ttt ggg acg aat gaa aat 437
Gln Leu Arg Lys Lys Tyr Pro Asp Lys Ile Phe Gly Thr Asn Glu Asn

15 20 25

ttg taactcttct ggatttaatt atctgaaaat acagttcttt ccctcatgct 490

Leu

tatgtagata taaaaataaa attcataatg caaagt 526



<210> 640 
<211> 543 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 152. .454 



<220> 

<221> sig_peptide 

<222> 152. .217 

<223> Von Heijne matrix 

score 7.90000009536743 
seq ILLVALQARLSHS/RQ 



<400> 640 

aggaaattag gacacctcat cccaaaagac ctttaaatag gggaagtcca cttgtgcacg 60
gctgctcctt gctatagaag acctgggaca gaggactgct gtctgccctc tctggtcacc 120
ctgcctagct agaggatctg tgaccccagc c atg agg acc ctc gcc atc ctt 172
Met Arg Thr Leu Ala Ile Leu
-20

gct gcc att ctc ctg gtg gcc ctg cag gcc agg ctg agc cac tcc agg 220
Ala Ala Ile Leu Leu Val Ala Leu Gln Ala Arg Leu Ser His Ser Arg
-15 -10 -5 1




caa 


gag 


ctg 


atg 


agg 


ttg 


ctg 


cag 


ccc 


egg 


age 


aga 


ttg 




egg 




268 


Gin 


Glu 


Leu 


Met 
5 


Arg 


Leu 


Leu 


Gin 


Pro 
10 


Arg 


Ser 


Arg 


Leu 


Gin 
15 


Arg 


Thr 




tee 


cag 


aag 


tgg 


ttg 


ttt 




ttg 


eat 


ggg 


acg 


aaa 




tgg 


etc 


caa 


316 


Ser 


Gin 


Lys 
20 


Trp 


Leu 


Phe 


Pro 


Leu 
25 


His 


Gly 


Thr 


Lys 


Ala 
30 


Trp 


Leu 


Gin 




age 


ate 


cag 


got 




gga 


aaa 


aca 


tgg 


eet 


get 


att 


gea 


gaa 


tac 


cag 


364 


Ser 


He 
35 


Gin 


Ala 


Gin 


Gly 


Lys 
40 


Thr 


Trp 


Pro 


Ala 


He 
45 


Ala 


Glu 


Tyr 


Gin 




cgt gca ttg cag gag aac gtc gct atg gaa cct gca tct acc agg gaa 412
Arg Ala Leu Gln Glu Asn Val Ala Met Glu Pro Ala Ser Thr Arg Glu
50 55 60 65






tet 


ggg 


cat 


tet 


get 


get 


gag 


ctt 


gea 


gaa 


aaa 


gaa 








454 


Asp 


Ser 


Gly 


His 


Ser 


Ala 


Ala 


Glu 


Leu 


Ala 


Glu 




Glu 











70 75 



tgagctcaaa atttgctttg agagctacag ggaattgcta ttactcmtgt accttctgct 514 
caatttcctt tcctcatctc aaataaatg 543 



<210> 641 
<211> 336 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS 
<222> 77. .247 

<220> 

<221> sig_peptide 

<222> 77. .139 

<223> Von Heijne matrix 

score 7.80000019073486 

seq SVLIFCLLNLAIS/DR 

<220> 

<221> misc_feature
<222> 301 

<223> n=a, g, c or t 
<400> 641 

atctgagcta gggaagaatg tgtattctgt tgttggatga agtagtctac agatgtcagt 60
tatatccagt tgagta atg gtg ttg ttg agt tca aca atg tct gta ctg att 112
Met Val Leu Leu Ser Ser Thr Met Ser Val Leu Ile
-20 -15 -10

ttc tgc ctg cta aat ctg gcc att tct gat aga agg gta atg aag tct 160
Phe Cys Leu Leu Asn Leu Ala Ile Ser Asp Arg Arg Val Met Lys Ser
-5 1 5



att tgc att gag aat ttg gtc act ctg tta tta ggc aca taagaatcct 257

Ile Cys Ile Glu Asn Leu Val Thr Leu Leu Leu Gly Thr

25 30 35

cagctgcatt ttgcattatt tccagagttc ttggttgtaa tcantggata tacaggctgt 317 
ggaagactta ccccatttc 336 

<210> 642 
<211> 507 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 110. .496 

<220> 

<221> sig_peptide 

<222> 110. .187 

<223> Von Heijne matrix 

score 6.59999990463257 

seq LFWLASGWTPAFA/YS 

<400> 642 

aagatgcgga ggagggtgac gcactagctc tccagttcgc ccgttcctgg cctgaccccc 60 
accaaggccc ataccgcagt aggctcctcg ggctgcccct cggttgaca atg gtc tcc 118

Met Val Ser
-25

agg atg gtc tct acc atg cta tct ggc cta ctg ttt tgg ctg gca tct 166
Arg Met Val Ser Thr Met Leu Ser Gly Leu Leu Phe Trp Leu Ala Ser

-20 -15 -10

gga tgg act cca gca ttt gct tac agc ccc cgg acc cct gac cgg gtc 214
Gly Trp Thr Pro Ala Phe Ala Tyr Ser Pro Arg Thr Pro Asp Arg Val

-5 1 5
tca gaa gca gat atc cag agg ctg ctt cat ggt gtt atg gag caa ttg 262

Ser Glu Ala Asp Ile Gln Arg Leu Leu His Gly Val Met Glu Gln Leu






WO 02/083898 PCT/IB01/00914

10 15 20 25

ggc att gcc agg ccc cga gtg gaa tat cca gct cac cag gcc atg aat 310
Gly Ile Ala Arg Pro Arg Val Glu Tyr Pro Ala His Gln Ala Met Asn
30 35 40

ctt gtg ggc ccc cag agc att gaa ggt gga gct cat gaa gga ctt cag 358
Leu Val Gly Pro Gln Ser Ile Glu Gly Gly Ala His Glu Gly Leu Gln
45 50 55

cat ttg ggt cct ttt ggc aac atc ccc aac atc gtg gca gag ttg act 406
His Leu Gly Pro Phe Gly Asn Ile Pro Asn Ile Val Ala Glu Leu Thr
60 65 70

gga gac aac att cct aag gac ttt agt gag gat cag ggg tac cag acc 454
Gly Asp Asn Ile Pro Lys Asp Phe Ser Glu Asp Gln Gly Tyr Gln Thr
75 80 85

ctc caa atc cct gtc ctg ttg gaa aaa cag atg atg gat gtc 496
Leu Gln Ile Pro Val Leu Leu Glu Lys Gln Met Met Asp Val
90 95 100

tagaaaacac c 507



<210> 643 
<211> 466 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 14. .310 



<220> 

<221> sig_peptide 

<222> 14. .136 

<223> Von Heijne matrix 

score 5.40000009536743 
seq WLCPGLRLPGSLA/GR 



<400> 643

gtaaacagac aac atg gcg gcc gcg gtg kss gcg gca cct ggg gcc ttg 49
Met Ala Ala Ala Val Xaa Ala Ala Pro Gly Ala Leu
-40 -35 -30




gga tcc ctg cat gct ggc ggc gcc cgc ctg gtg gcc gct tgc agt gcg 97
Gly Ser Leu His Ala Gly Gly Ala Arg Leu Val Ala Ala Cys Ser Ala
-25 -20 -15

tgg ctc tgc ccg ggg ttg agg ctg ccc ggc tcg ttg gca ggc cgg cga 145
Trp Leu Cys Pro Gly Leu Arg Leu Pro Gly Ser Leu Ala Gly Arg Arg
-10 -5 1

gcc ggc ccg gcg atc tgg gcc cag ggc tgg gta cct gcg gcc ggg ggt 193
Ala Gly Pro Ala Ile Trp Ala Gln Gly Trp Val Pro Ala Ala Gly Gly
5 10 15


eec 
Pro 
20 


Ala 


ccg 
Pro 


aaa 
Lys 


agg 
Arg 


ggc 
Gly 
25 


tac 
Tyr 


age 
Ser 


tct 
Ser 


gag 
Glu 


atg 
Met 
30 


Lys 


Thr 


gag 
Glu 


Asp 


gag 
Glu 
35 


241 


ctg 
Leu 


egg 

Arg 


gtg 

Val 


egg 

Arg 


eac 
His 
40 


ctg 

Leu 


gag 
Glu 


gag 
Glu 


gag 
Glu 


Asn 
45 


cga 
Arg 


gac 
Asp 


aga 
Arg 


gac 
Asp 


aag 
Lys 
50 


aac 
Asn 


289 


atg gtc aaa ggc atg gag atg tgaacatgtg tggtgtattt gggacgtaac 340
Met Val Lys Gly Met Glu Met
55

ataaggaatt gtggtgcttg gaataaacag agcttatggc aaaaattcac tcagtaaaaa 400
tcttataaaa atgctatcaa aagctgtgga tgctttgaaa tctgataaga aagtacggac 460



<210> 644 
<211> 502 
<212> DNA 
<213> Homo sapiens

<220> 
<221> CDS 
<222> 19. .342 

<220> 

<221> sig_peptide 

<222> 19. .75 

<223> Von Heijne matrix 

score 4.19999980926514 

seq AVAASAASGQAEG/KK 

<400> 644 



gcgcgctgct cttctaag atg gct gcc gct acc ggt gcg gtg gca gcc tcg 51
Met Ala Ala Ala Thr Gly Ala Val Ala Ala Ser
-15 -10






gcc 


gcc 


teg 


ggt 


cag 


gcg 


gaa 


ggt 


aaa aag 




acc 


gat 


ctg 


egg 


gtc 


99 


Ala 


Ala 


Ser 


Gly 
-5 


Gin 


Ala 


Glu 


Gly 


Lys Lys 
1 


He 


Thr 


Asp 

5 


Leu 


Arg 


Val 






gat 


ctg 


aag 


tec 


gag 


etg 


aag 


egg egg 


aac 


tta 


gac 


ate 


acc 


gga 


147 


He 


Asp 


Leu 


Lys 


Ser 


Glu 


Leu 


Lys 


Arg Arg 


Asn 


Leu 


Asp 


He 


Thr 


Gly 






10 










15 








20 












gtc 


aag 


acc 


gtg 


etc 


ate 


tee 


ega 


etc aag 


cag 






gaa 


gag 


gaa 


195 


Val 


Lys 


Thr 


Val 


Leu 


He 


Ser 


Arg 


Leu Lys 


Gin 


Ala 


He 


Glu 


Glu 


Glu 




25 










30 








35 










40 




gga 


ggc 






gat 


aat 


att 


gaa 


tta act 


gtt 


tea 


act 


gat 


act 


oca 


243 


Gly 


Gly 


Asp 


Pro 


Asp 


Asn 


He 


Glu 


Leu Thr 


Val 


Ser 


Thr 


Asp 


Thr 


Pro 












45 








50 










55 






aac 


aag 




cca 






ggc 




ggt aaa 










gat 


gag 


291 


Asn 


Lys 


Lys 


Pro 


Thr 


Lys 


Gly 


Lys 


Gly Lys 


Lys 


His 


Glu 


Ala 


Asp 


Glu 










60 










65 








70 








ttg agt gga gat gct tct gtg gaa gat gat gct ttt atc aag gta aag 339
Leu Ser Gly Asp Ala Ser Val Glu Asp Asp Ala Phe Ile Lys Val Lys
75 80 85

tgt taattactca ggttaagatg cttattttaa gcctagtatt ttgcatcatg 392 
Cys 

aattttgtgc tttttgttga acaatggtca ctgaagtatg taggttaaag akdaagacct 452
atgtaatatt gacaacgttt aatattgtca atctcattaa ctctcataat 502 



<210> 645 
<211> 493 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 264. .443 

<220> 

<221> sig_peptide 

<222> 264. .386 

<223> Von Heijne matrix 

score 4.19999980926514 

seq VFWMSFYLSRVNC/TT 

<400> 645 

ttcgtgaaat ttgttggttt tgaggactgt aaaagtgatt tcatactctg aatataaaac 60

tggataatag ggtaatgttt taaaatttat tatgctatta ttcagaatgc caaagtatta 120

ttttttttcc caaaatcagt ctggacattt actacttttt agactttttg acgttgaact 180

ttctgtataa aaattggctg ggttttgagc ttttggtaag aaataaaagc cgattaagca 240

ctggccgccc cgcgtggtac cca atg ccc gag tca ctg tgg cag cat tcg cac 293
Met Pro Glu Ser Leu Trp Gln His Ser His
-40 -35



tgg 


tgt 


ggg 


gag 












gag get ggg 


ttt 


ctg 


ggc 


acc 




Trp 


Cys 


Gly Glu 


Ser 


Phe 


Gin 


Leu 


Lys 


Glu Ala Gly 


Phe 


Leu 


Gly Thr 






-30 










-25 






-20 












ctt gaa gtt ttc tgg atg tct ttt tat ctt tct cgt gtg aac tgc act 389
Leu Glu Val Phe Trp Met Ser Phe Tyr Leu Ser Arg Val Asn Cys Thr
-15 -10 -5 1




aca aaa gag acc agc cca ctt ccc aag cca gcc aga cac ctg ggt ctt 437
Thr Lys Glu Thr Ser Pro Leu Pro Lys Pro Ala Arg His Leu Gly Leu
5 10 15



gag cca taaactggcg tagttaagct ttgcagcttc cagtgtattt tatttattct 493

Glu Pro 

<210> 646
<211> 540 
<212> DNA 

<213> Homo sapiens

<220> 
<221> CDS 
<222> 175. .354 

<220> 

<221> sig_peptide 
<222> 175. .309 
<223> Von Heijne matrix 
score 4 

seq ENFLSLLSKSCSA/DP 
<400> 646 

attttaaaat ggaaattggg gcgaggggtt ggcggctggg cgaagggagc attcaaaagc 60 
ggagaatgtc actttacccg agaaattcac tacgatgggc cattggtttc tgctcggctc 120 
gcaggcgcac tgcacgagtg gaggtgtggc tagtggctgt gatgagataa atcc atg 177 

Met 
-45 



cat agc ctt ttc att gcg agc ttg aaa gtt ctt ttt tat tac agt ttt 225
His Ser Leu Phe Ile Ala Ser Leu Lys Val Leu Phe Tyr Tyr Ser Phe
-40 -35 -30




agc ttt agg ttt aat tgg ttc gac tgc ctt ctc cac aat ttg ggc gag 273
Ser Phe Arg Phe Asn Trp Phe Asp Cys Leu Leu His Asn Leu Gly Glu
-25 -20 -15






aat 


ttc 


Ctt 


age 


ctt 


etc 


age 


aaa 


agt 


tgt 


tct 


geg 


gac 


ccg 


tct ggg 


321 


Asn 


Phe 


Leu 


Ser 


Leu 


Leu 


Ser 


Lys 


Ser 


Cys 


Ser 


Ala 


Asp 


Pro 


Ser Gly 








-10 










-5 










1 








tea 


act 


ttc 


atg 


agg 


gac 


att 


gag 


aca 


aac 


aaa 


tgaaatatgg gttaaagtac 


374 


Ser 


Thr 


Phe 


Met 


Arg 


Asp 


He 


Glu 


Thr 


Asn 


Lys 












5 










10 










15 













tctgagcagc tacaaaaaga agaccagtct atcctgctgg agacagtggc cacgtgaaga 434 
aagagctett geagtatgaa gaccacatgg aaagagaggc cacatggaac caacagtcag 4 94 
eatcttggtt teggacacgt gaagaattea tetcagactg tgtate 540 

<210> 647
<211> 505
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 100..498

<220>
<221> sig_peptide
<222> 100..138
<223> Von Heijne matrix
score 3.79999995231628
seq MAHCVTLVQLSIS/CD

<400> 647

attttgtgaa gcggcgaagg aggtggtggc tgcgttgggc tccgggaagc cgttcgggct 60
ggggctgtcg gccgcggggc ggaggcactc gcgcggggg atg gcc cac tgc gtg 114
                                           Met Ala His Cys Val
                                                       -10

acc ttg gtt cag ctg tcc att tcc tgt gac cat ctc att gac aag gac 162
Thr Leu Val Gln Leu Ser Ile Ser Cys Asp His Leu Ile Asp Lys Asp
            -5                  1               5

atc ggc tcc aag tct gac cca ctc     gtc ctt tta cag gat gtg gga 210
Ile Gly Ser Lys Ser Asp Pro Leu Cys Val Leu Leu Gln Asp Val Gly
    10                  15                  20

ggg ggc agc tgg gct gag ctt ggc cgg act gaa cgg gtg cgg aac tgc 258
Gly Gly Ser Trp Ala Glu Leu Gly Arg Thr Glu Arg Val Arg Asn Cys
25                  30                  35                  40

tca agc cct gag ttc tcc aag act cta cag ctt gag tac cgc ttt gag 306
Ser Ser Pro Glu Phe Ser Lys Thr Leu Gln Leu Glu Tyr Arg Phe Glu
                45                  50                  55

aca gtc cag aag cta cgc ttt gga atc tat gac ata gac aac aag acg 354
Thr Val Gln Lys Leu Arg Phe Gly Ile Tyr Asp Ile Asp Asn Lys Thr
            60                  65                  70

cca gag ctg agg gat gat gac ttc cta ggg ggt gct gag tgt     cta 402
Pro Glu Leu Arg Asp Asp Asp Phe Leu Gly Gly Ala Glu Cys Ser Leu
        75                  80                  85

gga cag att gtg tcc agc cag gta ctg act ctc ccc ttg atg ctg aag 450
Gly Gln Ile Val Ser Ser Gln Val     Thr Leu Pro     Met Leu Lys
    90                  95                  100

cct gga aaa cct gct ggg cgg ggg aca tca cgg tct cag ctc agg aat 498
Pro Gly Lys Pro Ala Gly Arg Gly Thr Ser Arg Ser Gln Leu Arg Asn
105                 110                 115                 120

taaagga 505

<210> 648
<211> 472
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 78..281

<220>
<221> sig_peptide
<222> 78..218
<223> Von Heijne matrix
score 3.70000004768372
seq PLLSLHSRGGSSS/ES

<400> 648

attttgcgaa cggcgagcag cggcgcggcg cggagagacs agcggaggtt ttcctggttt 60
cggaccccag cggccgg atg gtg aaa tcc tcc ctg cag cgg atc ctc aat 110
                   Met Val Lys Ser Ser Leu Gln Arg Ile Leu Asn
                           -45                 -40

agc cac tgc ttc gcc aga gag aag gaa ggg gat aaa ccc agc gcc acc 158
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr
    -35                 -30                 -25

atc cac gcc agc cgc acc atg ccg ctc cta agc ctg cac agc cgc ggc 206
Ile His Ala Ser Arg Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly
-20                 -15                 -10                 -5

ggc agc agc agt gag agt tcc agg gtc tcc ctc cac tgc tgt agt aac 254
Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn
                1               5                   10

ccg ggt ccg ggg cct cgg tgg tgc tcc tgatgcccct cacccacccc 301
Pro Gly Pro Gly Pro Arg Trp Cys Ser
        15                  20

tgaagatccc aggtgggcga gggaatagtc agagggatca caatctttca gctttttggg 361
ctttgagatt gtgagaccgg ggcatcccct tatccccaag agacccgacg cttgcttcat 421
ggcctacacg ttcgagagag agtcttcggg agaggaggag gagtagggcc g 472

<210> 649
<211> 458
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 173..373

<220>
<221> sig_peptide
<222> 173..265
<223> Von Heijne matrix
score 3.70000004768372
seq ASWALGRWGSCRA/LD

<400> 649

agaatctgac attccaggac tccactaggc ctgttttcct ctgtgggaac tagagccaag 60
gcgagagacc cgtgccagcc ccgaggctcc cggggcccat gggcaggcgg ggctggtaca 120
gggccccggg gctctctcag cccatctgtc acccctcccc cccaacaccc ag atg tcc 178
                                                          Met Ser
                                                              -30

cgt ctt ggc agt gct gtc ccg gac tgc cac att aat gct cca gaa ccc 226
Arg Leu Gly Ser Ala Val Pro Asp Cys His Ile Asn Ala Pro Glu Pro
                -25                 -20                 -15

gcc agc tgg gcc ctg ggg cgg tgg ggc tcc tgc aga gcc ctg gac aaa 274
Ala Ser Trp Ala Leu Gly Arg Trp Gly Ser Cys Arg Ala Leu Asp Lys
            -10                 -5                  1

ggc gag gga ggg agg gaa ggg gat cct aag cac ccc tcc ctc cct ggc 322
Gly Glu Gly Gly Arg Glu Gly Asp Pro Lys His Pro Ser Leu Pro Gly
    5                   10                  15

cct agg agg cag aca tgc ccg atg aga tca aca ttg atg aat tgt tgg 370
Pro Arg Arg Gln Thr Cys Pro Met Arg Ser Thr Leu Met Asn Cys Trp
20                  25                  30                  35

agt tagagagtga agaggagaga agccggaaaa tccagggact cctgaagtca 423
Ser

tgtgggaaac ctgtcgagga cttcatccag ggagc 458



<210> 650
<211> 482
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 8..250

<220>
<221> sig_peptide
<222> 8..64
<223> Von Heijne matrix
score 3.5
seq RLLRASVARHVSA/IP

<400> 650

actgaag atg gcg gct gct gta gga cgg ttg ctc cga gcg tcg gtt gcc 49
        Met Ala Ala Ala Val Gly Arg Leu Leu Arg Ala Ser Val Ala
                        -15                 -10

cga cat gtg agt gcc att cct tgg ggc att tct gcc act gca gcc ctc 97
Arg His Val Ser Ala Ile Pro Trp Gly Ile Ser Ala Thr Ala Ala Leu
-5                  1               5                   10

agg cct gct gca tgt gga aga acg agc ttg aca aat tta ttg tgt tct 145
Arg Pro Ala Ala Cys Gly Arg Thr Ser Leu Thr Asn Leu Leu Cys Ser
            15                  20                  25

ggt tcc                 att att tct tct atc ctt tgg     tca cct 193
Gly Ser Ser Gln Ala Lys Ile Ile Ser Ser Ile Leu Trp Ile Ser Pro
        30                  35                  40

ttg tgt gtc cta         ttg ttg ctt tta gtg aca aag cta acg aat 241
Leu Cys Val     Gln Lys Leu Leu Leu     Val Thr Lys Leu Thr Asn
    45                  50                  55

ttc acg atg tgaactgtga agttgtcgca gtctcagtgg attcccactt 290
Phe Thr Met
60

tagccatctt gcctggataa atacaccaag aaagaatggt ggtttgggcc acatgaacat 350
cgcactcttg tcagacttaa ctaagcagat ttcccgagac tacggtgtgc tgttagaagg 410
ttctggtctt gcactaagag gtctcttcat aattgaccct aatggagtca tcaagcattt 470
gagcgtcaac ga 482



<210> 651
<211> 291
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 130..288

<220>
<221> sig_peptide
<222> 130..234
<223> Von Heijne matrix
score 3.5
seq GLVEVVAVGVVRS/DQ

<400> 651

agtctctgtc ggagtctgtt tcccagtgca cccaacttgt gatagaaggt atatgttctg 60
ctgaaaaaga gaaaaaggaa aaaaaagaat aaaagcagag agcagagaag gacagggtct 120
ggctgccgc atg gag ggg agg agc agg agc agg aaa gcc tgt ctg gag gtt 171
          Met Glu Gly Arg Ser Arg Ser Arg Lys Ala Cys Leu Glu Val
          -35                 -30                 -25

gct aca gta aac cag gtc ttc gtg ggg ttg gtc gag gtg gtg gca gtg 219
Ala Thr Val Asn Gln Val Phe Val Gly Leu Val Glu Val Val Ala Val
    -20                 -15                 -10

ggg gtg gtg aga agt gat cag att aag aat atg tat tta aaa tac agc 267
Gly Val Val Arg Ser Asp Gln Ile Lys Asn Met Tyr Leu Lys Tyr Ser
-5                  1               5                   10

cag cat gat ctg ctg atg gac tag 291
Gln His Asp Leu Leu Met Asp



<210> 652
<211> 766
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 166..333

<400> 652

aaaaggagta gctattagcc aattcggcag ggcccgcttt ttagaagctt gatttcyttt 60
gaagatgaaa gactagcgga agctctgcct ctttccccag tggcgaggga actcggggcg 120
attggctggg aactgtatcc acccaaatgt caccgatttc ttcct atg cag gaa atg 177
                                                  Met Gln Glu Met
                                                  1

agc aga ccc atc aat aag aaa ttt ctc agc ctg gcc gaa aat ggt tgg 225
Ser Arg Pro Ile Asn Lys Lys Phe Leu Ser Leu Ala Glu Asn Gly Trp
5                   10                  15                  20

ccc cac gaa gcc acg aca act gga ggc aaa gag ggt tgc tca acg ccc 273
Pro His Glu Ala Thr Thr Thr Gly Gly Lys Glu Gly Cys Ser Thr Pro
                25                  30                  35

cgc ctc att gga aaa cca aat cag atc tgg gac cta tat agc gtg gcg 321
Arg Leu Ile Gly Lys Pro Asn Gln Ile Trp Asp Leu Tyr Ser Val Ala
            40                  45                  50

gag gcg ggg cga tgattgtcgc gctcgcaccc actgcagctg cgcacagtcg 373
Glu Ala Gly Arg
        55

catttctttc cccgcccctg agaccctgca gcaccatctg tcatggcggc tgggctgttt 433
ggtttgagcg ctcgccgtct tttggcggca gcggcgacgc gagggctccc ggccgcccgc 493
gtccgctggg aatctagctt ctccaggact gtggtcgccc cgtccgctgt ggcggaaagc 553
ggcccccaga accgaccaca ccgtgcaaga ggacccagaa cccgaggacg aaaacttgta 613
tgagaagaac ccagactccc atggttatga caaggacccc gttttggacg tctggaacat 673
gcgacttgtc ttcttctttg gcgtctccat catcctggtc cttggcagca cctttgtggc 733
ctatctgcct gactacagga tgaaagagtg ggt 766

<210> 653
<211> 488
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 199..357

<400> 653

cagtaggata catagctttt cttccagtga aacaaagttc atatcatcca ttgtttttca 60
agcacgtgac accagcctca aagtaaatga catgaccagt ggttgaacag tctaattttc 120
aaatttaata tagagcatat aacttctgat ttgatagtat ttattttaaa aaattatgtt 180
ttcatcattc atttgaaa atg aaa aag ccc caa agt gag aac ttt ggg gga 231
                    Met Lys Lys Pro Gln Ser Glu Asn Phe Gly Gly
                    1               5                   10

ggg cct aga aca tgg ata gat ctc tta gtg gtc ttt cca aaa gta cat 279
Gly Pro Arg Thr Trp Ile Asp Leu Leu Val Val Phe Pro Lys Val His
            15                  20                  25

gta ctt gaa ata ttt tca tta tca tac tat tct ttg aaa aaa aag atg 327
Val Leu Glu Ile Phe Ser Leu Ser Tyr Tyr Ser Leu Lys Lys Lys Met
        30                  35                  40

ctt act gta tac ttg ttt tca agc atc ctc taaaatcaaa ggttttgatc 377
Leu Thr Val Tyr Leu Phe Ser Ser Ile Leu
    45                  50

acaatatgca gatttctctt gatagatact taaataggct atttctctcc tcttcttggg 437
caatgccttg ttttctcctc tgaatatttg catttgaaag gattgcttcc t 488

<210> 654
<211> 489
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 253..426

<400> 654

aacctcacga ggccttacgt gcctagagcg gttcctctcc gtggcgccct accgccttcg 60
gcgcgaaggt ggctggtgcg taccgtcccg aaccctagga gccgagatcc cacccgcagg 120
accccaaaac ccaatgatcc tgcagcagcc cttgcagcga ggcccccagg gaggggccag 180
cgcctcccgc gggccgcctt gggggtgact tggggcctgg acgccagctc csctctscga 240
ggagctgtgc cc atg agc acc aag cgg cgc ctg gag gag gag cag gag cct 291
              Met Ser Thr Lys Arg Arg Leu Glu Glu Glu Gln Glu Pro
              1               5                   10

ctg cgc aag cag ttt ctg tct gag gag aac atg gcc acc cac ttc tct 339
Leu Arg Lys Gln Phe Leu Ser Glu Glu Asn Met Ala Thr His Phe Ser
    15                  20                  25

caa ctc agc ctg cac aat gac caa ccc cta ctg cag ccc cac cat gac 387
Gln Leu Ser Leu His Asn Asp Gln Pro Leu Leu Gln Pro His His Asp

ctctggcgct atcctggcag cctcatcctg aggcctccgt ctgctgaggc tgg 489

<210> 655
<211> 616
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 205..387

<400> 655

acttccgggg gagcggcgcg gcggcgggga ggatctctca ccccatcact cagggtggcg 60
caatcacgac tcattggctc actgcagcct agacctccca gctggagcaa ttctcctgcc 120
tcagccttct gagtagctgg gactacagtt ggttctaaag agtggtgagt cagaagagac 180
gtcaggcagc aagcgacttg ggcc atg gcc tct gac cta gac ttc tca cct 231
                           Met Ala Ser Asp Leu Asp Phe Ser Pro

ccg gag tgc ccg agc cca ctt tcc tgg aga acc tgc tac ggt acg gac 279
Pro Glu Cys Pro Ser Pro Leu Ser Trp Arg Thr Cys Tyr Gly Thr Asp
10                  15                  20                  25

tct tcc tgg gag cca tct tcc agc tca tct gtg tgc tgg cca tca tcg 327
Ser Ser Trp Glu Pro Ser Ser Ser Ser Ser Val Cys Trp Pro Ser Ser
                30                  35                  40

tac cca         agt ccc acg agg cgg agg ctg aac cgt ctg agc cca 375
Tyr Pro Phe Pro Ser Pro Thr Arg Arg Arg Leu Asn Arg Leu Ser Pro
            45                  50                  55

    gtg ctg agg tgacgaggaa gcccaaggct gctgttcctt ctgtgaacaa 427
Glu Val     Arg
        60

gaggcccaag aaagagacta agaagaagcg gtagaagagg aggcctgagg actgggcgkg 487
cagggagagg gtcttkgggg acagcctcct gggaatctac attgtgtttc ccccgcatty 547
ccaggctcag gttctgarga rgctgtgacg ccctatgacc gcagagatct agacagtcgt 607
aacagtccc 616

<210> 656
<211> 508
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 37..219

<400> 656

ctttcccctg ccggtgtcct gctcgccgtc cccgcc atg ctg tct cta gac ttt 54
                                        Met Leu Ser Leu Asp Phe
                                        1               5

ttg gac gat gtg cgg cgg atg aac aag cgg cag ctc tat tat caa gtc 102
Leu Asp Asp Val Arg Arg Met Asn Lys Arg Gln Leu Tyr Tyr Gln Val
            10                  15                  20

cta aat ttt gga atg att gtc tca tcg gca cta atg atc tgg aag ggg 150
Leu Asn Phe Gly Met Ile Val Ser Ser Ala Leu Met Ile Trp Lys Gly
        25                  30                  35

tta atg gta ata act gga agt gaa agt ccg att gta gtg gtc tca ggc 198
Leu Met Val Ile Thr Gly Ser Glu Ser Pro Ile Val Val Val Ser Gly
    40                  45                  50

aaa atg ggc ata tca agt ttt tgaccaaagg agataataat gcggttgatg 249
Lys Met Gly Ile Ser Ser Phe
55                  60

accgaggcct ctataaacaa ggacaacatt ggctagagaa aaaagatgtt gtggggagag 309
ccaggggatt tgttccttat attggaattg tgacgatcct catgaatgac tatcctaaat 369
ttaagtatgc agttctcttt ttgctgggtt tattcgtgct ggttcatcgt gagtaagaag 429
cctgccttgc tgttcctggg aagatgccat agttttcgtt actggatgtt tggagtagat 489
actggtctgt gattggtgg 508



<210> 657
<211> 500
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 114..452

<400> 657

ttttcccctg ccggtgtcct gctcgccgtc cccgccatgc tgtctctaga ctttttggac 60
gatgtgcggc ggatgaacaa gcggcastct attatcaagt cctaaatttt gga atg 116
                                                           Met

att gtc tca tcg gca cta atg atc tgg aag ggg tta atg gta ata act 164
Ile Val Ser Ser Ala Leu Met Ile Trp Lys Gly Leu Met Val Ile Thr
            5                   10                  15

gga agt gaa agt ccg att gta gtg gtg ctc agt ggc agc atg gaa cct 212
Gly Ser Glu Ser Pro Ile Val Val Val Leu Ser Gly Ser Met Glu Pro
        20                  25                  30

gca ttt cat aga gga gat ctt ctc ttt cta aca aat cga gtt gaa gat 260
Ala Phe His Arg Gly Asp Leu Leu Phe Leu Thr Asn Arg Val Glu Asp
    35                  40                  45

ccc ata cga gtg gga gaa att gtt gtt ttt agg ata gaa gga aga gag 308
Pro Ile Arg Val Gly Glu Ile Val Val Phe Arg Ile Glu Gly Arg Glu
50                  55                  60                  65

att cct ata gtt cac cga gtc ttg aag att cat gaa aaa ttt gtt cct 356
Ile Pro Ile Val His Arg Val Leu Lys Ile His Glu Lys Phe Val Pro
                70                  75                  80

tat att gga att gtg acg atc ctc atg aat gac tat cct aaa ttt aag 404
Tyr Ile Gly Ile Val Thr Ile Leu Met     Asp Tyr Pro Lys Phe Lys
            85                  90                  95

tat gca gtt ctc ttt ttg ctg ggt tta ttc gtg ctg gtt cat cgt gag 452
Tyr Ala Val Leu Phe Leu Leu Gly Leu Phe Val Leu Val His Arg Glu
        100                 105                 110

taagaagcct gccttgctgt tcctggggaa gatgccatag ttttcgtt 500

<210> 658
<211> 411
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 90..305

<220>
<221> misc_feature
<222> 138,315,326,343,360,377
<223> n=a, g, c or t

<400> 658

ttgggcagaa aaattcaagc aggagattgt atttcttttg gagttgtacg atttccttta 60
ttatttgaac tgcagtaaag aaagctggg atg ggc tcc tct agg gat act     113
                                Met Gly Ser Ser Arg Asp Thr Ser
                                1               5

aga tcc ctg ggc ggt tgt agc cct ngc tcc tct tta aat gga ttt ggt 161
Arg Ser Leu Gly Gly Cys Ser Pro Xaa Ser Ser Leu Asn Gly Phe Gly
    10                  15                  20

ttc aaa gac gat cat ctc cgt ctt ctc gga tgt cat agt gcc     gat 209
Phe Lys Asp Asp His Leu Arg Leu Leu Gly Cys His Ser Ala Thr Asp
25                  30                  35                  40

cat ctc cag ctc ctk gcc acc ctg ggc ttt ctc cac ttt tgc ctc tat 257
His Leu Gln Leu Leu Ala Thr Leu Gly Phe Leu His Phe Cys Leu Tyr
                45                  50                  55

gtt ttg ctt ctc cam cgt ctt agc cac gat atc tac ctc tct gtc atg 305
Val Leu Leu Leu Xaa Arg Leu Ser His Asp Ile Tyr Leu Ser Val Met
            60                  65                  70

tgatgtgacn cttgtttttt naaccaagag ttgccctnaa gstcccttaa aaaanagctg 365
atcttactgg snttcttttg taaarctcct gtgatagatg cagatt 411



<210> 659
<211> 529
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 235..444

<400> 659

atttccagcg gttgctggtt ctgacgggtt gtagtctgcc aggacaatga gttatgacta 60
ccatcagaac tggggccgtg atgggggtcc ccgcagctcc ggtgggggct atggaggggg 120
gccagcaggg ggtcatggag gtaaccgagg ctccggagga grcggcggcg gcggaggggg 180
tggtcgaggc ggcaggggcc ggcatcccgg gcacctgaaa gccgcgaaat cggc atg 237
                                                            Met

tgg tac gcg aaa aaa cag ggg cag aag aac aag gaa gcg gag agg caa 285
Trp Tyr Ala Lys Lys Gln Gly Gln Lys Asn Lys Glu Ala Glu Arg Gln
            5                   10                  15

gag aga gct gta gta cac atg gat gaa cga cga gaa gaa caa att gta 333
Glu Arg Ala Val Val His Met Asp Glu Arg Arg Glu Glu Gln Ile Val
        20                  25                  30

cag tta ctg aat tct gtt caa gcg aag aat gat aaa gag tca gaa gca 381
Gln Leu Leu Asn Ser Val Gln Ala Lys Asn Asp Lys Glu Ser Glu Ala
    35                  40                  45

cag ata tcc acc cca gac ttg gct gtc cgc tct gcg cct tcg ggc cag 429
Gln Ile Ser Thr Pro Asp Leu Ala Val Arg Ser Ala Pro Ser Gly Gln
50                  55                  60                  65

ccc ttt cgg gct tgc tgagcctgga ggccgakgag aatgcacttc cgggttttgc 484
Pro Phe Arg Ala Cys
                70

tgaggctctg aggagctacc aggaggctgc ggctgcaggc acctt 529

<210> 660
<211> 478
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 128..286

<400> 660

gttctttggc cctgtgacac gtagcaacgg ggctggttca gggtctgaaa cagagtttgg 60
gggttgtttg ggattagtga agctactgcc tttgccgcca gcgcagctca gagtttgatt 120
atttgca atg tca ggc ttt gaa aac tta aac acg gat ttc tac cag aca 169
        Met Ser Gly Phe Glu Asn Leu Asn Thr Asp Phe Tyr Gln Thr
        1               5                   10

agt tac agc atc gat gat cag tca cag cag tcc tat gat tat gga gga 217
Ser Tyr Ser Ile Asp Asp Gln Ser Gln Gln Ser Tyr Asp Tyr Gly Gly
15                  20                  25                  30

agt gga gga ccc tat aac agt atg ctg gct atg act att cgc agc aag 265
Ser Gly Gly Pro Tyr Asn Ser Met Leu Ala Met Thr Ile Arg Ser Lys
                35                  40                  45

gca gat ttg tcc ctc cag aca tgatgcagcc acaacagcca tacaccgggc 316
Ala Asp Leu Ser Leu Gln Thr
            50

agatttacca gccaactcag gcatatactc cagcttcacc tcagcctttc tatggaaaca 376
actttgagga tgagccacct ttattagaag agttaggtat caattttgac cacatctggc 436
aaaaaacact aacagtatta catccgttaa aagtagcaga tg 478



<210> 661
<211> 456
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 110..277

<400> 661

tacggtttta ttattatatc gatttatttg tgaggaggga atctgttcat gtagatttta 60
aaagtgcaag agtgtccagc gtcatctgcc tctcctgtcc cctgaccac atg att ttc 118
                                                      Met Ile Phe
                                                      1

tgt tcc cat aaa cta gca ctg ttg agt ata ttt cca gag cat act cca 166
Cys Ser His     Leu Ala     Leu Ser Ile Phe Pro Glu His Thr Pro
    5                   10                  15

ggc gtg tgt aaa tac agc ctt cca ctc ctg cac aca cac cgt gtt ttt 214
Gly Val Cys Lys Tyr Ser Leu Pro Leu Leu His Thr His Arg Val Phe
20                  25                  30                  35

aca tca gta aca ttc tgc gca tat tgt tca gca cca tgc ctt ttc cac 262
Thr Ser Val Thr Phe Cys Ala Tyr Cys Ser Ala Pro Cys Leu Phe His
                40                  45                  50

tta aca gta ata att taaaggtc ;at tccatttcag tacataaagj a gcgtcctcag 317
Leu Thr Val Ile Ile
            55

gccgggcgcg gtggctcgca cctgtaatcc tagcactttg ggaggctgag gcaggtggat 377
cacctgaggt tggaagttga agaccagcct ggccaaaagg gcgaaaccct gtctctactg 437
aaaatagaaa aattagctc 456



<210> 662
<211> 506
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 71..379

<400> 662

acagctgccc gggactccag tgatcgccgc ggctcgctcg cgccccggaa actgcccctt 60
ctcgggggtc atg atg ggc agc aag atg gcg tct gct agt agg gtc gtt 109
           Met Met Gly Ser Lys Met Ala Ser Ala Ser Arg Val Val
           1               5                   10

cag gta gtc aaa cca cac act cca tta ata agg ttt cct gac aga aga 157
                                        Arg Phe Pro Asp Arg
    15                  20                  25

gac aat cct aaa ccc     gta     gaa gct ttg aga tca gca ggg cta 205
Asp Asn Pro Lys Pro Asn Val Ser Glu Ala Leu Arg Ser Ala Gly Leu
30                  35                  40                  45

cca tct cac tct tct gta att tca caa cat tct aaa gga agt aaa tca 253
Pro Ser His Ser Ser Val Ile Ser Gln His Ser Lys Gly Ser Lys Ser
                50                  55                  60

cca gat ttg ctg atg tat cag ggt cca cca gac act gca gaa ata ata 301
Pro Asp Leu Leu Met Tyr Gln Gly Pro Pro Asp Thr Ala Glu Ile Ile
            65                  70                  75

aaa aca tta cct cag aaa tac aga agg aaa ctt gtg tct caa gaa gaa 349
Lys Thr Leu Pro Gln Lys Tyr         Lys Leu Val Ser Gln Glu Glu
        80                  85                  90

atg gaa ttt atc caa cgt gga ggt cct gaa taaccatggt ggctgctgtt 399
Met Glu Phe Ile Gln Arg Gly Gly Pro Glu
    95                  100

tgtcatcaga caatagaatt gtctttacaa taaaggactt ccaaaatgac agatgagaaa 459
ctgtatatta aacaccttta ataaatatta tgaaaaaaam haawaaa 506



<210> 663
<211> 595
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 207..578

<220>
<221> misc_feature
<222> 19
<223> n=a, g, c or t

<400> 663

atctcgcgtc cccaacggnc ccccgggtcg gtttccgcgg tggccatgac tgcggccgtg 60
ttcttcggct gcgcttcatt gccttcgggc ctgcgctcgc cctttatgtc ttcaccatcg 120
ccaycgagcc gttgcgtatc atcttcctca tcgccggagc tttcttctgg ttggtgtctc 180
tactgatttc gtcccttgtt tggttc atg gca aga gtc att att gac aac aaa 233
                             Met Ala Arg Val Ile Ile Asp Asn Lys
                             1               5

gat gga     aca cag aaa tat ctg ctg atc ttt gga gcg ttt gtc tct 281
Asp Gly Pro Thr Gln Lys Tyr Leu     Ile Phe Gly Ala Phe Val Ser
10                  15                  20                  25

gtc tat atc     gaa atg ttc cga ttt gca tat tat aaa     tta     329
Val Tyr Ile Gln Glu Met Phe Arg Phe Ala Tyr Tyr Lys Leu Leu Lys
                30                  35                  40

        agt gaa ggt ttg aag agt ata aac cca ggt gag     gca ccc 377
Lys Ala Ser Glu Gly Leu Lys Ser Ile     Pro Gly Glu Thr Ala Pro
            45                  50                  55

tct atg cga ctg ctg gcc tat gtt tct ggc ttg ggc ttt gga atc atg 425
Ser Met Arg Leu     Ala Tyr Val Ser Gly     Gly Phe Gly Ile Met
        60                  65                  70

agt gga gta ttt tcc ttt gtg aat acc cta tct gac tcc ttg ggg cca 473
Ser Gly Val Phe Ser Phe Val Asn Thr Leu Ser Asp Ser Leu Gly Pro
    75                  80                  85

ggc aca gtg ggc att cat gga gat tct cct caa ttc ttc ctt tta ttc 521
Gly Thr Val Gly Ile His Gly Asp Ser Pro Gln Phe Phe Leu Leu Phe
90                  95                  100                 105

agc ttt cat gac gct ggt cat tat ctt gct gca tgt att ctg ggg cat 569
Ser Phe His Asp Ala Gly His Tyr Leu Ala Ala Cys Ile Leu Gly His
                110                 115                 120

tgt att ttt tgatggctgt c        595
Cys Ile Phe



<210> 664
<211> 487
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 41..226

<400> 664

gatacggttc ctccaccgag gcccatgcga agtttccact atg gct tcc agc act 55
                                            Met Ala Ser Ser Thr
                                            1               5

gtc ccg gtg agc gct gct ggc tcg gct aat gaa act ccc gaa ata ccg 103
Val Pro Val Ser Ala Ala Gly Ser Ala     Glu Thr Pro Glu Ile Pro
                10                  15                  20

gac aac gtg gga gat tgg ctt cgg ggc gtc tac cct ttg cca ctg ata 151
Asp Asn Val Gly Asp Trp Leu Arg Gly Val Tyr Pro Leu Pro Leu Ile
            25                  30                  35

gga atg act tcc gga gaa ctt gat act aaa ttt ggg act ctt tgc tgc 199
Gly Met Thr Ser Gly Glu Leu Asp Thr Lys Phe Gly Thr Leu Cys Cys
        40                  45                  50

ggg agt ttg gct ggc cag gaa ctt gag tgacattgac ctcatggcac 246
Gly Ser Leu Ala Gly Gln Glu Leu Glu
    55                  60

ctcagccagg ggtgtagcca agtagacaaa tggaatcctg tgctgaaccc gaatcttcca 306
aaaaacagcc tacaatctgt gaccaccaca agatgtgccc tgatggcagc tgaagtttga 366
ttcagatggg cacttttctt ccccttccct gcctagtttc cttttgttcc ttgagtccac 426
gcagaattcc attctctggt cagcagacag gcttaagcta aagtattgca tctattctgt 486
a 487



<210> 665
<211> 415
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 54..248

<400> 665

acttctgacc tgcggcccgt agtccgagcc ggggacggcg gcgtcggtgg gtc atg 56
                                                           Met
                                                           1

ctc cgg cac tcc ccc tcg ctg tgg gag ctg gtg gag gag cac gtt ccg 104
Leu Arg His Ser Pro Ser Leu Trp Glu Leu Val Glu Glu His Val Pro
            5                   10                  15

ctc cgg gag cga cgc gaa gtg aag agg att ctg ggg gga ggc ggc ggt 152
Leu Arg Glu Arg Arg Glu Val Lys Arg Ile Leu Gly Gly Gly Gly Gly
        20                  25                  30

gga cct gag cct gga gct gcg ggc gga ggt ggc gat gtt acg ggc act 200
Gly Pro Glu Pro Gly Ala Ala Gly Gly Gly Gly Asp Val Thr Gly Thr
    35                  40                  45

gct cca aga ggc tcg atc ctc tca agc ccc cag ctc ccg ccc cat ctc 248
Ala Pro Arg Gly Ser Ile Leu Ser Ser Pro Gln Leu Pro Pro His Leu
50                  55                  60                  65

tgacccctct tctcttctgg caccaccgcc tctcctaaag gacctcttgc gccaggagct 308
ccggcagttg ctccagggtc tccgccacaa agccatctgt gagggcaggg accaggscca 368
agcttgggtc cagtatagcc cccarggtcc tgcactttgc cttggag 415

<210> 666
<211> 487
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 13..432

<400> 666

agtgaggaaa tc atg ggt cag aat cga gag atg ctt ccg ttt tgg atg aac 51
              Met Gly Gln Asn Arg Glu Met Leu Pro Phe Trp Met Asn
              1               5                   10

agc acc ggc agg cgg gaa ggt tgg cag cgt gga tgg cac ggc tac gac 99
Ser Thr Gly Arg Arg Glu Gly Trp Gln     Gly Trp His Gly Tyr Asp
    15                  20                  25

aac gag ctc atg gac atg cgg ggc atc ttc ctg gcc ttc gga cct gat 147
Asn Glu Leu Met Asp Met Arg Gly Ile Phe Leu Ala Phe Gly Pro Asp
30                  35                  40                  45

ttc aaa tcc aac ttc aga gct gct cct atc agg tcg gtg gac gtc tac 195
Phe Lys Ser Asn Phe Arg Ala Ala Pro Ile Arg Ser Val Asp Val Tyr
                50                  55                  60

aat gtc atg tgc aat gtg gtg ggc atc acc ccg ctg ccc aac aac gga 243
Asn Val Met Cys Asn Val Val Gly Ile Thr Pro Leu Pro Asn Asn Gly
            65                  70                  75

tcc tgg tcc agg gtg atg tgc atg ctg aag ggc cgc gcg gca ctg ccc 291
Ser Trp Ser     Val Met Cys Met Leu Lys Gly Arg Ala Ala Leu Pro
        80                  85                  90

cgc ctg tct ggc cca gcc act gtg ccc tgg cat gat tct tct ctt cct 339
Arg Leu Ser Gly Pro Ala Thr Val Pro Trp His Asp Ser Ser Leu Pro
    95                  100                 105

gct tgc ata act gat cat att gct tgt ctc aga aaa aaa cac cat cag 387
Ala Cys Ile Thr Asp His Ile Ala Cys     Arg     Lys His His Gln
110                 115                 120                 125

caa agt ggg cct cca aag cca gat gat ttt cat ttt atg tgt gaa 432
Gln Ser Gly Pro Pro     Pro Asp Asp Phe His Phe Met Cys Glu
                130                 135                 140

taatagcttc attaacacaa tcaagaccat gcacattgta aatacattat tcttg 487



<210> 667
<211> 476
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 33..218

<400> 667

gaacaagcgt ccctgatcca gaaggtgttc ag atg gag atg gcg agt tct gct 53
                                    Met Glu Met Ala Ser Ser Ala
                                    1               5

ggc tcc tgg ctc tct ggc tgc ctc atc cct ctc gtc ttc ctc cgg ctg 101
Gly Ser Trp Leu Ser Gly Cys Leu Ile Pro Leu Val Phe Leu Arg Leu
        10                  15                  20

tct gtg cat gtg tca ggc cac gca ggg gat gcc ggc aag ttc cac gtg 149
Ser Val His Val Ser Gly His Ala Gly Asp Ala Gly Lys Phe His Val
    25                  30                  35

gcc cta cta ggg ggc aca gcc gag ctg ctc tgc cct ctc tcc ctc tgg 197
Ala Leu Leu Gly Gly Thr Ala Glu Leu Leu Cys Pro Leu Ser Leu Trp
40                  45                  50                  55

ccc ggg acg tac cca agg agg tgaggtggct gcggtcccca ttcccgcagc 248
Pro Gly Thr Tyr Pro Arg Arg
                60

gctcccaggc tgttcacata ttccgggatg ggaaggacca ggatgaagat ctgatgccgg 308
aatataaggg gaggacggtg ctagtgagag atgcccaaga gggaagtgtc actctgcaga 368
tccttgacgt gcgccttgag gaccaagggt cttaccgatg tctgatccaa gttggaaatc 428
tgagtaaaga ggacaccgtg atcctgcagg ttgcagcccc atctgtgg 476

<210> 668 
<211> 475 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 90..431

<400> 668 

gtattatgat gtcgtataat gtaacattct gaaaatacaa catgtccacc taaatttcaa 60 



agggctgaaa aaaatttctt gaaagatga 


atg 


gac 


att 


gtg 


gee 


eat 


tgt 


gag 


113 
















Met 

1 


Asp 


Tie 


Val 


Ala 
5 


His 


Cys 


Glu 




aaa 




atg 


gca 


gag 


tea 


gtt aag 




eat 


eag 


ttt 


age 


act 


tet 


gaa 


161 


Lys 


Ala 


Met 


Ala 


Glu 


Ser 


Val Lys 


Lys 


His 


Gin 


Phe 


Ser 


Thr 


Ser 


Glu 






10 










15 








20 












tct 


tat 


ett 


cea 


gca 


act 


tta ata 


ctg 


aac 


age 


ata 


tec 


ttt 


aat 


act 


209 


Ser 


Tyr 


Leu 


Pro 


Ala 


Thr 


Leu He 


Leu 


Asn 


Ser 


He 


Ser 


Phe 


Asn 


Thr 




25 










30 








35 










40 




gat 


ggc 


tec 


ata 


cag 


tat 


aaa gaa 


aag 


ctg 


tat 


ttc 


tct 


gcc 


tec 


gaa 


257 


Asp 


Gly 


Ser 


He 


Gin 


Tyr 


Lys Glu 


Lys 


Leu 


Tyr 


Phe 


Ser 


Ala 


Ser 


Glu 












45 








50 










55 






gcc 


eta 


gac 


get 


tac 


att 


gat gat 


ttt 


cac 


tta 


aac 


tat 


gag 


cct 


cct 


305 


Ala 


Leu 


Asp 


Ala 


Tyr 


He 


Asp Asp 


Phe 


His 


Leu 


Asn 


Tyr 


Glu 


Pro 


Pro 










60 








65 










70 








gat 


att 


gat 




aag 


gtt 




gat 


cag 


agt 


cct 


Gtt 


gaa 


ttt 


ctt 


353 


Asp 


lie 


Asp 


Thr 


Lys 


Val 


Asn Leu 


Asp 


Gin 


Ser 


Pro 


Leu 


Glu 


Phe 


Leu 








75 








80 










85 










get 






aat 


agt 


ggg 


gtc cat 


gga 


ate 


gca 


cea 


ggc 


caa 




ggc 


401 


Ala 


Lys 


Ser 


Asn 


Ser 


Gly 


Val His 


Gly 


He 


Ala 


Pro 


Gly 


Gin 


Ser 


Gly 






90 










95 








100 












atg 


gag 


aag 


ate 


ata 


cag 


tgg tat 


got 


gca 


taacaaggtt tctgtcagca 


451 


Met 


Glu 


Lys 


He 


He 


Gin 


Trp Tyr 


Ala 


Ala 
















105 










110 























acaaattget tatatgatgg tgge 475 

<210> 669 
<211> 578 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 411..575

<400> 669 

gagcaagccc tggtggcagc gcagggtcca gtgcagcccc tccccacagc atgctggggg 60
ctaattctga tgtcatcttt ctgcagaaaa ccattagacc atccctccag actgccaccc 120

tcaaagccgt ctgcccaggc cccatctgac actcttgaca tctgcaggtc ccagacccta 180 

tgatgtgtcc actctggagg ctcctcatct tcctcgggtt gctggccttg cccttggcac 240 

cacacaagca gccttggcct ggcctggccc aagcccacag agacaacaaa tccaccctgg 300 

caagaagtcc ttcgataaca acatcgtaaa gatgtgtgca catatgagca tcgttgtgga 360 

gttctggctg gagaaagacg agtttggccg gaggatctgg tgataggcaa atg cga 416 

Met Arg 



tgc aga 




age agt 


gte 


eat 


gtg 


gee 


ate 


cte 


aet 


gar 


get 


ate 


cea 


464 


Cys Arg 


Pro 
5 


Ser Ser 


Val 


His 


Val 
10 


Ala 


He 


Leu 


Thr 


Glu 
15 


Ala 


He 


Pro 




cca aag 


atg 


ate agt 


tte 


tet 


aea 


ame 


tea 


aag 


aga 


ate 


tte 


aaa 


aak 


512 


Pro Lys 


Met 


He Ser 


Phe 


Ser 


Thr 


Xaa 


Ser 


Lys 


Arg 


He 


Phe 


Lys 


Xaa 




20 








25 










30 












tct cca 


a am 


atg kta 


aaa 


agt 


eag 


gta 


tgt 


cet 


ctg 


ate 


ggt 


gaa 


ate 


560 


Ser Pro 


Xaa 


Met Xaa 


Lys 


Ser 


Gin 


Val 


Cys 


Pro 


Leu 


He 


Gly 


Glu 


He 




35 






40 










45 










50 




teg ggc 


age 


tgg atg 


tga 






















578 


Ser Gly 


Ser 


Trp Met 



























55 



<210> 670 
<211> 467 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 62..430



<400> 670 

atttccggtt ccggtgtcag ttcgaggcgc cgccgccgcc gccgcagccg ccggagccgc 60
a atg cct aaa gga gga aga aag gga ggc cac aaa ggc cgg gcg agg cag 109
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg Gln
1 5 10 15



tat 


aea 


age 


cet 


gag gag ate 


gac 


geg 


cas 


tgc 


agg 


ctg 


aga 


age 


aga 


157 


Tyr 


Thr 


Ser 


Pro 
20 


Glu Glu He 


Asp 


Ala 
25 


Xaa 


Cys 


Arg 


Leu 


Arg 
30 


Ser 


Arg 




agg 


cca 


ggg 


aag 


aag agg age 


aaa 


aaa 


ggt 


gga 


gat 


ggg 


get 


gca 


ggt 


205 


Arg 


Pro 


Gly 

35 


Lys 


Lys Arg Ser 


Lys 
40 


Lys 


Gly 


Gly 


Asp 


Gly 
45 


Ala 


Ala 


Gly 




gac 


cce 


aaa 


aag 


gag aag aaa 


tct 


eta 


gac 


tea 


gat 


gag 


agt 


gag 


gat 


253 


Asp 


Pro 
50 


Lys 


Lys 


Glu Lys Lys 
55 


Ser 


Leu 


Asp 


Ser 


Asp 
60 


Glu 


Ser 


Glu 


Asp 




gaa 


gaa 


gat 


gac 


tac eag caa 




ego 




ggc 


gtt 


gaa 


ggg 




ate 


301 


Glu 


Glu 


Asp 


Asp 


Tyr Gin Gin 


Lys 


Arg 


Lys 


Gly 


Val 


Glu 


Gly 




He 




65 








70 








75 










80 




gac 


ate 


gag 




ccc aac egg 


gtg 


gca 


eag 


aea 


ace 


aaa 


aag 


gtc 


aca 


349 


Asp 


He 


Glu 




Pro Asn Arg 


Val 


Ala 


Gin 


Thr 


Thr 


Lys 


Lys 


Val 


Thr 










85 






90 










95 






caa 


ctg 


gat 


ctg 


gac ggg cca 


agg 


age 


ttt 


cga 


gga 


gag 


aac 


gag 


aag 


397 


Gin 




Asp 


100 


Asp Gly Pro 


Arg 


Ser 
105 


Phe 


Arg 


Gly 


Glu 


110 


Glu 


Lys 




aga 


ttg 


aga 


age 


aga agg caa 




age 


gtt 


aca 


tgaaaatgea ettggecggg 


450 


Arg 


Leu 


Arg 


Ser 


Arg Arg Gin 




Ser 


Val 


Thr 















115 120 
aagacagage aageeaa 467 



<210> 671 
<211> 468 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS
<222> 63..431

<400> 671 

agaacatcct aatcrwaaaa gttcgaggcg ccgccgccgc cgccgcagcc gccggagccg 60 
ca atg cct aaa gga gga aga aag gga ggc cac aaa ggc cgg gcg agg 107
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg
1 5 10 15



cag 


tat 


aca 


age 


eet 


gag 


gag 


ate 


gae 


gcg 


eas 


tge 


agg 


etg 


aga 


age 


155 


Gin 


Tyr 


Thr 


Ser 


Pro 

20 


Glu 


Glu 


He 


Asp 


Ala 

25 


Xaa 


Cys 


Arg 


Leu 


Arg 

30 


Ser 




aga 


agg 


oca 


ggg 


aag 


aag 


agg 


age 


aaa 


aaa 


ggt 


gga 




ggg 






203 








35 










40 








Asp 


Gly 
45 


Ala 


Ala 




ggt 


gac 


ccc 




aag 


gag 


aag 


aaa 


tct 


eta 


gae 


tea 




gag 


agt 


gag 


251 


Gly 


Asp 


Pro 

50 


Lys 


Lys 


Glu 


Lys 


Lys 
55 


Ser 


Leu 


Asp 


Ser 


Asp 
60 


Glu 


Ser 


Glu 




gat 




gaa 


gat 


gae 


tae 


eag 


eaa 


aag 


ege 


aaa 


ggc 


gtt 


gaa 


ggg 


etc 


299 


Asp 


Glu 


Glu 


Asp 


Asp 


Tyr 


Gin 


Gin 


Lys 


Arg 


Lys 


Gly 


Val 


Glu 


Gly Leu 






65 










70 










75 












ate 


gac 


ate 


gag 


aae 


ece 


aae 


egg 


gtg 


gea 


eag 


aca 


ace 


aaa 


aag 


gte 


347 


He 


Asp 


He 


Glu 


Asn 


Pro 


Asn 




Val 


Ala 


Gin 


Thr 


Thr 


Lys 


Lys 


Val 




80 










85 










90 










95 




aca 


caa 


etg 


gat 


etg 


gae 


ggg 


eca 


agg 


age 


ttt 


ega 


gga 


gag 


aae 


gag 


395 


Thr 


Gin 


Leu 


Asp 


Leu 

100 


Asp 


Gly 


Pro 


Arg 


Ser 
105 


Phe 


Arg 


Gly 


Glu 


Asn 
110 


Glu 




aag 


aga 


ttg 


aga 


age 


aga 


agg 


eaa 


aag 


age 


gtt 


aca 


tgaaaatgca 




441 


Lys 


Arg 


Leu 


Arg 


Ser 


Arg 


Arg 


Gin 


Lys 


Ser 


Val 


Thr 













115 120 



cttggccggg aagacagagc aagccaa 468 

<210> 672 
<211> 527 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 85..327

<400> 672 

cttttctatg tcaattttct ggcaacaaag tctgtggcca agaaagaagc taccgactcc 60
ttcgaccaca aaaagttctt ccaa atg gtc ggc ctg aag aaa aag agt gcg 111

Met Val Gly Leu Lys Lys Lys Ser Ala 

1 5 





gat 


gtg 


aag 


aag 


gtg 


ttt cac 


atg 


etg 


gac 


aag 


gac 


aaa 


agt 


ggc 


159 


Asp 


Asp 


Val 


Lys 




Val 


Phe His 


Met 


Leu 


Asp 


Lys 


Asp 




Ser 


Gly 




10 










15 








20 










25 




ttc 


ate 


gag 


gag 


gat 


gag 


etg gga 


ttc 


ate 


eta 




ggc 


ttc 


tec 


eca 


207 


Phe 


He 


Glu 


Glu 


Asp 
30 


Glu 


Leu Gly 


Phe 


He 
35 


Leu 


Lys 


Gly 


Phe 


Ser 
40 


Pro 




gat 


gee 


aga 


gac 


etg 


tct 


get aaa 


gaa 


ace 


aag 


atg 


etg 


atg 


get 


get 


255 


Asp 


Ala 


Arg 


Asp 


Leu 


Ser 


Ala Lys 


Glu 


Thr 


Lys 


Met 


Leu 


Met 


Ala 


Ala 








45 








50 










55 








gga 


gac 


aaa 


gat 


ggg 


gac 


ggc aaa 


att 


ggg 


gtt 


gac 


gaa 


ttc 


tec 


act 


303 


Gly 


Asp 


Lys 

60 


Asp 


Gly 


Asp 


Gly Lys 
65 


He 


Gly 


Val 


Asp 


Glu 
70 


Phe 


Ser 


Thr 




etg 


gtg 






ast 


aag 


aag cac 


tgactgcccc tggtcttcca cctctctgcc 


357 




Val 


Ala 


Glu 


Xaa 


Lys 


Lys His 





















75 80 



ctgaacaccc aatctcggcc cctcttgcca ccctcctgca tttctgttca gttcgtttat 417
gttatttttt actcccccat cccctgtggc cctctaatga caccattctt ctggaaaatg 477
ctggagaagc aataaaggtt gtaccagtca aaaaaaaaaa aaaatgcgaa 527

<210> 673 
<211> 594 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 161..589

<400> 673 

aaaaggatat acaaagaatg attttgccat tcatacacta aaaaataact ttgaagcaga 60 
tgcttatttt gttaaagtta ttgttcttag tggtctgtgt tctaatgact gtcctcgtaa 120 
gataacaccc tttggagtga atcaacctgg accttatatc atg tat aca act gta 175 

Met Tyr Thr Thr Val 



gat 


gca 


aat 


ggg 


tat 


ctg 


aaa 


aat 


gga 


tea 


get 


ggc 


caa 


etc 


age 


caa 


223 


Asp 


Ala 


Asn 


Gly 


Tyr 


Leu 


Lys 


Asn 


Gly 


Ser 


Ala 


Gly 


Gin 




Ser 


Gin 












10 










15 










20 






tec 


tea 


eat 


tta 


get 


etc 


caa 


eta 


cea 


tac 




gtg 


ctt 


ggt 


tta 


ggt 


271 


Ser 


Ser 


His 


Leu 


Ala 


Leu 


Gin 


Leu 


Pro 


Tyr 


Asn 


Val 


Leu 


Gly 


Leu 


Gly 










25 










30 










35 








egg 


age 


gca 


aat 


ttt 


ctt 


gac 


cat 


etc 


tac 


gtt 


ggt 






cgt 


cca 


319 


Arg 


Ser 


Ala 


Asn 


Phe 


Leu 


Asp 


His 


Leu 


Tyr 


Val 


Gly 


He 


Pro 


Arg 


Pro 








40 










45 










50 










tot 


gga 


gaa 


aaa 


tct 


ata 


cga 






gag 


tgg 




gca 


ate 


att 


cca 


367 


Ser 


Gly 


Glu 


Lys 


Ser 


He 


Arg 


Lys 


Gin 


Glu 


Trp 


Thr 


Ala 


He 


He 


Pro 





aat tcc cag cta att gtc att cca tac cct cac aat gtc cct cga agt 415
Asn Ser Gln Leu Ile Val Ile Pro Tyr Pro His Asn Val Pro Arg Ser
70 75 80 85

tgg agt gcc aaa ctg tat ctt aca cca agt aat att gtt ctg ctt act 463
Trp Ser Ala Lys Leu Tyr Leu Thr Pro Ser Asn Ile Val Leu Leu Thr
90 95 100

gct ata gct ctc atc ggt gtc tgt gtt ttc atc ttg gca ata att ggc 511
Ala Ile Ala Leu Ile Gly Val Cys Val Phe Ile Leu Ala Ile Ile Gly
105 110 115

att tta cat tgg cag gaa aag aaa gca gat gat aga gaa aaa cga caa 559
Ile Leu His Trp Gln Glu Lys Lys Ala Asp Asp Arg Glu Lys Arg Gln
120 125 130

gaa gcc cac cgg ttt cat ttt gat gct atg tgact 594
Glu Ala His Arg Phe His Phe Asp Ala Met
135 140

<210> 674 
<211> 545 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 297..500

<400> 674 

atgaggttcc atgcggccaa caagagaggg agatagaaag gaaagggccc cagatgtggc 60
tccccttcca aatactttaa gttcccagtt taacctatgt gatcctttca atatttccac 120
cataatgcat gggtccagtc ctgttctttg ggttgcacaa aataattagc acaagcatac 180
aaaaaggcaa gagtaatatt caaaatactg aagagctatg gtacactaaa gtgccctggg 240
ggtaagtctg tgtgtgccag cctgcagcag gtggtctcag ccctggaagg aacagc atg 299
Met
1

atg gac acc atg gat ccc ttc tgg ttt aat gca ttc aaa agg act aat 347
Met Asp Thr Met Asp Pro Phe Trp Phe Asn Ala Phe Lys Arg Thr Asn
5 10 15

acc ata ctg cac cat ttg aga atg tcc aag cac aca gat gca gca gaa 395
Thr Ile Leu His His Leu Arg Met Ser Lys His Thr Asp Ala Ala Glu
20 25 30

gag gtg cta ttg gaa aaa aaa ggt tgc acg gga gtc ata aca cta aac 443
Glu Val Leu Leu Glu Lys Lys Gly Cys Thr Gly Val Ile Thr Leu Asn
35 40 45

aga cca aag ttc ctc aat gca ctg act ctt aat atg att cgg cag att 491
Arg Pro Lys Phe Leu Asn Ala Leu Thr Leu Asn Met Ile Arg Gln Ile
50 55 60 65

tat cac agc taaagaagtt ggaacaagat cctgaaactt tcctgatcat tataa 545
Tyr His Ser



<210> 675 
<211> 534 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 13..234



<400> 675 

tttttttcca ct atg gct tcc agc act gtc ccg gtg agc gct gct ggc tcg 51
Met Ala Ser Ser Thr Val Pro Val Ser Ala Ala Gly Ser
1 5 10




get 






act 


ccc 


gaa 


ata 


ceg 


gac 


aac 


gtg 


gga 


gat tgg ctt egg 


99 


Ala 


Asn 
15 


Glu 


Thr 


Pro 


Glu 


He 
20 


Pro 


Asp 


Asn 


Val 


Gly 
25 


Asp Trp Leu Arg 




ggc 


gtc 


tac 


cgc 


ttt 


gee 


act 


gat 


agg 


aat 


gac 


ttc 


egg agg aac ttg 


147 


Gly Val 


Tyr 


Arg 


Phe 


Ala 


Thr 


Asp 


Arg 


Asn Asp 


Phe Arg Arg Asn Leu 




30 










35 










40 




45 




ata 


eta 


aat 


ttg 


gga 


etc 


ttt 


get 


gcg 


gga 


gtt 


tgg 


ctg gee agg aac 


195 


He 


Leu 


Asn 


Leu 


Gly 

50 


Leu 


Phe 


Ala 


Ala 


Gly 

55 


Val 


Trp 


Leu Ala Arg Asn 
60 




ttg 


agt 


gac 








atg 


gca 


cct 


cag 




ggg 


gtg tagceaagta 


244 


Leu 


Ser 


Asp 


He 
65 


Asp 


Leu 


Met 


Ala 


Pro 
70 


Gin 


Pro 


Gly 


Val 





gacaaatgga atcctgtgct gaacccgaat cttccaaaaa acagcctaca atctgtgacc 304
accacaagat gtgccctgat ggcagctgaa gtttgattca gatgggcact tttcttcccc 364
ttccctgcct agtttccttt tgttccttga gtccacgcag aattccattc tctggtcagc 424
agacaggctt aagctaaagt attgcctcta ttctgtaaag ttctgtacat agttcccaag 484
cttctgcagg gggtgatttt tgctcttgtc ctgagaaata acagtgctgt 534



<210> 676 
<211> 509 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 125..367



<220> 

<221> misc_feature 
<222> 91,461 

<223> n=a, g, c or t



<400> 676 

agccgagcct gcggaaggcg gcggcggcgg cacctgcgat cagcggctgg ggcaggttat 60 

ggtagtgcgg actgcggtgt gagcagagcg nccacggggc ccgccatgcg ccggcggccc 120

tgac atg ggc gcc acg ggt cca aag ctc ggg gcc tgt ggc ctt cgc ctc 169
Met Gly Ala Thr Gly Pro Lys Leu Gly Ala Cys Gly Leu Arg Leu
1 5 10 15




ggc 


ggc 


egg ags 


ggc ggc 


tea gag gca 


gca 


gga 


get 


gag 


caa 


get ttg 


217 


Gly 


Gly 


Arg Xaa 


Gly Gly 


Ser Glu Ala 


Ala 


Gly 


Ala 


Glu 


Gin 


Ala Leu 










20 




25 










30 




gtr 


egg 


cct egg 


ggc cga 


get gtr ccc 


ccc 


ttc 


gta 


ttc 


acg 


cgc cgc 


265 


Val 


Arg 


Pro Arg 


Gly Arg 


Ala Val Pro 


Pro 


Phe 


Val 


Phe 


Thr 


Arg Arg 








35 




40 










45 






ggc 


tct 


atg ttc 




gag gat ggg 


gat 


ctr 


get 


cac 


gag 


ttc tat 


313 


Gly 


Ser 


Met Phe 


Tyr Asp 


Glu Asp Gly 


Asp 


Leu 


Ala 


His 


Glu 


Phe Tyr 








50 




55 








60 








gag 


gag 


aca ate 


gtc acc 


aag aac ggg 


cag 


aak 


ckg 


gcc 


aag 


ctg agg 


361 


Glu 


Glu 


Thr He 


Val Thr 


Lys Asn Gly Gin 


Xaa 


Xaa 


Ala 


Lys 


Leu Arg 






65 






70 






75 











gca tcg tgaagctgga tcasccccgc atccacgtgg atttccctgt gatcctctat 417
Ala Ser
80

gaggtgtgac ccctgggagg tggcagacag aagcaccccc tgcnccggca agaaactccc 477
aggctcaatc aaggtgtggc ttceatttga gg 509 



<210> 677 
<211> 591 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 101..304



<400> 677 

gcggaagtgc tgcttactgg gtcctgaatc ggctgcgtgc tgttctaggc atctttcacc 60
ctgaagtcac gggaacaacg ctgaggggca gccagagatt atg tgg gct ttt cca 115
Met Trp Ala Phe Pro
1 5

tgg gaa gaa cca ctc tac gtt atc acc ttc tac ata gta gca gtt gaa 163
Trp Glu Glu Pro Leu Tyr Val Ile Thr Phe Tyr Ile Val Ala Val Glu
10 15 20

gcc aaa tgg aca gaa agc ccg aga caa cat gaa gtt gtt cta caa gtt 211
Ala Lys Trp Thr Glu Ser Pro Arg Gln His Glu Val Val Leu Gln Val
25 30 35

att ttg gag aaa ttg act tac cat acc act cat caa ccc atg caa aag 259
Ile Leu Glu Lys Leu Thr Tyr His Thr Thr His Gln Pro Met Gln Lys
40 45 50

cct gtc tat gtc caa tca gca gaa tgt ctc gga cca cct aaa aag 304
Pro Val Tyr Val Gln Ser Ala Glu Cys Leu Gly Pro Pro Lys Lys
55 60 65

taaaagaagg agactgaaat aatagcatct ttgatgaaaa ctatctggaa gacaagttgt 364
taacaattct ggggatcttg gtgattacag agttcttaat ccctctgtcc ataggtgatg 424
acaattacag gctgcctata ggtcctatag tgctcacaca cctccagccc ttccccatgg 484
tgtacacaca cttgcagtat attcatctct ttgtcttatt tgagagtagg gctgggtgtg 544
tgtacaaact aatgacaaat acttgacagt cacacagcag tgataca 591

<210> 678 
<211> 348 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 11..331



<400> 678 

aaaaaacaac atg cca cag tta aaa ctt act tca gtt tgg aca act act 49
Met Pro Gln Leu Lys Leu Thr Ser Val Trp Thr Thr Thr





age 


tac 


tac 




grg 




cga 


acg 


agk 


tea 


cet 


ggd 


tak 


ama 


cct 


97 




15 


Tyr 


Tyr 


Th^ 


Xaa 


Thr 
20 


Arg 








25 


Gly 










gga 


cca 


cca 


cca 


atg 


gat 


atw 


caa 


atg 


gca 


aac 


aat 


ttt 


act 


ccg 


ccc 


145 


Gly 


Pro 


Pro 


Pro 


Met 


Asp 


He 


Gin 


Met 


Ala 


Asn 


Asn 


Phe 


Thr 








30 










35 










40 










45 




tct 


gca 


act 


cct 


cag 


gga 


aat 


gac 


tgt 


gac 


etc 


tat 


gca 


eat 


eac 


age 


193 


Ser 


Ala 


Thr 


Pro 


Gin 


Gly Asn 


Asp 


Cys 


Asp 


Leu 


Tyr 


Ala 


His 


His 


Ser 
































60 






acg 


gcc 


agt 


ata 


gta 


atg 


cct 


ctg 


cat 


tac 


age 


etc 


gkc 


ttc 


ate 


att 


241 


Thr 


Ala 


Ser 


He 
65 


Val 


Met 


Pro 


Leu 


His 
70 


Tyr 


Ser 


Leu 


Xaa 


Phe 
75 


He 


He 




ggg 


etc 


gtg 


gga 


aac 


tta 


eta 


gcc 


ttg 


gtc 


gtc 


att 


gtt 


caa 


aac 


agg 


289 


Gly 


Leu 


Val 


Gly 


Asn 


Leu 


Leu 


Ala 


Leu 


Val 


Val 


He 


Val 


Gin 


Asn Arg 








80 










85 










90 










aaa 






ate 


tea 


cct 


tgr 


gcc 


cam 


ccc 


cam 


ccc 


att 


gkc 






331 


Lys 


Lys 


He 


He 


Ser 


Pro 


Xaa 


Ala 


Xaa 


Pro 


Xaa 


Pro 


He 


Xaa 









taatctgtag ; 



<210> 679 
<211> 503 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 194..409



<400> 679 

catctcccac caggccccac ctgcaacact ggggatcaaa tttcaacacg agatttggaa 60
ggggcaaata tccaaactgt atcagcactc tttgaaagat tacagagtaa atatctttct 120
tcgtcagaaa tggaatgatc cccgcctcgc gtacagtgaa tatcctgacg actctttaga 180
cctcgacccc tcc atg ttg gac tcc att tgg aaa cct gat ttg ttc ttt 229
Met Leu Asp Ser Ile Trp Lys Pro Asp Leu Phe Phe
1 5 10






gcc 


aat 


gaa 


aag ggt 


gcc 


aac 


ttt cat 


gaa 


gtc 


act 


aca gac aac 




277 


Ala 


Asn 


Glu 


Lys Gly 


Ala 


Asn 


Phe His 


Glu 


Val 


Thr 


Thr Asp Asn Lys 








15 








20 








25 






ttg 


eta 


aga 


att ttc 


aaa 


aat 


gga aat 


gtt 


ctt 


tat 


tea ata aga 


tta 


325 




Leu 
30 


Arg 


He Phe 




Asn 
35 


Gly Asn 


Val 




Tyr 
40 


Ser He Arg 


Leu 




aca 


tta 


aca 


ctt tee 


tgt 


cea 


atg gat 


etc 


aag 


aat 


ttt ccc atg 


gat 


373 


Thr 




Thr 


Leu Ser 


Cys 


Pro 


Met Asp 


Leu 


Lys 


Asn 


Phe Pro Met 


Asp 




45 








50 








55 






60 




gta 


caa 




tgt ata 


atg 


caa 


ctg gaa 


age 


ttg 


gtt 


taaaatgaca 




419 


Val 


Gin 


Thr 


Cys He 
65 


Met 


Gin 


Leu Glu 


Ser 
70 


Leu 


Val 









agataaaaga ttgtaaatcg gtggaaccac tggaaagtag gcaaaaagag tttaacctac 479
tggaaagcaa gagctgtcag aatc 503



<210> 680 
<211> 561 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 256..426



<400> 680 
ttgaagattt ttttggtata ttctagatgt taatctcatc ctgtcttctg tttatcaact 60
ttgtcaatgg taactttgag aaatttttag ttttgatata gccccagtcc attaattttt 120
gccttaggtt gcactttaag agtataagat atttttcctt accccaagat tacaaagata 180
ttctgtattc tgtcagcttt ccagttttac ctttcacatt gagggcttta ttctatttga 240
agttcatctt catat atg gga aaa caa agg aat ctc aca tta ttt ttt tcc 291
Met Gly Lys Gln Arg Asn Leu Thr Leu Phe Phe Ser
1 5 10

ttt cca ttc tgt gag aca gtt ttc cta aat cat cta tta aac agt gta 339
Phe Pro Phe Cys Glu Thr Val Phe Leu Asn His Leu Leu Asn Ser Val
15 20 25

ttc ttt cct tca tat agt ttt gtg gag cca ctt tta tca tac ctc aga 387
Phe Phe Pro Ser Tyr Ser Phe Val Glu Pro Leu Leu Ser Tyr Leu Arg
30 35 40

ttc cca cag aca cat att agt tta ctt ttt caa ctt tat taagagttat 436
Phe Pro Gln Thr His Ile Ser Leu Leu Phe Gln Leu Tyr
45 50 55

taaggctgag cgtggtggct caatgcctgt aatcccagca ctttgggaga ctgaggtggg 496
aggattgttt gagtccagga gttcaagacc agcctgggca acatggagag accctgtctc 556
tacaa 561

<210> 681 
<211> 520 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 21..326

<400> 681 

ctttggcttc cccaccggca atg gag cgg ttg acg ttg cct ctc ggc ggc gcg 53
Met Glu Arg Leu Thr Leu Pro Leu Gly Gly Ala
1 5 10




geg 


geg 


gtg 


gac 


gag 


tac 


etg 


gag 


tac 


egg 


aga 


att 


gtt 


ggt 


gag 


gat 


101 


Ala 


Ala 


Val 


Asp 
15 


Glu 


Tyr 


Leu 


Glu 


Tyr 

20 


Arg 


Arg 


lie 


Val 


Gly 

25 


Glu 


Asp 




gat 


gga 


ggg 


aaa 


ctt 


ttt 


act 


ect 


gaa 


gaa 


tat 


gaa 


gaa 


tac 


aaa 


aga 


149 


Asp 


Gly 


Gly 
30 


Lys 


Leu 


Phe 


Thr 


Pro 
35 


Glu 


Glu 


Tyr 


Glu 


Glu 
40 


Tyr 


Lys 


Arg 




aaa 


gtt 


tta 


ect 


etg 


ege 


tta 


caa 


aac 


aga 


tta 


ttt 


gtg 


age 


tgg 


egg 


197 


Lys 


Val 
45 


Leu 


Pro 


Leu 


Arg 


Leu 
50 


Gin 


Asn 


Arg 


Leu 


Phe 
55 


Val 


Ser 


Trp 


Arg 




tea 


cca 


aca 


ggg 


atg 


gat 


tgt 


aaa 


ctt 


gtg 


gge 


cca 


gag 


aca 


etg 


tgt 


245 


Ser 


Pro 


Thr 


Gly 


Met 


Asp 


Cys 


Lys 


Leu 


Val 


Gly 


Pro 


Glu 


Thr 


Leu 


Cys 




60 










65 










70 










75 




ttt 


tgt 


aca 


eat 


agg 


tat 


aaa 


caa 


eat 


aaa 


act 


gac 


ttg 


gaa 


geg 


att 


293 


Phe 


Cys 


Thr 


His 


Arg 
80 


Tyr 


Lys 


Gin 


His 


Lys 
85 


Thr 


Asp 


Leu 


Glu 


Ala 
90 


lie 




ect 


eag 


cag 


tgc 


ccc 


att 




ccc 


ect 


gee 


aag 


tgactggctg ceagtgeagg 


346 


Pro 


Gin 


Gin 


Cys 
95 


Pro 


lie 


Asp 


Pro 


Pro 
100 


Ala 


Lys 















gcttaccttt atgtcccctt gaatggtagc cagcccattc gctgcaggtg caaacacttt 406
gctgatcagc acagtgctgc gcctggcttt acatgcaata catgttccaa gtgttcagga 466
ttccatagct gcttcacttg tgcttgtggt cagcctgcat attggycatg acac 520



<210> 682 
<211> 513 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 85..396

<400> 682

actttctcgg ggtgtctgcg taactgccca gacttgcctt ggtttggtca gatgacctcc 60 
tctgggactg gctagccagc gttc atg tct aca tta ttg atc tcg tct tcc 111

Met Ser Thr Leu Leu Ile Ser Ser Ser

1 5 





gee 


tta 


ttg 


gag 


gaa 


ctg 


gaa 


cgc 


tec 




ctt 


cag 


gae 


agt 


gat 


159 


Lp 


Ala 


Leu 


Leu 


Glu 


Glu 


Leu 


Glu 


Arg 


Ser 


Thr 


Leu 


Gin 


Asp 


Ser 


Asp 




10 










15 










20 










25 




gaa 


tat 


tec 


aac 


cea 


get 


cet 


ctt 


ccc 


ctg 


gat 


cag 


cat 


tec 


aga 


aag 


207 


Glu 


Tyr 


Ser 


Asn 


Pro 

30 


Ala 


Pro 


Leu 


Pro 


Leu 

35 


Asp 


Gin 


His 


Ser 


Arg 

40 


Lys 




gag 


act 


aac 


ett 


gat 


gag 


act 


teg 


gag 


ate 


ctt 


tct 


att 


cag 


gat 


aac 


255 


Glu 


Thr 


Asn 


Leu 
45 


Asp 


Glu 


Thr 


Ser 


Glu 
50 


He 


Leu 


Ser 


He 


Gin 
55 


Asp 


Asn 




aca 


agt 


ccc 


ttg 


ccg 


gcg 


cas 


teg 


tgt 


ata 


eta 


cea 


ata 


tec 


agg 


age 


303 


Thr 


Ser 


Pro 
60 


Leu 


Pro 


Ala 


Xaa 


Ser 
65 


Cys 


He 


Leu 


Pro 


He 

70 


Ser 


Arg 


Ser 




tea 


atg 


tct 


aca 


gtg 




ccc 


aag 


age 


caa 


agg 


aat 


cae 


cae 


cgc 


ctt 


351 


Ser 


Met 
75 


Ser 


Thr 


Val 


Lys 


Pro 
80 


Lys 


Ser 


Gin 


Arg 


Asn 
85 


His 


His 


Arg 


Leu 




eta 


aaa 


cgt 


cag 


cag 


ctg 


etc 


agt 


tgg 


atg 


age 


tea 


ttg 


etc 


ace 




396 


Leu 


Lys 


Arg 


Gin 


Gin 


Leu 


Leu 


Ser 


Trp 


Met 


Ser 


Ser 


Leu 




Thr 






90 










95 










100 















tgactgagat gcaggccaag gttgcagtga gagcagatgc tggcaagaag cacttaccag 456
acaagcagga tcacaaggcc tccctggact caatgcttgg gggtctcgag caggaat 513



<210> 683 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 317..505

<400> 683 

aagggaatct ccgggcgggg tagtgcaggc gccgggtttc ccgcggtccg agctggcgcg 60 
ggcggaggag aatcgctctt aaagggccag cgcacacgcg ttcttttgtt ccggggccgc 120
aggcggggca ggcccgactt tcgccgtctt cttgtctact ctccagaacg gccatgattt 180
cccaattctt cattctgtcc tccaaggggg accgctcatc tacaaagact tccgcgggga 240
cagtggcggc cgggatgtgg ccgagctctt ctaccggaag ctgacgggac tgccaggaga 300
cgagtccccg gttgtc atg gac tat ggc tat gta cag acc aca tcc acg gag 352
Met Asp Tyr Gly Tyr Val Gln Thr Thr Ser Thr Glu
1 5 10

atg ctg agg aat ttc atc cag acg gaa gct gtg gtc agc aag ccc ttc 400
Met Leu Arg Asn Phe Ile Gln Thr Glu Ala Val Val Ser Lys Pro Phe
15 20 25

agc ctc ttt gac ctc agc agc gtt ggc ttg ktt ggg gct gag aca caa 448
Ser Leu Phe Asp Leu Ser Ser Val Gly Leu Xaa Gly Ala Glu Thr Gln
30 35 40

cag agc aaa gtg gcc ccc cag cag tgc agc cag ycg scc cgt cct gkc 496
Gln Ser Lys Val Ala Pro Gln Gln Cys Ser Gln Xaa Xaa Arg Pro Xaa
45 50 55 60

cag tcg ctc tgaccagagc caaaag 521
Gln Ser Leu

<210> 684 
<211> 462 
<212> DNA 
<213> Homo sapiens 



<220> 
<221>
<222> 



<400> 684 

a atg ctc agc gcg ctg ccc ggc tgg gga ccc gcg cac ctg cag cgc ccg 49
Met Leu Ser Ala Leu Pro Gly Trp Gly Pro Ala His Leu Gln Arg Pro
1 5 10 15

ctg ctc ggc cct gca tcc tgc ctg ggc atc ctg cgc ccg gcc atg acg 97
Leu Leu Gly Pro Ala Ser Cys Leu Gly Ile Leu Arg Pro Ala Met Thr
20 25 30

atc ttc acc acg ttc tgg ggc 145
Ile Phe Thr Thr Phe Trp Gly
35 40 45

ctc gtc ggc atc gcc ggg ccc tgg ttc gtg ccg aag gac cca acc gcg 193
Leu Val Gly Ile Ala Gly Pro Trp Phe Val Pro Lys Asp Pro Thr Ala
50 55 60

gag tgatcatcac catgctggtc gccaccgccg tctgctgtta cctcttctgg 246
Glu
65

ctcatcgcca tcctggcgca stgaaccccc tgttcgggcc ccagctgaag aatgagacca 306
tctggtacgt gcgcttcctg tgggagtgac ccgccgcccc cgacccaggt gcccagctct 366
cggaatgact gtggctccac tgtccctgac aaccccttcg tccggaccct cccccacaca 426
actatgtctg gtcaccagct ccctcctgct ggcacc 462



<210> 685 
<211> 513 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 128..376



<400> 685 

atctggatat ttctcacatt tgccttggtt tgatgctcct ggagttccca aaagaataga 60 
gatcacttta tcaagcttgc agaacaggtt gcatgattgt tgtgctgctg cttgctagta 120 



aaagttt 


atg 


ctt' 


gta 


gaa 


aga 


aaa 


gtt 


aat 


ggg 


ctg 


gtg 


ccg 


egg 


etc 


169 






Met 


Leu 


Val 


Glu 


Arg 


Lys 


Val 


Asn 


Gly Leu 


Val 


Pro 


Arg 


Leu 








1 








5 










10 












atg 


cct 


gta 


ate 


gca 


gca 


cat 


tgg 


gag 


gee 


cag 


gtg 


ggc 


gca 


tea 


ctt 


217 


Met 


Pro 


Val 


He 


Ala 


Ala 


His 


Trp 


Glu 


Ala 


Gin 


Val 


Gly 


Ala 


Ser 


Leu 




15 










20 










25 










30 




gag 


ggt 


tec 


age 


ate 


ttg 


gae 


aga 


get 


gtt 


att 


gaw 


cac 


aat 


ttg 


ttg 


265 


Glu 


Gly 


Ser 


Ser 


He 
35 


Leu 


Asp 


Arg 


Ala 


Val 
40 


He 


Xaa 


His 


Asn 


Leu 
45 


Leu 




tet 


gca 


age 


aaa 


tta 


tat 


aat 


aat 


att 


ace 


ttc 


gaa 


gaa 


ctt 


gga 


get 


313 


Ser 


Ala 


Ser 


Lys 
50 


Leu 


Tyr 


Asn 


Asn 


He 
55 


Thr 


Phe 


Glu 


Glu 


Leu 
60 


Gly 


Ala 




ett 


tta 


gag 


ate 


cct 


gca 


get 


aag 


gcg 






ata 


gca 


tct 


caa 




361 


Leu 


Leu 


Glu 


He 


Pro 


Ala 


Ala 


Lys 


Ala 


Glu 


Lys 


He 


Ala 


Ser 


Gin 


Met 





65 70 75 



ata acc gaa gac gta tgaatggatt tattgaccag attgatggaa tagttcattt 416
Ile Thr Glu Asp Val
80

tgaaacacga gaagccctgc caacgtggga taagcagatc caatcacttt gtttccaagt 476
gaataacctt tttggagaaa attagtcaaa cagcacc 513



<210> 686 
<211> 498 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS
<222> 118..399



<400> 686 

aacaattcat gaagttgaag aaaagacact gtcagaaatg aacacagaag cggasaacag 60 



cttctccatc acgccagaaa tggcaatgct gaagaagtaa 


gac^ 


iactatt agagacc 


117 


atg 


gcg 


agt 


aat 


gaa 


gtg 


att 


get 


gac 


att 


aat 


tgc 


aaa 


gga 


aga 


agt 


165 


Met 


Ala 


Ser 


Asn 


Glu 


Val 


He 


Ala 


Asp 


He 


Asn 


Cys 


Lys 


Gly 


Arg 


Ser 




1 








5 










10 










15 






aag 


tct 


aac 


ttg 


ggc 


tgg 


aca 


ccc 


eta 


cat 


ctg 


gca 


tgc 


tat 


ttt 


gga 


213 


Lys 


Ser 


Asn 


Leu 


Gly 


Trp 


Thr 


Pro 


Leu 


His 


Leu 


Ala 


Cys 


Tyr 


Phe 


Gly 










20 










25 










30 








cac 


aga 


caa 


gtg 


gtc 


cag 




ctg 


ttg 


aag 


get 


ggt 


gca 


gaa 


gtg 




261 


His 


Arg 


Gin 


Val 


Val 


Gin 


Asp 


Leu 


Leu 


Lys 


Ala 


Gly 


Ala 


Glu 


Val 


Asn 








35 










40 










45 










gtg 


ttg 


aat 


gac 


atg 


gga 


gac 


acg 


ceg 


ctt 


cat 


cga 


get 


gee 


ttt 


aca 


309 


Val 


Leu 


Asn 


Asp 


Met 


Gly 


Asp 


Thr 


Pro 


Leu 


His 


Arg 


Ala 


Ala 


Phe 


Thr 






50 










55 










60 












gga 


cga 


aag 


gtg 


aaa 


ate 


att 


eta 


tgt 


tea 


atg 


ttt 


gta 


agt 


gag 


gta 


357 


Gly 


Arg 


Lys 


Val 


Lys 


He 


He 


Leu 


Cys 


Ser 


Met 


Phe 


Val 


Ser 


Glu 


Val 




65 










70 










75 










80 




ttt 


gga 


gga 


gta 


gtt 


acc 


att 


gtt 


ttc 


tct 


gtt 


ata 


ace 


ate 






399 


Phe 


Gly 


Gly 


Val 


Val 


Thr 


He 


Val 


Phe 


Ser 


Val 


He 


Thr 


He 









85 90 



tgaccagcaa ccgagaaagc cacacaaaaa aatgtataca ccagcacttt gggtcaaagg 459
ccacaggatc ttttgagtct gacagtgagg tccagtact 498



<210> 687 
<211> 550 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 82..306



<400> 687

agtctcgcca gctgccggtc tttcgggggc tccgtaactt tctatccgtc cgcgtcagcg 60
cttgccaccc tcatctccaa t atg cct ggt ccg acc ccc agt ggc act aac 111
Met Pro Gly Pro Thr Pro Ser Gly Thr Asn
1 5 10

gtg gga tcc tca ggg cgc tct ccc agc aaa gca gtg gcc gcc cgg gcg 159
Val Gly Ser Ser Gly Arg Ser Pro Ser Lys Ala Val Ala Ala Arg Ala
15 20 25

ggg gat cca ctg tcc ggc aga gga aaa atg cca gct gtg gga caa gga 207
Gly Asp Pro Leu Ser Gly Arg Gly Lys Met Pro Ala Val Gly Gln Gly
30 35 40

I cct cgg cag gca ccg ggg gga tgt ggc gat tct 255
I Gln Pro Arg Gln Ala Pro Gly Gly Cys Gly Asp Ser
45 50 55

aca cag aag att cac ctg ggc tca aag ttg gcc ctg ttc cag tat tgg 303
Thr Gln Lys Ile His Leu Gly Ser Lys Leu Ala Leu Phe Gln Tyr Trp
60 65 70

tta tgagtcttct gttcatcgct tctgtattta tgttgcacat ttggggcaag 356
Leu
75

tacactcgtt cgtagattca gttacatcca tctgtcatct gaagaaggag gaaaaaaccc 416
aacatttctt ggaccaaaag tatagtgact atctgttcat gagagaaatt ttctgtaagc 476
ttgctgtttt acaggggatt tatcaataat tgattttgag gaatcagttt ttttctatgg 536

ctaataaact tttt 550 



<210> 688 
<211> 501 
<212> DNA

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 8..487



<400> 688 

agctcct atg agg ccc ctg ccg ccg gtc ggc gat gtc cgg ctg gag ctg 49
Met Arg Pro Leu Pro Pro Val Gly Asp Val Arg Leu Glu Leu
1 5 10

tcg cct ccg ccg ccg ctg ctg ccg gtg ccg gtt gtg agc ggg tct cca 97
Ser Pro Pro Pro Pro Leu Leu Pro Val Pro Val Val Ser Gly Ser Pro
15 20 25 30

gtc ggc tcc tct ggg cgt ctc atg gcc tct agc agc tcc ctg gtg ccc 145
Val Gly Ser Ser Gly Arg Leu Met Ala Ser Ser Ser Ser Leu Val Pro
35 40 45

gac cgg ctg cgc ctg ccg ctc tgc ttc ctg ggt gtc ttt gtc tgc tat 193
Asp Arg Leu Arg Leu Pro Leu Cys Phe Leu Gly Val Phe Val Cys Tyr
50 55 60

ttt tac tat ggg atc ctg cag gaa aag ata aca aga gga aag tat ggg 241
Phe Tyr Tyr Gly Ile Leu Gln Glu Lys Ile Thr Arg Gly Lys Tyr Gly
65 70 75

gaa gga gcc aag cag gag acg ttc acc ttt gcc tta act ttg gtc ttc 289
Glu Gly Ala Lys Gln Glu Thr Phe Thr Phe Ala Leu Thr Leu Val Phe
80 85 90

att caa tgk gtg atc aat gct gtg ttt gcc aag atc tgr tgg gat cgt 337
Ile Gln Xaa Val Ile Asn Ala Val Phe Ala Lys Ile Xaa Trp Asp Arg
95 100 105 110

acc cgg agc tgg ctc tat gct gcc tgt tct atc tcc tat ctg ggt gcc 385
Thr Arg Ser Trp Leu Tyr Ala Ala Cys Ser Ile Ser Tyr Leu Gly Ala
115 120 125

atg gtc tcc agc aat tca gca cta cag ttt gtc aac tac cca act cag 433
Met Val Ser Ser Asn Ser Ala Leu Gln Phe Val Asn Tyr Pro Thr Gln
130 135 140

gtc ctt ggt aaa tcc tgc aag cca atc cca gtc atg ctc ctt ggg gtg 481
Val Leu Gly Lys Ser Cys Lys Pro Ile Pro Val Met Leu Leu Gly Val
145 150 155

acc tct tgaagaagaa gtac 501
Thr Ser
160



<210> 689 
<211> 494 
<212> DNA 
<213> Homo sapiens



<220> 
<221> CDS 
<222> 282..461



<400> 689 

attttcttaa tttttcagaa ctttaaacag atatgtaatt taaaagatgc agcccctcct 60 
gtgcatgccc tcctcatgct gcccctctgt tccagggccc acctttcctg tcctttgagg 120 
aggacccgcg gcagcaggca ctggagctgg tggtgcagga ggtctccagc gtgctcaggt 180 
cggcggagga gaaggtggtg gtctgtcacg tggtgggcca ggctattcag ttcttggtca 240
gcgagacgcc tgccttgggt gctggatgcc ggctgtacga c atg cgg ctc tcc cct 296

Met Arg Leu Ser Pro
1 5
I ctg gag ttc cac atg aag gag aag aga gag gac ctc 344

1 Leu Glu Phe His Met Lys Glu Lys Arg Glu Asp Leu
10 15 20

cag att agc tgg tct ttc atc agt gtg ccg gaa atg gcc gtt aat atc 392
Gln Ile Ser Trp Ser Phe Ile Ser Val Pro Glu Met Ala Val Asn Ile

25 30 35

cag ccc aaa cac tgg ggg agg acc agg tgg ctg aga caa gtg cga tgt 440

Gln Pro Lys His Trp Gly Arg Thr Arg Trp Leu Arg Gln Val Arg Cys

40 45 50

ctg acg ttc tca agg aca tct tgaagcattt ggctggttct gcctctccat cag 494

Leu Thr Phe Ser Arg Thr Ser



<210> 690 
<211> 582 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 120..530



<400> 690 

aaagtcgtgc tgcaggcgtc gcgccaatct tcgctctgag gtgctgtctc accggtgaga 60 
cctggaagcg ggcgagtctc gtgctgtgtc ggacctgcag cccctggcct tccgccacc 119 
atg gag tac ctc atc ggt atc caa ggc ccc gac tat gtt ctt gtc gcc 167
Met Glu Tyr Leu Ile Gly Ile Gln Gly Pro Asp Tyr Val Leu Val Ala
1 5 10 15

tcc gac cgg gtg gcc gcc agc aat att gtc cag atg aag gac gat cat 215
Ser Asp Arg Val Ala Ala Ser Asn Ile Val Gln Met Lys Asp Asp His



gac aag atg ttt     atg agt gaa aag ata tta ctc     tgt gtt gga 263
    Lys Met Phe Lys Met Ser Glu Lys Ile Leu Leu Leu Cys Val Gly
35 40 45










gag gct gga gac     gta     ttt gca gaa tat att cag aaa aac gtg 311
Glu Ala Gly Asp Thr Val Gln Phe Ala Glu Tyr Ile Gln Lys Asn Val
50 55 60












caa ctt tat aag atg cga aat gga tat gaa ttg tct ccc acg gca gca 359
Gln Leu Tyr Lys Met Arg Asn Gly Tyr Glu Leu Ser Pro Thr Ala Ala
65 70 75 80




gct aac ttc aca cgc cga aac ctg gct gac tgt ctt cgg agt cgg acc 407
Ala Asn Phe Thr Arg Arg Asn Leu Ala Asp Cys Leu Arg Ser Arg Thr
85 90 95

cca tat cat gtg aac ctc ctc ctg gct ggc tat gat gag cat gaa ggg
Pro Tyr His Val Asn Leu Leu Leu Ala Gly Tyr Asp Glu His Glu Gly

100 105 110

cca gcg ctg tat tac atg gac tac ctg gca gcc ttg gcc aag gcc ctt
Pro Ala Leu Tyr Tyr Met Asp Tyr Leu Ala Ala Leu Ala Lys Ala Leu

115 120 125

ttg cag ccc acg gct atg gtg cct tcc tgactctcag tatcctcgac
Leu Gln Pro Thr Ala Met Val Pro Ser

130 135
cgatactaca caccgactat ctcacgtgar ag



<210> 691 
<211> 506 
<212> DNA 
<213> Homo sapiens



<220> 
<221> CDS 
<222> 4..168



<400> 691 

aag atg gcg gac cgt ggc ggc gtg ggt gaa gcc gca gct gtt gga gcg
Met Ala Asp Arg Gly Gly Val Gly Glu Ala Ala Ala Val Gly Ala
tct cct gca tct gtc cct ggc cta aac ccg acg cta ggc tgg agg gag 96
Ser Pro Ala Ser Val Pro Gly Leu Asn Pro Thr Leu Gly Trp Arg Glu

20 25 30

cga ctg cgt ccg ggc tgg cgg gga ctg ggg cct cgt tgt ggt tcg tgg 144
Arg Leu Arg Pro Gly Trp Arg Gly Leu Gly Pro Arg Cys Gly Ser Trp

35 40 45

cgg ggc tgg ggc tgc ttt acg ccc tgaggatccc tttgaggctg tgtgagaatt 198
Arg Gly Trp Gly Cys Phe Thr Pro

50 55
tggcagcggt gactgtattt ttaaattcat tgacacccaa attctatgtg gcacttacag 258
ggacctcttc attgatatca ggactaatat ttatatttga atggtggtac ttccataagc 318
atggcacatc ttttattgag caagtatctg taagccattt gcaaccactg atgggaggaa 378
cagagagcag catttcagaa ccaggttctc cttcgaggaa cagagaaaat gaaaccagca 438
gacagaattt gtcaggtgac tacttttcta atgtgttttc agagctgtgt atttaagatt 498
gagtttgg 506



<210> 692 
<211> 499 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 64..459



<400> 692 

gtcctctggg ccacgctcca cttgcccgct tccacccgga aagcccccca ggctgagtgc 60
ggc     atc tcc atc         tgg cag aag att ggt gtg ggg atc acc 108
    Met Ile Ser Ile Thr Glu Trp Gln Lys Ile Gly Val Gly Ile Thr
1 5 10 15




ggt ttc ggc atc ttc ttc     ctc ttt gga aca ctc ctg tac ttt gat 156
Gly Phe Gly Ile Phe Phe Ile Leu Phe Gly Thr Leu Leu Tyr Phe Asp
20 25 30




tcc gtg ctc ctg gcc ttt gga aac ctg ctg ttc ctg acg ggc ctg tcc 204
Ser Val Leu Leu Ala Phe Gly Asn Leu Leu Phe Leu Thr Gly Leu Ser
35 40 45




ctc atc att ggc ctg agg aag acc ttt tgg ttc ttc ttc     cgg cac 252
Leu Ile Ile Gly Leu Arg Lys Thr Phe Trp Phe Phe Phe Gln Arg His
50 55 60




aaa ctc aag gga acc agc ttc ctc ctg ggg ggt gtg gtt atc gtg ctc 300
Lys Leu Lys Gly Thr Ser Phe Leu Leu Gly Gly Val Val Ile Val Leu
65 70 75




cta cgc tgg ccc ctc ctc ggc atg ttc ctg gaa acc tac gga ttc ttc 348
Leu Arg Trp Pro Leu Leu Gly Met Phe Leu Glu Thr Tyr Gly Phe Phe
80 85 90 95




agc ctc ttt aag ggc ttt ttc cct gtc gcc ttc ggc ttc ctg ggc aat 396
Ser Leu Phe Lys Gly Phe Phe Pro Val Ala Phe Gly Phe Leu Gly Asn
100 105 110




gtc tgc     atc ccc ttc ctg ggt gcg ctg ttc cgg aga ctt     ggc 444
Val Cys Asn Ile Pro Phe Leu Gly Ala Leu Phe Arg Arg Leu Gln Gly
115 120 125

act agc tcg atg gtc tgaaaacaga gatgagctcc ttgaacttgg atcattggtt 499
Thr Ser Ser Met Val
130



<210> 693 
<211> 756 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 188..616
<400> 693

attttattac ccggagtcct ggttcctggg ccagaccggt gtgtcgtttt tggcccaagc 60 
tagagaatgt taagggcttc tgcggtgggt tggtgctaga ggcgccgcga acaggtgctg 120 
cgggggcggc gcgggaggcg gtgcccttgc ttccggatcc ggtctcagct ctgggaggga 180 
acgggag atg ttg cag gcg ccg aga ggg cgg gcc agg gcc gca ctc cgg 229

Met Leu Gln Ala Pro Arg Gly Arg Ala Arg Ala Ala Leu Arg

1 5 10



aga ctc gcg gtt gct acg cgc acc atg gct gga gcg ccc acg gtc tcg 277
Arg Leu Ala Val Ala Thr Arg Thr Met Ala Gly Ala Pro Thr Val Ser
15 20 25 30




ctt cct gaa ctc cgt tca ctc cta gcc tcc gga cgg gcc cgg ctc ttc 325
Leu Pro Glu Leu Arg Ser     Leu Ala Ser Gly Arg Ala Arg Leu Phe
35 40 45






gac gtg cgc tct cgc gag gag gcg gca gct ggg acc atc cca ggg gcg 373
Asp Val Arg Ser Arg Glu Glu Ala Ala Ala Gly Thr Ile Pro Gly Ala
50 55 60








ctc aac     ccg gtg tcc gag ttg gag agt gct ctg cag atg gag cca 421
Leu Asn Ile Pro Val Ser Glu Leu Glu Ser Ala     Gln Met Glu Pro





gct gcc ttc cag gct tta tat tct gct gag aag cca aag ctg gaa gat 469
Ala Ala Phe Gln Ala Leu Tyr Ser Ala Glu Lys Pro Lys Leu Glu Asp

80 85 90

gag cat ctc gtt ttc ttc tgt cag atg ggc aag cgg ggc ctc cag gca 517
Glu His Leu Val Phe Phe Cys Gln Met Gly Lys Arg Gly Leu Gln Ala
95 100 105 110

cgc agc tgg ccc gga gtc ttg gat aca ctg ggg ctc gca act acg ctg 565
Arg Ser Trp Pro Gly Val Leu Asp Thr Leu Gly Leu Ala Thr Thr Leu

115 120 125

gag cct ata gag aat ggt tgg aga aag aga gtt agg cag gag gca gct 613
Glu Pro Ile Glu Asn Gly Trp Arg Lys Arg Val Arg Gln Glu Ala Ala

130 135 140

tac tgattgccac cccctggccc cttaatggcc accttaacta agggtgtgaa 666
Tyr

cgggctgact tggtgaattg ggcaactcct tatagtgttg tgcacacaaa agcatcaaat 726
aaagaacatt taatcaaaaa aaaaaaaaaw 756

<210> 694 
<211> 563 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 296..463

<400> 694 

aatgtttaat gcccgcacct cgtccccggc caccccgtga ggcagccaga cactctccaa 60
aaagcagaga cagcaggaag aggggagtgg aggcagccca ttcacctggg gaaatgactg 120
ggttgtcgat ggacggtggc ggcacccaag ggggacgtgg acccgttcta ctatgactat 180
gagaccgttc gcaatggggg cctgatcttc gctggactgg ccttcatcgt ggggctcctc 240
atcctcctca gcagaagatt ccgctgtggg ggcaataaga agcgcaggca aatca atg 298

Met
1

aag atg agc cgt aac agc agc ctc ggc ggt gcc acc cac tgc act ggg 346

Lys Met Ser Arg Asn Ser Ser Leu Gly Gly Ala Thr His Cys Thr Gly

5 10 15

gcc agc tgg gaa gcc aag cat ggc cct gcc tct ggc gcc tcc cct tct 394
Ala Ser Trp Glu Ala Lys His Gly Pro Ala Ser Gly Ala Ser Pro Ser

20 25 30

tcc ctg ggc ttt aga cct ttg tcc ccg tca ctg cca gcg ctt ggg ctg 442
Ser Leu Gly Phe Arg Pro Leu Ser Pro Ser Leu Pro Ala Leu Gly Leu
aag gaa gct cca gac tca atg tgacccccag gtggcatcgc aactcctgcc 493

Lys Glu Ala Pro Asp Ser Met
50 55

tcgtgccacc tcatgcttat aataaagccg gcgtcagaga ccgctgcttc ccycaaaaaa 553 

aaaaaaaaaa 563 



<210> 695 
<211> 502 
<212> DNA 
<213> Homo sapiens 



<400> 695 

agaggcccta cacccaccga ggcatggggc tccctgggct gttctgcttg gccgtgctgg 60
ctgccagcag cttctccaag gcacgggagg aagaaattac ccctgtggtc tccattgcct 120
acaaagtcct ggaagttttc cccaaaggcc gctgggtgct cataacctgc tgtgcacccc 180
agccaccacc gcccatcacc tattccctct gtggaaccaa gaacatcaag gtggccaaga 240
aggtggtgaa gacccacgag ccggcctcct tcaacctcaa cgtcacactc aagtccagtc 300
cagacctgct cacctacttc tgcygggcgt cctccacctc aggtgcccat gtggacagtg 360
ccaggctaca gatgcactgg gagctgtggt ccaagccagt gtctgagctg cgggccaact 420
tcactctgca ggacagaggg gcaggcccca gggtggagat gatctgccag gcgtcctcgg 480
gcagcccacc tatcaccaac ag 502



<210> 696 
<211> 589 
<212> DNA 
<213> Homo sapiens 



<400> 696 

gtccctcctc ctagttgcag tctggtagtt gtcgctggcc gtgtgacggc tcgctgttgc 60 

cctgaaggca ggcgagccag ctgcccagga aaggtggaaa gtggtagaag ctgacccctg 120 

agccctggca ggtctttaag tgcgtttgtg cagccgattt caaggctaag agagaaagac 180 

tgcctctgat ccctgaagga agaaaaaaaa aaaaaaamcg ggaaaaaaac tcaacwkgga 240 

aaatgtcccc arggaaaaca aakttgtgga raaggcccca gtgcaraatg aagcccccgc 300 

tttaggaggt ggtgaatacc aggagcctgg aggaaatgtt aaaggggttt gggctcmacy 360 

tgccccgggt ttkggagagg atgtgcccaa taggcttgtc gataacattg atatgataga 420 

tggagatgga gatgatatgg aacggttcat ggaggagatg agagagctaa ggmggaaaat 480 

tagggaactt cagttgaggt actctgcgca ttcttatagg ggaccctcct caccatgatc 540 

atcatgatga gttttgcctt atgccttgaa tcttgaggtt aataatcat 589 



<210> 697 
<211> 420 
<212> DNA 
<213> Homo sapiens



<400> 697 

gattcatagg agcatataat ataatctttg taaagtagag ggacaattca gtattttgta 60 

caatatgtag aataacatgt tctcttagta ggcataggat atactacatt aagaaatctg 120 

ggctaagcgt ggtggttcat gcctgtaatc ccagcacttt gggaggccga ggcgggcgga 180 

tcacttgggg tcaggagttc gggaccggcc tggccgacat ggtgaagccc cgtctttact 240 

agaaatacaa aagttggccg ggcgtggtgg tgggcgcctg tggtctcagc tactcggggg 300 

gctgaggcgg gagagtcact tgaacccggg aggcggaggt tgcagtgagc cgagatcacg 360 

ccactgcact ccagcctgga tgacagagtg agactctgtc tcaaaaaaaa aaaaaaaaaa 420 



<210> 698 
<211> 496 
<212> DNA 
<213> Homo sapiens 



<400> 698 

gacataatca gagctatgct ggaggagaag agggcagcca tttgctggct ggcttgcagt 60 

gagccaggag gtggcaggac gagttaggag gctggttcag tagctcgggc aagagcaggg 120 

ccccccagga tctgaaggcc tcccaggccc ccaggcccag cgggtcccag aggagagcga 180 

ggaccccaag gtaactccgg tgagaagggc gaccagggat ttcaaggcca gccaggcttt 240 
ccgggcccac cgggtccccc tggattccca ggcaaagttg gatcacctgg cccacctggc 300

cctcaagcag agaagggcag cgaagggatt cgaggcccat caggcctgcc tggctcccct 360

gggccaccgg gacctcctgg gattcagggc cccgccggtc tggatggttt ggatgggaag 420

gatggcaagc ctggcttgag gggggacctg gtcctgctgg cccccctgga ctcatgggac 480

caccgggctt tagggg 496



<210> 699 
<211> 729 
<212> DNA 
<213> Homo sapiens 



<400> 699 

atctctgagc ctccgcggcc ttctctagca aaatggtgga gccggtaccc ggctgtaagg 60
caagctggga tgcgcggagt agcgggtgga aaactggast ggagttctca ctcggccgct 120
gggtgacttc ggtgcactag ggatagtaac agtaccacct cgtagggttg tagaaagacm 180
aagtgagtta atatttgtaa agtgtttaga acagcgtcta ctgcatggaa agtagtattt 240
atttgttaga gagactccag gaggtcttgc ccagggattg acttctccst tccstgtctc 300
cascccccas attcrncgttt gtgagccaag ccttgctttt gcaccggccg ggctctcttg 360
cctggaaatg catacccatc tcaaacctgg gctttaaaat cacctcctct gggacgcctt 420
cctctacttc tcacagttca ggcctgttct gccccacaca gcagaaaatg cccacgatgc 480
aataccagag tctgcatggg accatttcaa caacccttag aagctgactc agctgtctcc 540
tggccctgac ttttctaaga tgagaggagc tgataggaac aatttgggaa ccctcaaagg 600
ctgagagtat cccacccact tcacagatga agcaaactga ggcccagaga agggaaatta 660
cttgcccaag atcacccagc aagtaagaaa cagagctgga gatgagctca ggccagcacg 720
gkaccacgc 729



<210> 700 
<211> 462 
<212> DNA 
<213> Homo sapiens 

<400> 700 

caggaaaatc gcgagggacg ggcggccccc cgacagtcca gctcctttcg gctcttgcag 60 

gaagccctgg aggctgagga gagaggtggc acgccagcct tcttgcccag ctcactgagc 120 

ccccagtcct ccctgcccgc ctccagggcc ctggccaccc ctcccaagct ccacacttgt 180 

gagaagtgca gtaccagcat cgcgaaccag gctgtgcgca tccaggaggg ccggtaccgc 240 

caccccggct gctacacctg tgccgactgt gggctgaacc tgaagatgcg cgggcacttc 300 

tgggtgggtg acgagctgta ctgtgagaag catgcccgcc agcgctactc cgcacctgcc 360 

accctcagct ctcgggcctg agccgccatg ccctcagcct gcctcactgc tgggccaggg 420 

tcatgcctat ataagttggc atggcaggga caatggtggg ca 462

<210> 701 
<211> 417 
<212> DNA 
<213> Homo sapiens 



<400> 701 

aatgctcagc gcgctgcccg gctggggacc cgcgcacctg cagcgcccgc tgctcggccc 60 

tgcatcctgc ctgggcatcc tgcgcccggc catgacggcg cactcattcg ccctcccggt 120 

catcatcttc accacgttct ggggcctcgt cggcatcgcg ggccctggtt cgtgccgaag 180 

gacccaaccg cggagtgatc atcaccatgc tggtcgccac cgccgtctgc tgttacctct 240 

tctggctcat cgccatcctg gcgcastgaa ccccctgttc gggccccagc tgaagaatga 300 

gaccatctgg trcccagctc tcggaatgac tgtggctcca ctgtccctga caaccccttc 360 

gtccggaccc tcccccacac aactatgtct ggtcaccagc tccctcctgc tggcacc 417 

<210> 702 
<211> 470 
<212> DNA 
<213> Homo sapiens 



<400> 702 

gcgcgcccgc cgcgtgccca gcccgggtag ctgctgcagc ctacgctgcc tcggcactgc 60
ctgccgcgtg cggagccgga gcccgagcct gagtggcgcc gggcccgacg tggggctcct 120






WO 02/083898 PCT/IB01/00914


gggccgcggc ggcgggcggg cgatgctcca gaggcctgac cagccatgga ggccgaggca 180 

ggcggcctgg aggagctgac ggacgaggag atggcggcgc taggcgaaga gctagtgcgg 240 

cgcctgcggc gggaggaggc ggcgcgcctg gcggcactgg tcagcgcggc cgcctcatgc 300 

aggaggtgaa tcggcagctg cagggccacc tgggcgagat ccgcgagctc aagcagctca 360 

accggcgtct rcaggcagag aaccgtgagc tgcgcgacct ctgctgcttc ctggactcgg 420 

agcgccagcg cgggcggcgc gccgcacgcc agtggcagct cttcgggacc 470 

<210> 703 
<211> 452 
<212> DNA 
<213> Homo sapiens 



<400> 703 

aatgcgcaag aaggtgcctg cgggactgga gcagagsggc tgcgagggtc ttcccagcgc 60 

aggcgggttt tccagtgtta cttgcggctg ggcgtggggg actagctgcc tttctggcac 120 

aggcaggaag ccgcaaaaag tttctgagcc cccgaacctg tagcggacgt ggaaaaagaa 180 

cgcccctcct caagtgtctg gctgaaagat gccacccagg gaakggaact cgggctagct 240 

aaggaggcca ttcttgatgt tgcttctaga tctcatgtca tcaccgagcc ctcagctgct 300 

ggtggcagct gctcagcaga cccttggcat gggaaagaga cggagtccac cccaagccat 360 

ctgccttcac ttagctggag aggtgctggc tgtggccggg gactgaagcc agctgtgctc 420 

tatgattgca actgtgcagg ggcatcagag ct 452 



<210> 704 
<211> 304 
<212> DNA 
<213> Homo sapiens 



<400> 704 

atttcgggag cgggattgtt tcgcgcagga agcaggctcc attttagcgc cgcccgccgt 60 

cgccatctgt ttcccttccc tcccccatac ctagcccgag tctgagccct aacgagaagg 120 

ctgggcctag gccgctggat gctggagtga aaggaaggga gaaagggaaa aagcgggaag 180 

agtcgagaag ggagtgttaa gaggccaagt gcgacgcgcg tatccgggca gacggactga 240 

cggacgggcc cgtgcttctg ccgcggctgc gcccggggcc tcttaaccac tgatctttga 300 

aata 304 



<210> 705 
<211> 269 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 84,216 

<223> n=a, g, c or t 



<400> 705 

agtcgtggac tcgtgcagct ggggcgtccg cagccgctcg tcacccgcgt gatgctgttt 60 

ctggcgttgg gcagcccgtg gggntcgaac tgcctctctg cggaaggagg actgcattgt 120 

gtgcggccgc cgcgctccga ggtccccggg cctctgtctc ccgggcgtcc tccagsagcg 180 

ggysttcggg gmgggtagcc ggctggagta cgggmncttc gggagccgcg cgccttctcc 240 

ggcgtccggg tcgagcgcag atccctgtt 269 

<210> 706 
<211> 476 
<212> DNA 
<213> Homo sapiens 

<400> 706 

atttgttcag cctccataat tcgaatacca gggcaggccg agccagccgt gcgccgcgct 60 

ccagggccca gggcgccgca cacgcatcca cccacccacc cagcctcgca gcgccatggg 120 

caagaacaag cagccacgcg gccagcagag gcaggggggc ccgccggccg cggacgccgc 180

tgggcccgat gacatggagc cgaagaaggg cacgggggcc cccaaggagt gcggggaggg 240

atcattgtct gcttagtggc ctatcttggc ttgtttatgc tttgtgtctc atatcaggtt 300
gacgaacgga catgtattca attttctatg aaactgttat actttctgct gagtgccctg 360
ggcctgacgg tctcgtgtgc tggccgtggc ctttgccgcc caccactatt cgcagctcac 420
acagtttacc tgtgagacca cactcgactc ttgccagtgc aaactgccct cctcgg 476

<210> 707 
<211> 266 
<212> DNA 
<213> Homo sapiens 

<400> 707 

ctctgcgtgc gccggaggct gccgtggcgg gtgggccgcc tgacttctcc tcccggccag 60 

ttctcgagcg cctcaccggg cctcgccctg cagcctcgct ctcgctggcg ctgcgcggcc 120 

taggggactg ggctgctggc ctccgggtgc ggggtggggg caggctccga cctggggcgt 180 

cctggcagcg cgagccgcgg gatgggggcc cgggccgcgg aggaggcrcc gctggygtgt 240

cccttggtgg agagggcgct gccggy 266 

<210> 708 
<211> 384 
<212> DNA 
<213> Homo sapiens 

<400> 708 

aaaaaacaac atgccacagt taaaacttac ttcagtttgg acaactactc acagctacta 60 

cacagagacc cgaacgagtc actgatatac acctggacca ccaccaatgg atatacaaat 120 

ggcaaacaat tttactccgc cctctgcaac tcctcaggga aatgactgtg acctctatgc 180 

acatcacagc acggccagta tagtaatgcc tctgcattac agcctcgtct tcatcattgg 240 

gctcgtggga aacttactag ccttggtcgt cattgttcaa aacaggaaaa aaatcatctc 300 

accttgagcc cacccccacc ccattgtcta atctgtagaa gctaataaat aatcatccct 360 

ccttgcctag caaaaaaaaa aaaa 384 

<210> 709 
<211> 497 
<212> DNA 
<213> Homo sapiens 

<400> 709 

ctcttttgct ytaacagaca agccascgay tttaggctgg ataatagtca aattcttacc 60 

tcgctctttc actgctagta agatcagatt gcgtttcttt cagttactct tcaatcgcca 120 

gtttcttgat ctgcttctaa aagaagaagt agagaagata aatcctgtct tcaatacctg 180 

gaaggaaaaa caaaataacc tcaactccgt tttgaaaaaa acattccaag aactttcatc 240

agagatttta cttagatgat ttacacaatg aagaaagtac atgcactttg ggcttctgtc 300 

cctgctgctt aatcttgccc ctgamcctct taatgctgat tctgaggaag atgaagaaca 360 

cacaattatc acagatacgg agttgccacc actgaaactt atgcattcat tttgtgcatt 420 

caaggcggat gatagcccat gtaaagcaat catgaaaaga tttttcttca atattttcac 480 

tcgacagtgc gaagaat 497

<210> 710 
<211> 471 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature
<222> 423 

<223> n=a, g, c or t 
<400> 710 

attcccaact gccagtgatc tctgaagccg actctgaggc tccctctttg ctctaacaga 60 
cagcagcgac tttaggctgg ataatagtca aattcttacc tcgctctttc actgctagta 120 
agatcagatt gcgtttcttt cagttactct tcaatcgcca gtttcttgat ctgcttctaa 180 
aagaagaagt agagaagata aatcctgtct tcaatacctg gaagaaaaac aaaataacct 240
caactccgtt ttgaaaaaaa cattccaaga actttcatca gakattttac ttagatgatt 300 
tacacaatga agaaagtaca tgcactttgg gcttctgtat gcctgctgct taatcttgcc 360 
cctgcccctc ttaatgctga ttctgaggaa gatgaagaac acacaattat cacagatacg 420

gantgccacc actgaaactt atgcattcat tttgtgcatt caagtcggat g 471 

<210> 711 
<211> 405 
<212> DNA 
<213> Homo sapiens 

<400> 711 

aatgatcctg cagcagccct tgcagcgagg cccccaggga ggggccagcg cctcccgcgg 60 

gccgccttgg gggtgacttg gggcctggac gccagctccc ctctccgagg agctgtgccc 120 

atgagcacca agcggcgcct ggaggaggag caggagcctc tgcgcaagcr rtttctgtct 180 

gaggagaaca tggccaccca cttctctcaa ctcagcctgc acaatgacca cccctactgc 240 

agccccccca tgaccttctc cccagccctg cccccactca ggagcccttg ctctgagctg 300 

cttctctggc gctatcctgg cagcctcatc cctgaggccc tccgtctgct gakgctgggg 360 

gacaccccca tgttccccct accctgcaac cccagctggg gacat 405 

<210> 712 
<211> 160 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 105 

<223> n=a, g, c or t 



<400> 712 

aagaaaccca atgatcctgc agcagccctt gcagcgaggc ccccagggag gggccagcgc 60 

ctcccgcggg ccgccttggg ggtgacttgg ggccwggacg ssagntcccc yctccwagga 120 

gctgtgccca tgagcaccaa gcggcgvctg gaggaggaga 160 



<210> 713 
<211> 427 
<212> DNA 
<213> Homo sapiens 



<400> 713 

caatcttcgc tctgaggtgc tgtctcaccg gtgagacctg gaagcgggcg agtctcgtgc 60 

tgtgtcggac ctgcagcccc tggccttccg ccaccatgga gtacctcatc ggtatccaag 120 

gccccgacta tgtkctkkkk hvggcytccg accgggtggg ccgccagcaa tattgtccag 180 

atgaaggacg atcatgacaa gatgtttaag atgagtgaaa agatattact cctgtgtgtt 240 

ggakaggctg gagacactgt acagtttgca gaatatattc agaaaaacgt gcaactttat 300 

aagatgcgaa atggatatga attgtctccc acggcagsag ctaacttcac acgccgaaac 360 

ctggctgact gkcttcggag tcggacccca tatcatgtga acctcctcct ggctggctat 420 

gatgagc 427 



<210> 714 
<211> 433 
<212> DNA 
<213> Homo sapiens 



<400> 714 

agactgcgtg cagaaggtga ctgtctcagt ggagctgggt catctcagca ggccttggct 60 

ccttgaactt ttggccgcca tgtgcttccc gaagtcctct ctgatgacat gaagaagctg 120

aaggcccgaa tgcaccaggc catagaaaga ttttatgata aaatgcaaaa tgcagaatca 180 

ggacgtggac aggtgatgtc gagcctggca gagctggagg acgacttcaa agagggctac 240 

ctggagacag tggcggctta ttatgaggag cagcacccag agctcactcc tctacttgaa 300 

aaagaaagag atggattacg gtgccgaggc aacagatccc ctgtcccgga tgttgaggat 360 

cccgcaaccg aggagcctgg ggagagcttt tgtgacaagg tcatgagatg gttccaggcc 420 

atgctgcagc ggc 433 



<210> 715 
<211> 326
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 126 

<223> n=a, g, c or t 
<400> 715 

atggcggtct tggcacctct aattgctctc gtgtattcgg tgccgcgact ttcacgatgk 60 
ctcgcccaac cttactacct tctgtcggcc ctgctctctg ctgccttcct actcgtkagg 120 
aaactnccgc ygctctgcca cggtctgccc acccaacgcg aagacggtaa cccgtgtgac 180 
tttgactgga gagaaktgga gatcctgatg tttctcagtg ccattgtgat gatgaagaac 240 
cgcagatcca tgccccatga agagggacaa agaatgggat ataggagtgc acagcccagc 300 
ctgacctgtg acatctctgt gtttca 326 

<210> 716 
<211> 507 
<212> DNA 
<213> Homo sapiens 

<400> 716 

agactgaggc tgaggctggg gaacatcggg cagcatgagc gctgcggctc ttcctgcgca 60 

ccacggctgc ggctcgtgcc tgccggggtc tggtggtctc taccgcgaac cggcggctac 120 

tgcgcaccag cccgcctgta cgagctttcg ccaaagagct tttcctaggc aaaatcaaga 180 

agaaagaagt tttcccattt ccagaagtta gccaagatga acttaatgaa atcaatcagt 240 

tcttgggacc cgtggaaaaa ttcttcactg aagaggtgga ctcccgaaaa attgaccagg 300 

aagggaaaat cccagatgaa actttggaga aattcaagag cctagggctt tttgggctgc 360 

aagtcccaga agaatatggt ggcctgggct tctccaacac catgtactca cgactagggg 420 

agatcatcag catggatggg tccatcactg tgaccctggc agcgcaccag gctattggcc 480 

tcaaggggat catcttggct ggcactg 507 

<210> 717 

<211> 561 

<212> DNA 

<213> Homo sapiens 

<400> 717 

cttcagaatg ctggatgaaa aaattgaaaa gggtcgggat tactgttcgg aagaagagga 60 

tatcacatag caccaatttt accactcaaa ccaggagcta ctactgtgta aataggttac 120 

accccagttg aaatctttgc aaaggtcggt tctattcagc gaacagcact atagcaaaag 180 

aagatcgttc catattgtac gccccattaa attacagtgt ttcttaatga acttgcaaag 240 

gaatattgct aaaaacaaac aaaaaaaact gttatctaac tttctttgtt gctgctagtt 300 

aaaacttgtt gcaacttttc acttctcttg tgtccaggta tgcagcaaaa ttctgcaatt 360 

tcaccttaaa gatactgttg gttttacaga tgctctccaa cctattttct ataagatgag 420 

gtagtggtga actcagataa caaacttctc ttctaaactg gttctgcttc taagacaagc 480 

atctcctgcc ctctctcctt cctccccatc tctcgcacgc agtctagaga tggactgagc 540

cttgcttctc actggcagtg t 561 

<210> 718 
<211> 530 
<212> DNA 
<213> Homo sapiens 

<400> 718 

aacatctggg gacagcggga aaacatgagt gactccaagg aaccaagggt gcagcagctg 60 

ggcctcctgg aagaagatcc aacaaccagt ggcatcagac tttttccaag agactttcaa 120 

ttccagcaga tacatggcca caagagctct acagggtgtc ttggccatgg cgccctggtg 180 

ctgcaactcc tctccttcat gctcttggct ggggtccttg tggccatcct tgtccaagtg 240 

tccaaggtcc ccagctccct aagtcaggaa caatccgagc aagacgcaat ctaccagaac 300 

ctgacccagc ttaaagctgc agtgggtgag ctctcagaga aatccaagct gcaggagatc 360 

taccaggagc tgacccagct gaaggctgca gtgggtgagt tgccagagaa atccaagctg 420 




caggagatct accaggagct gacccggctg aaggctgcag tgggtgagtt gccagagaaa 480 

tccaagctgc aggagatcta ccaggagctg acccggctga aggctgcagt 530 

<210> 719 
<211> 512 
<212> DNA 
<213> Homo sapiens 



<400> 719 

aataatgctc ccatcaaatg tcatcaactt ctcccaagca gagaaacccg aacccaccaa 60 

ccaggggcag gatagcctga agaaacgtct acaggcaaaa gtcaaagtta ttggggtgca 120 

tagcagcctg gctggaagat tctgagtgct ctgtctgccc tggtgggttt cattctcctg 180 

tctgtcaacc cggctgcatt aaatcctgcc tcattgcagt gtaagttgga cgaaaaggat 240 

ataccaacca gacttcttct ttcttatgat tatcattcac cttacaccat ggactgccat 300 

agagccaaag ccagtctggc tggaactctg tctctgatgc tggtttctac tgtgttggag 360 

ttctgcctag ctgtgctcac tgctgtgctg cagtggaaac agactgtctg acttccctgg 420 

ttactcatgt cctcaaagac gactcatgay gctggatatg aagaactatt gacttcttgg 480

gaaaaaacgg agaaatatta attggaaagt tt 512 

<210> 720 
<211> 507 
<212> DNA 
<213> Homo sapiens 



<400> 720 

acctcgccag ggccggggcc gggacgggac ggaaaactgg cggcttggtt tgttcagcgc 60 

gtcgcgtcat ggctctcgtt tcaggcctcc acggtgtgca cctggagtca ggagcgcgct 120 

tgtgcgtaca gttcggttca gttactaaaa gatgtaactg aactgcagat ccttggtgaa 180 

atatctttca acaaatctct atatgaggga ctgaatgcag agaaccacag aactaagatc 240 

actgtcgtct tcctgaaaga tgagaagtac cattctttgc ctatcatcat taaaggcagc 300 

gttggtggac ttctggtgtt gatcgtgatt ctggtcatcc tgttcaagtg tggctttttt 360 

aaaagaaaat atcaacaact gaacttggag agcatcagga aggcccagct gaaatcagag 420 

aatctgctcg aagaagagaa ttaggacctg ctatccactg ggagaggcta tcagcagtcc 480

tgggacttgg agacccagca tcctttg 507 

<210> 721 
<211> 408 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 261 

<223> n=a, g, c or t 



<400> 721 

acctttttta ctttcacagc aatagtgcag aatccagaat ggatgtcctc tttgtagcca 60 

tctttgctgt gccacttatc ctgggacaag aatatgagga tgaagaaaga ctgggagagg 120 

atgaatatta tcaggtggtc tattattata cagtcacccc cagttatgat gactttagtg 180 

cagatttcac cattgattac tccatatttg agtcagagga caggctgaac aggttggata 240 

aggacataac agaagcaata nagactacca ttagtcttga aacagcacgt gcagaccatc 300 

cgaagcctgt aactgtgaaa ccagtaacaa cggaacctgt ggaagaaggc tgctatgact 360 

ctttggatgg gagtctggca agaggaaatt ggaagataaa ataaataa 408



<210> 722 
<211> 371 
<212> DNA 

<213> Homo sapiens



<220> 

<221> misc_feature

<222> 281 

<223> n=a, g, c or t 
<400> 722

agagaaagat gttccaggca gagggaagag gcctctttcc tccaggcctg gagctttcat 60 

gacgctggtc attatcttgc tgcatgtatt ctggggcatt gtattttttg atggctgtga 120 

gaagaaaaag tggggcatcc tccttatcgt tctcctgacc cacctgctgg tgtcagccca 180 

gaccttcata agttcttatt atggaataaa cctggcgtca satttataat cctggtgctc 240 

atgggcactg ggcattctta gctgcgggag gcactgccga nagcctgaaa ctctgcctgc 300 

tctgccaaga caagaacttt cttctttacr accagcgctc cagataacct cagggaacca 360 

gcacttccca a 371 

<210> 723 
<211> 404 
<212> DNA 
<213> Homo sapiens 

<400> 723 

aagctaggag ccgcgcragt cgtagtgtcg ctgtttgcgg gtctccgcgc gggaccgggg 60 

cgcagggggt cgctgaggcg agggtgtcat gtcagacaac gaggacaatt ttgatggcga 120 

cgactttgat gatgtggakg aggatraagg gctagatgac ttggagaatg ccgaagagtt 180 

ttagtagtga cggggttcac cgtgttagcc aggatggtct cgatctcctg ccttgtgatc 240 

cgcccgcctc ggcctcccaa agtgctggga ttacaggaag gccaggagaa tgtcgagatc 300 

ctcccctctg gggagccgac cgcagccaac cagaagcgaa tcaccacacc atacatgacc 360 

aagtacgagc gagcccgcgt gctgggcacc cgagcgctcc agat 404 

<210> 724 
<211> 486 
<212> DNA 
<213> Homo sapiens 

<400> 724 

aaagtggaag gcaacgacac acggcccgtg gcgccttcag ctagggagaa agagaggacg 60 

ggggctggcg ggcgggatta gctgccggag gctgagtttc cgcagcctga gtctccatct 120 

gtcgctagac tggagtgcag tggcgcgatc ttggctcact tcaacctcca catcctgggt 180 

tcaagcgact ctcctgcctc agcctcccaa gtagctggga ctacaggtgc gcaccaccat 240 

gttcaactaa tttttgtaat tttagtggag acgaggtttc accatgttgg ccaagatggt 300 

ctcgatctct tgaccttgtg atccacccac cttggcctcc caaaggtact gagattahag 360 

atgtgagcca ctgcgsctag ccagaaaaat atttattgat cctaaagtgg gagtcagcgc 420 

cactgttgmt gctagtgttg aatccagaam catagctgga catgggatca gaggatgaac 480

aagaga 486

<210> 725 
<211> 374 
<212> DNA 
<213> Homo sapiens 

<400> 725 

aaaaaaaaag tcctagctag ggtcccaagt gacagccagc atcaactata acatgtaaag 60 

gagccagtct tcagatgatt ccagcctcag ccttcaggcc actcagctgg tgccaaatag 120 

agtagggatg agctgtcccc acagagacct gcccagtgca cattgtgaga aagccccaga 180 

atgttgacag tcgctctcct agcccttctc tgtgcctcag cctctggcaa tgccattcag 240 

gccaggtctt cctcctatag tggagagtaa ggaagtggtg gtggaaagcg attctctcat 300 

tctggcaacc agttggacgg scccatcacc gccctscggt ccgagtcaac acatactaca 360 

tcgtaggtct tcar 374

<210> 726 
<211> 523 
<212> DNA 
<213> Homo sapiens 

<400> 726 

aacaagatgg cggcgctgcg ggacggctag cggccctgcg tggaggcgag gaatccgcat 60 

ctatggagat gtccctgcat cccatgactc ggagctgatg gccttcatga cgaggaagtt 120 

gtgggacctg gagcagcagg tgaaggccca gactgatgag atactgtcca aggatcagaa 180 
gatagcggcc ctagaggacc tggtgcagac cctccggcca cacccggccg aggcaaccct 240

gcagcggcag gaggaactgg agacgatgtg tgtgcagctg cagcggcagg tcagggagat 300 

ggagcggttc ctcagtgact atggcctgca gtgggtgggc gagcccatgg accaggagga 360 

ctcagagagc aagacagtct cagagcatgg cgagagggac tggatgacag ccaagaagtt 420

ctggaagcca ggggactcat tggcgccccc tgaggtggac tttgacaggc tgctggccag 480 

cctgcaggat cttagtgagc tggtggtaga gggtgacacc aaa 523 

<210> 727 
<211> 492 
<212> DNA 

<213> Homo sapiens 



<400> 727 

attttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 60

ttcggacccc agcggccgga tggtgaaatc ctccctgcag cggatcctca atagccactg 120

cttcgccaga gagaaggaag gggataaacc cagcgccayc catccacgcc agccgcacca 180

tgccgctcct aagcctgcac agccgcggcg gcagcagcag tgagagttcc agggtctccc 240

tccactgctg tagtaacccg ggtccggggc ctcggtggtg ctcctgatgc ccctcaccca 300

cccctgaaga tcccaggtgg gcgaggagga ggagtagggy bgcctcgggg ctgggcatcc 360

ggcccctggg gccacccctt gtcagccggg tgggtaggaa ccgtagactc gctcatctcg 420

cctgggtttg tccgcatgtt gtaatcgtgc aaataaacgc tcactccgaa tttagcggtg 480

tatttcttga ag 492



<210> 728 
<211> 467 
<212> DNA 
<213> Homo sapiens 

<400> 728 

acagggccac tgctgctcac agaagcagtg aggatgatgc caggatgatg tctgcctcgc 60 

gcctggctgg gactctgatg cccagccatg gccttcctct cctgcgtgag accagaaatc 120 

tgggagccct gcgtggaggt gtgaaatcca gacaattgaa gatggggcat atcagagcct 180 

aagccacctc tctaccttaa tattgacagg aaaccccatc cagagtttag ccctgggagc 240 

cttttctgga ctatcaagtt tacagaagct ggtggctgtg gagacaaatc tagcatctct 300 

agagaacttc cccattggac atctcaaaac tttgaaagaa cttaatgtgg ctcacaatct 360 

tatccaatct ttcaaattac ctgagtattt ttctaatctg accaatctag agcacttgga 420 

cctttccagc aacaagattc aaagtattta ttgcacagac ttgcggg 467

<210> 729 
<211> 528 
<212> DNA 
<213> Homo sapiens 

<400> 729 

atctttggga gctgcatttt tgataaataa gaaaaacctc ctcattttaa tgaagtagtt 60 

tcaatggagc aaaaacatgg aattaaagaa ccacagttca actccatcaa tgaagtaaca 120 

tcatttgcca actaacaaac tgagtctaca tctctttatt tgcaatgtaa aaccccatcc 180 

tgttagggga aaaaaaaata gtactaatgt tggatttaga agtccggagt tcaattttag 240 

gctttggaga gaagtaatcc acttttactt gagatttcca atctcatatc accaccatca 300 

tcattatcac cattatcatc accaccacca ccaccatgag caaccaacat catcatcatc 360 

aagaacaact tgcttttgct agccaaagca atgggaacac agcactctat tagggggcaa 420 

ggtattaatt tgctttttat ttaattataa caaaagcgaa atacatgttt ttgtaatttc 480 

ctatcaataa tttagaagaa ttaaatataa atataaaata aaaaccaa 528 



<210> 730 
<211> 379 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 221,233 

<223> n=a, g, c or t 
<400> 730

gtcggcaggt ctgaggcggc gctgtgtgtg tgaagcgtac ctagggcggg aggcgacatg 60 

gagacagggg cggccgagct gtatgaccag gcccttttgg gcatcctgca gcacgtgggc 120 

aacgtccagg atttcctgcg cgttctcttt ggcttcctct accgcaagac agacttctat 180 

cgcttgctgc gccacccatc ggaccgcatg ggcttycccg nccbkrggcc ggngaagccy 240 

tgggksctgc aggtattcaa aacctttgac cacatgrccc cgtcagratg atgagaagag 300 

aaggcaggaa cttgaagaga aaatcagaag aaaggaagag gaagaggcca agactgtgtc 360 

agctgctgca gctgagaar 379 

<210> 731 
<211> 103 
<212> DNA 
<213> Homo sapiens 

<400> 731 

aaagcaaccg cgtggtacga gtaacaactc caccggctgg cgtccgckgc gcttggcsac 60 
aagtcacttt tccagatcta ggagcaccat ggacacctcg tca 103

<210> 732 
<211> 842 
<212> DNA 
<213> Homo sapiens 

<400> 732 

actgtttacc aaacaccttg gtcataataa tgtcattagt ttctccattt ttattttctg 60 

aactgtacat tcacaactta tgtttctttg agattaatag atattggggg aaaaacgcct 120 

ttttaggaaa attatagtga aaatttgaca gttgattggc ataatttctt gtttgaatgc 180 

tgcctccatt atataggtcc ttccaggaac tcaaacactg taagtgaaat atgggagtat 240 

agtttttatt atttcttctt ttccttttgt tttcataata taatgcagtt tgttcaggaa 300 

atcagcacaa agcctgatag tactttacta aaatgactgc attctttgga ttccttcagt 360 

ctatggttca agtcactaaa gattcatttt tgttgagtcc ttatgagaaa cagcagtatg 420 

aatcttgacg gtttctgccc gtcctaatgg cagagctctc tgacttgggt gtatgctgcc 480 

aggctgggta ctttcatact ttgttttctt gttttgcttt aaaactacga ctcagcatac 540 

attttcccac atacattttt acattgtacc ttaggactca gtcatctcca cttaaattga 600 

tgacacaagc agctaataac catttctggg tttctgccta accccctaat tgtctgttaa 660 

agccaattct ctgggtgtcc agtgagtggt ggcttttttt ctttccacat tggcacattc 720 

acttctccca ctcttggcat gtaagaaata agcatttaca taattggaaa aatctggatt 780 

tctgatgcca aagggttaaa gcttcttgga tttcatttca ttgatataca gccactattt 840 

ta 842 

<210> 733 
<211> 351 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 333,345 

<223> n=a, g, c or t 

<400> 733 

agcaacgggg tttcctgaac cttcgggcca cccgaatcca ccttctcaga cggaaataga 60 

atccagggat tcttctaccc atggaaaaac gtgcggagga agcacatgtg ggcgctggtc 120 

tggacgtgca gcggcctcct cctcctgagc tgcagcatct gcttgttctg gtgggccaag 180 

cgccgggacg tgctgcatat gcccggtttc ctggcgggtc cgtgtgacat gtccaagtcc 240 

gtctcgctgc tctccaagca ccgagggacc aagaagacgc cgtccacggg cagcgtgcca 300 

gtcgccctgt ccaaagagtc cagggatgtg ganggaggca ccgangggga a 351 

<210> 734 
<211> 200 
<212> DNA 
<213> Homo sapiens 
<220>

<221> misc_feature 
<222> 43 

<223> n=a, g, c or t 
<400> 734 

tttgcttgac tgcttcctcc taaccctcta sccactagca ckntacttcc taaagctgtt 60 
gtgtcattaa ctctgttgga tcaactctct gggaaaagat tctgttaatg taagtgcact 120 
tactccctgg atgttgtcac tagtctagtg gcttttgcta aataaacctt tcttatttct 180 
agaaaaaaaa aaaaaaaaaa 200 

<210> 735 
<211> 160 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 118 

<223> n=a, g, c or t 
<400> 735 

aaaaggactt gaattctgat cgggctgcag acgctctgca ccccgcccaa gcttctatgg 60 
tactttggtg tcctctctag cactggcaag tcgccttcct cttcgcccgg cgccagdntc 120 
ccctctctat gatcgccgag ttcccggcgc cgtctcgttk 160 

<210> 736 
<211> 369 
<212> DNA 
<213> Homo sapiens 

<400> 736 

gcactggatg cacgcagcca catcagaaag gcggttaaga ggggcggggg aagcgctctg 60 

atccacgatg tcctgcgagc tgatcaatca gcttcacgct ggaaataacc atataaaaac 120 

ctgaactcaa agaagagaac tggaataatc ctccctaatc ccacttccca tggtgagccc 180 

cagctgggac gctgacacat ggccccaact ttttctcctg cctgatgtct ggacccacag 240 

gggacagatg gtggtccctg gtcactgaag tacccacaga acccatgcaa gaggcacaaa 300 

accatctgct tagtggtcat gtggcagcag acagtatctc agaagtcctc gcagcatatg 360 

acttcactg 369 

<210> 737 
<211> 382 
<212> DNA 
<213> Homo sapiens 

<400> 737 

aaacacacag atcgtcccca ggctgctgtt cgagaatgag tattcctata cgacccagat 60 

agactacaac cccaaggacc gcctgctcta tgcctgggac aatggccacc aggtcactta 120 

ccatgtcatc tttgcctact gacacccttg tccccacaag cagaagcaca gaggggtcac 180 

tagcaccttg tgtgtatgtg tgtgcgcgca cgtgtgtgta ggtgggaggc aggcaacacc 240 

aggagcagaa atgaaagagg caagaaataa gtgctatgtg gcgagaaaaa aagttttaat 300 

gtattggaga agttttaaaa aacccagaaa aacgcttttt ttttttaata aagaagaaat 360 

ttaaaatcaa aaaaaaaaaa ar 382 

<210> 738 
<211> 406 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 197 
<223> n=a, g, c or t
<400> 738 

agttctagag gctgggaagt ccgggattga ggggccagca tctaccaagg gccttcttgt 60 

tacttcatcc catgatggaa ggtggaaggg taaagagagg agaagaagca gtaccgggaa 120 

tcttacatca gtgacaacct ggacctcgac atggaccagc tggaaaaacg gtcgcgggcc 180 

agcgggagca gtgcggngca gcatgaaaca caagcgcctg gtsccgkcca tycaacsgcm 240 

akccacagca gttcccacac ctcgggcatt gaggcagaca ccaagccccg ggacacgggg 300 

ccggaagaca gctactccag cagtgcsats caccgcaagc tgaaaacctg cagctcaatg 360 

accagtcatg gcagctccca cacctcaggg gtggagartg gcggca 406 

<210> 739 
<211> 502 
<212> DNA 
<213> Homo sapiens 

<400> 739 

cactttgaaa gaaaactgcc attttgtatt taatagcctc caggttcttt cagtaatgtt 60 

atttgctctg tgtgtttttg tgtgtttgtt gatgtgcgtt tgtgcatatg cgtgagtttc 120 

attgccgggg ttggggcaca attgtggact ggggccatga ggcttccctg gtccccactg 180 

aacccacctt agttccacat ttggctgcat cttgaattat gccgactcca gacttctcct 240 

ccttttttgc ccttggctct tgacactcta aacccctgga ccatctgaat ggagcagcca 300 

agttcagtcc cacatttctg tactgttcct ctttcacagc tggaatatgt cacatgatga 360 

agttgtatag aaacagaacc atggatggat ggccaggatt gccgtggtcc ctagctagat 420 

ccccttccta tcaatcacct gatagcaaca gggacagctg ccaataccct gctctttact 480 

caatggtacc agggagggag ca 502 

<210> 740 
<211> 367 
<212> DNA 
<213> Homo sapiens 

<400> 740 

cagcaagccc aggcacttcg ggctgtcccc ggcccccaga gcccgggcac cgtggtcatc 60 

atcgccatcc tgtctatcct cctggccgtc ctcctcacgg tcctcctggc tgtgctcata 120 

tacacctgct tcaacagctg caggagcact tccctatcag gcccagagga ggcagggagt 180 

gtgagaagat acaccacgca cctcgcgttc agcactcctg ccgagggggc ttcctgaggg 240 

gttccagagg ggcccacgtg tccctccacc tcctccctgg cccaggctgc agagcctgag 300 

ctgggacacg ccctgaagct tctggaccct gagagagatt gtttcttttc tttactattt 360 

ttttttt 367 

<210> 741 
<211> 427 
<212> DNA 
<213> Homo sapiens 

<400> 741 

aatgttcttg tgaatcttag acaaccccca caacatcatt ccctgcttta tttgcttagt 60 

ccattcttgg tggattttat gagatgtcaa cactcctcat taggattccc tccttctgaa 120 

gcctttggga ttttctaccc acccagcaac caccagaggt gcagaaaccc ctgcagacat 180 

ccttcttgcg caasgcagtt gtaccatgaa tgcggtcgtc ctgctgcaga ccacgcccac 240 

caagggcggg gccacatcca ctgcagctga gctcagagcc ttctgtgggg tctgcagtgg 300 

caaggccagc cgccccctgt cttccctctg gagctgtttg aggaatgcag ttcttttgca 360 

taagaaaggc ttttctctcc agtcattctg cttgctgatc aagaacactt aacagcttca 420 

ggctttt 427 

<210> 742 

<211> 408 

<212> DNA 

<213> Homo sapiens 

<400> 742 

tcatctatgt ttatcacatt ttaataccac agcacttata atgatgtcac tacatataga 60 
agctcaaagt taagggattt gctgaagact gtaaagttaa tggaagaatt gagacaaaaa 120

tccagtgtag ctggccactt atcccagggc tttttctact tcatcacaag gaatgttttg 180 

aaagtgtctg ctttttttat ccttaaaatt cacctgtcag ggaggcatta aaaatttgga 240 

aatgtatgcc agcaaaatgt gagctctgta ttttttggca ttcttatgtt tgggtttaat 300 

aagattaaga aaatgatact gggaattttc tttttcctga aactttgaat caccctagta 360 

agtcaaagta ctaaaaaatg tactagatca ttaagactta tgtgctct 408 



<210> 743 
<211> 449 
<212> DNA 

<213> Homo sapiens 



<400> 743 

aacttgttat ctattgcagg aaagtgcctg caatatgcca ttgtgacaag gcaccatcaa 60

tcatcatgtc cctttaccag gagggcccgt gtcataagcc actatataaa gggaatgatg 120

ggtggactgc tttggacaaa tggacctgcg gtaggagaga gggacaacag taggagcagg 180

cagatcttgc tgtttcaacc aaaacctcat gctgaccaga gttgaggaac agaagaagat 240

ggtgaaggcc tgcagtcagc agaatatgtc aatgttttgg ttgaagaagc tgcttgaatc 300

tgggcttttc tgtgccatgt gttctcccag ggccagcaca aagawgggct tttggtgcag 360

gcccaagacc accataatca tcattgatta ttcctctcca cgccagktct ctaaataaac 420

tttctcttct ttctctgaaa aaaaaaaar 449



<210> 744 
<211> 207 
<212> DNA 
<213> Homo sapiens 



<400> 744 

atttccggtt ccggtgtcag ttcgaggcgc cgccgccgcc gccgcagccg ccggagccgc 60

aatgcctaaa ggaggaagaa agggaggcca caaaggccgg gcgaggcagt atacaagccc 120

tgaggagatc gacgcgcast gcaggctgag aagcagaagg ccagggaaga agaggagcaa 180

aaaaaaaaaa aaaaagctta tggatac 207



<210> 745 
<211> 208 
<212> DNA 
<213> Homo sapiens 

<400> 745 

agaacatcct aatcrwaaaa gttcgaggcg ccgccgccgc cgccgcagcc gccggagccg 60 

caatgcctaa aggaggaaga aagggaggcc acaaaggccg ggcgaggcag tatacaagcc 120 

ctgaggagat cgacgcgcas tgcaggctga gaagcagaag gccagggaag aagaggagca 180 

aaaaaaaaaa aaaaaagctt atggatac 208 

<210> 746 
<211> 442 
<212> DNA 
<213> Homo sapiens 



<400> 746 

cgtcatttag gcttttggga tttgttggta tggaaaaata catggattct catggtactc 60 

ggtggtttga gtagtctggc gttctctagg acactgtggg gaggtaggac gcagttgtct 120 

gcccctgtgc cccaaacatt cagctcagga tggctgctga gctkggkttt gggaaaactc 180 

acaaccctgt ggatgtaaca ggagtcagca gttttgatca ctggttcatg gaaacatgtg 240

agaatatata tgaacatctt ttgagtcttg ctggtattgt cattcattct ttcaatcact 300 

aattcattca tttatacatt cagccccttg tttaaatatg tgcttcctgc cggtcagcag 360 

gaagtcttaa gaatggagcc tggggtccaa atgagagccc tcagtggctg ttcaggggsc 420 

aatccgttat gtacagtggg tc 442 



<210> 747 
<211> 487 
<212> DNA 
<213> Homo sapiens 
<220>

<221> misc_feature 

<222> 221 

<223> n=a, g, c or t 
<400> 747 

tctgtggaaa ggcgaggtct ggaaagcagc ccctcccctg ggagagagcs cagggcaggc 60 
atgctgtcct gtttccccca ctcsscaaca gggctggcst ccggctctgt cataccctat 120 
ctactcctgc cttcttccag ctggcattcc ctggcagcga tggcggggaa actggcagac 180 
atgagcaggt gccatctgta tccaccacat actcctgctg ntgtggtgtg gcgaagacaa 240 
ggctgcccag ccccaacacc acgaggcacc ctcgggcatg ctgttgcttc tgggaccagt 300 
gaccagcctg stgsscctgc tggaccttag gaggccaagg caggtgaatt gcttgagtcc 360 
aggagtttga gaccagcctg agcaacatgg tgaaactccg tctctacaaa aaacaaaaaa 420 
aatccagcgt ggaggcatgc acctgtggtt tcagctactc aagaagtgag gcaggaggat 480 
tgcttka 487 

<210> 748 
<211> 601 
<212> DNA 
<213> Homo sapiens 

<400> 748 

aaaaagcgcc attacgcaga gagaaagtta cgaggttcgt ggccgcggtt tccccaggca 60 

gctggcgctg gaggcttcgg cgtcacgtgc tggtctggat ttttctcgat gcactgggga 120 

aagcggtgga ctcttatcgt gggagggctc ttgatctgtg atttatagat aggcacagct 180 

actcccgttc gggaacccaa cggcagacag gtcctagtgc ccatcagata cccgcggccg 240 

ggactcggag ctgtggggtg tggggaggcg gaggcaccaa ctaagagcga cctagcatcg 300 

caaagccgcc ctcggggcct catggcggga cgctcctggg aaaggcttta gcgcggtgtc 360 

tctctctctg gccttggctc tgtgactatc aggtcctcgc gctgtcgcgg catccaggcg 420 

ttcagaaact cgttttcatc ttcttggttt catcttaata ccaacgtcat gtctggttct 480 

aatggttcca aagaaaattc tcacaataag gctcggacgt ctccttaccc aggttcaaaa 540 

gttgaacgaa gccaggttcc taatgagaaa gtgggctggc ttgttgagtg gcagactata 600 

a 601 

<210> 749 
<211> 210 
<212> DNA 
<213> Homo sapiens 

<400> 749 

aaaaattaaa cctcaccgmc ggattcaatt ctctcgttct caggccgtct tcatcgccgg 60 

agsgtttwct tacgaaaaga atgggcttct ggacattgtt aaaaggtctg ctgctctttg 120 

caaatgcact ggcaatcctc aacgaagacc gtttccttgt tccaagagga tggacactcg 180 

gagaactcca ccaaaccggt agaagaaast 210 

<210> 750 
<211> 380 
<212> DNA 
<213> Homo sapiens 

<400> 750 

tgacataaaa ggttaaagaa caggcaacac aatgagcact taagttttta acatgtgggg 60 

aatagggcat tttaaaggct ggaaccagtt cagaggaaac aagggtttgg gtagaggtag 120 

aaaggtttaa ttaaccctgg agatctgcaa atggagggag gggtgaagga ggaatcttaa 180 

gacggaggga gaagagccaa ggacaaggtc aggttgaatk agtwagrggk gtgtacacah 240 

atcctcttgt gtgctaggta gtgggcgtgc atctggatct gagaggcagt aaagcagtga 300 

tgaagacagt gcagcagaac attgaagtcc tgattgcagc agtgactctg caaccttgga 360 

taaattattt caatctttat 380 



<210> 751 
<211> 245 
<212> DNA 
<213> Homo sapiens
<400> 751 

actcttctga ggatccggca agatggcaga agtagagsag aagaagaagc ggaccttccg 60 

caagttcacc taccgcggcg tggacctcga ccagctggca rsatggtggg sgtctacaac 120 

ggcaagacct tcaaccaggt ggagatcaag cccgagatga tcggccasta cctgggcgag 180 

ttctscatca cctacaagys cgtaaagcat ggscggsccg gcatcggggc cacccactcc 240 

tcccg 245 



<210> 752 
<211> 369 
<212> DNA 
<213> Homo sapiens 



<400> 752 

aagttagtac actgaaattc aaagtcatgc tcataactgt taatgaaagc agattcaaag 60 

caacaccacc accactgaag tatttttagt tatataagat tggaactacc aagcatgtgg 120 

ctcctggtca gtgtaattct aatctcacgg atatcctctg ttgggggata aggactgtgt 180 

ttctttcctt ttgtggaaaa tkgtcattct gaatcttcag gvcaaacaca tctggaaggt 240 

gatactgtac aaattatttg caacacagga tacagacttc aaaacaatga gaacaacatt 300 

tcatgtgtag aacggggctg gtccactcct cccaaatgca rgtccactat ttctgcagaa 360 

aaatgttgg 369 

<210> 753 
<211> 491 
<212> DNA 

<213> Homo sapiens 



<400> 753 

accaggtgcc cagtcttcca gttgcgaggg caagcaaacc cgtcatgagc aactcccttc 60 

cccatctctg ctcaccatgt ggacgctgaa atcgtccctg gtcctgcttc tgtgcctcac 120 

ctgcagctat gcctttatgt tctcttctct gagacagaaa actagcgaac cccaggggaa 180 

ggtgcmrtry ggagagcact ttcggattcg gcagaaycta ccagagcaca cccaaggctg 240 

gcttgggagc aaatggctct ggcttytktt tgytgttgtg ccgtttgtga tactgmagtg 300

tcaaagagac agtgagaaga ataaggagca gagtcctcct ggccttcgag gcttcccatt 360 

tcgcactcca ctaaagaaaa atcaaaatgc ttctctttac aaagactgtg tattcaatac 420 

cttaawsgaa ctygargtgg agcttwtgaa atttgtgtcc raagtgcrra atcttaaasg 480 

tgccatggca a 491 

<210> 754 
<211> 336 
<212> DNA 

<213> Homo sapiens 



<400> 754 

ataaatgctt tccaatcagt gctcccattc ttcattctat tcaccactta gttactagaa 60 

tgtttttctc aactatgaat ctgacttcac tcccattctt gaaaacctcc tgtagttcct 120 

gttgtgtacg ggataaagac tgtgagcccc tggagttcag ggaccattac ttaatcatat 180 

tccagaacct atcacaatga atgacacgaa gaatatattt gataaaaatc tgatgaatga 240 

accactctaa aatgcaactc taataaagaa aattactgcc tawgtcrgdd skgkmtcams 300 

gktayaaacc gtdaaaatta tgtatagtct gataaa 336 

<210> 755 
<211> 497 
<212> DNA 
<213> Homo sapiens 

<400> 755 

ataaatgctt tccaatcaat gctcccattc ttcattctat tcaccactta gttactagaa 60 

tgtttttctc aactatgaat ctgacttcac tcccwttctt gaaaacctcc tgtagttcct 120 

gttgtgtacg ggataaagac tgtgagcccc tggagttcag ggaccattwc ttaatcatat 180 

tccagaacct atcacaatga atgacacgaa gaatatattt gataaaaatc tgatgaatga 240 

accactctaa aatgcaactc taataaagaa aattactgcc tattcagttg gaactgtacc 300 
aygttacaaa cgtaaaatta tgtatagtct gataaagaaa tactrtaata gccttggaaa 360

cagctcacat acaatttttt ttataaatgg attcattacc atataactgg accagcttac 420 

tgaagagtag ccctacagca gtcacttcgc agtttgcaat tcatttgagg ttcaaagcta 480 
gctcaaggga aaacaac 497 

<210> 756 
<211> 400 
<212> DNA 
<213> Homo sapiens 



<400> 756 

atcattttcc ctgcaggaca cccggcctcg gtcctgcttt agtattcatc ccagcttcat 60

tactgcagcc acctcgcaac ccaccctggg cagctgctaa aagggggaaa aattaaagca 120

attcatactc ctggtgactt actgctgcag aataattggt ttctgtttgc aaggaacatt 180

atcacagcaa agcccagaaa acagaaaggg atgatcatgg tgattttact gctgcttgtg 240

gctatcgtgg ttgttgcagt ctggccgacc aactgatggc agtaaagaga ccaccagcag 300

tgacacctgc caatgacaga tgcaagccca acaccctttt ggtacgcaaa acctgctctc 360

aataaattcc cccaaagctc caaaaaaaaa aaaaaaaagg 400



<210> 757 
<211> 524 
<212> DNA 
<213> Homo sapiens

<400> 757

acttcctcgt tgcccccgcc gcgggccgag atggattccg ggtgctggtt gttcggcggc 60

gagttcgagg actcggtgtt cgaggagagg ccggagcggc ggtcaggacc gccgcgtcct 120

actgcgccaa gctctgcgag ccgcagtggt tttatgaaga aacagaaagc agtgatgatg 180

ttgaagtgct gactctcaag aaattcaaag gagacctggc ctacagacga caagagtatc 240

agaaagcact gcaggagtat tccagtatct ctgaaaaatt gtcatcaacc aattttgcca 300

tgaaaaggga tgtccaggaa ggtcaggctc ggtgtctggc tcacctgggt aggcatatgg 360

aggcgctgga gattgctgca aacttggaaa ataaagcarc carcacagac catttaacca 420

csgtactcta cctccagctt gctatttgtt caagtttgca gaacttggag aaaacaattt 480

tctgcctgca gaaactgatt tctttgcatc cttttaatcc ttgg 524



<210> 758 
<211> 543 
<212> DNA
<213> Homo sapiens 



<400> 758 

aagaagggag gtggtcgccc tccgtcgtgg tctggcgtgt attccgagcs ttggtgtctg 60 

gcggtttccg agcgttggtg tctggcggtt tccgaccgtt ggtgtctggc ggtttccgac 120 

cgttggtgtc tggcacgcgc caccctctct tgctttggtt gcgccatgcc gatgtaccag 180 

acaagaagac aagaaaatga tttgaggaca gcttcaatcg cggtgtgaag aagaaagcag 240 

caaaacgacc actgaaaaca acgccggtgg caaaatatcc aaagaaaggg tcccaagcgg 300 

tacatcgtca tagccggaaa cagtcagagc caccagccaa tgatmttttc aatgctgcga 360 

aastgccaaa agtgacatgc agggatgtcc ttcctgagat ccgtgctatc tgcattgagg 420 

aaattgggtg ttggatgcaa agctacagca cgtctttcct caccgacagc tatttaaaat 480

atattggttg gactctgcat gataagcacc gagaagtccg cgtgaagtgc gtgaaggctc 540 

tga 543 

<210> 759 
<211> 850 
<212> DNA 
<213> Homo sapiens 



<400> 759 

aagtttcatg ggttcctgga cgctcagcca ggggtagtct ctgaagggcc gcctggcaat 60

tgggagcaga agccagtttc ccgccatctg ctctccgtgg aggcagtgct ctcgcggctg 120

ctgcccgttg gggtccagct gaggaaggag cgcggatccc aggctcgttc tttgcctggg 180

ctgcccctgc ggcccctggg gcgggtgctg ctgcggcgcc agctccaagg gcgkatccag 240

gcgggagggg cctcctcgga gaagcggggc gcggtcccaa ctacgcagag gctggcacgc 300

cgaccctcca cacctcacca cgcccccatc tccgtccgtg tacacacact cacacaagga 360

cgccaacccc acctagatgc aaagcaggat tcaaaagaac atctttgcgt tttctaccgg 420 

ctccccatca tcgtactagg gaggaagaag cgggtgagaa acaaaacttc tttccattgt 480

cctgcccgtt tctgcggact tgttctgagg ccgaggcacc tctaagatac tgatggctct 540 

gcagaggacc cattcattgc ttctgctttt gctgctgacc ctgctggggc tggggctggt 600 

ccagcctcct atggccagga tggcatgtac cagcgattcc tgcggcaaca cgtgcaccct 660 

gaggagacag gtggcagtga tcgctactgc aacttgatga tgcaaagacg gaagatgact 720 

ttgtatcact gcaagcgctt caacaccttc atccatgaag atatctggaa cattcgtagt 780 

atctgcagca ccaccaatat ccaatgcaag aacggcaaga tgaactgcca tgagggtgta 840 

gtgaggtcac 850 



<210> 760 
<211> 532 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> 524 

<223> n=a, g, c or t 



<400> 760 

ataaaacaaa atagtatgca acttacagca atgtctctcc cacttataat aaaacaccta 60 

actctcatgt agcattttat ccatattgag agctctgtga gatagaaatt attatattac 120 

aagtcacaga tatgtaaact gagatgcaga gacatcaata acttgctcaa ggttacaagg 180 

tggacttagt agttttaagc agaaccatga aaacctctgt gacaactccc tccagctcca 240 

agagtgccgg gaggtggggg gcggcgcatc cgcggcctcg agcttgctac ctcagcccat 300 

ccccaccacc cctgacatcg agaacgctga gctcaccccc atcttgccct tcctgttcct 360 

tggcaatgag caggatgctc aggacctgga caccatgcag cggctgaaca tcggctacgt 420 

catcaacgtc accactcatc ttcccctcta ccactatgag aaaggcctgt tcaactacaa 480

gcggctgcca gcactgacag caacaagcag aacctgcggc agtntttgaa ga 532 



<210> 761 
<211> 275 
<212> DNA 
<213> Homo sapiens 



<400> 761 

tgtttgtttt gcttctacta agactgtttt gtttcaaaaa ggaaacaagt tttgtgtttg 60 

ctgtctacgc tggagtcctg aactgtgggt agaaaacacg acctggcttt gtagaaagga 120 

cacagggctg ttttatgaac taagcggtga ggctcaggtg gcggctctcg cagacccctg 180 

atgctgttgt tcttkgaggg cttaaggcct gatgaacgta ggcacgtgat gcataatagt 240 

cttcaatggt acacttaact agtctcttct gtgta 275 



<210> 762 
<211> 546 
<212> DNA 
<213> Homo sapiens 



<400> 762 

aggtatctag gtatcctccc atctcaggcc agactggaga ttgagcgcct gttgtatacg 60 

ggccctgggc taaccctccc cccatcacac aactcaacag caacgcccaa gccacaactt 120 

ctctggacct caggttccct acctgcaaaa cagtttgcac ctcctccctc cacaaaactg 180

caaccggggc cggcgcggtg gctcacgcct gtcatcttaa caccttggga ggcggaggtg 240 

ggcggatcat cagaggtcag gagttcgaga ccagcctggc caacaaggtg aaaccccatc 300 

tgtactaaaa atacagaaat cagccgggca tggtggctcg cgcctgtagt cccagctact 360 

tgggaggctg aggtaggaga atcgcttgaa tccgggaggt ggaagtcgca gtgagccgag 420 

atcgcgccac tgcactccag cctgggtgac acagtgagac tccacctcaa aaacaaaaca 480 

aaacaaaaaa ctgcccgcag agagctgccc cgaccacaag gcctgaatcc agccttccag 540 

ccccag 546 



<210> 763 
<211> 545 
<212> DNA

<213> Homo sapiens 



<400> 763 

ctggtgtctc tcaaaagcct gatcctgcca aaaccaagaa tcgccgcaaa aggaagccat 60

ccacttctga tgattctgac tctaattttg agaaaattgt ttcgaaagca gtcacaagca 120

agaaatccaa gggggagagt gatgacttcc atatggactt tgactcagct gtggctcctc 180

gggcaaaatc tgtacgggca aagaaaccta taaagtacct ggaagagtca gatgaagatg 240

atctgtttta aaatgtgagg cgattatttt aagtaattat cttaccaagc ccaagactgg 300

ttttaaagtt acctgaagct cttaacttcc tcccctctga atttagtttg gggaaggtgt 360

ttttagtaca agacatcaaa gtgaagtaaa gcccaagtgt tctttagctt tttataatac 420

tgtctaaata gtgaccatct catgggcatt gttttcttct ctgctttgtc tgtgttttga 480

gtctgctttc ttttgtcttt aaacctgatt tttaagttct tctgaactgt agaatagcta 540

tcttg 545



<210> 764 
<211> 500
<212> DNA 
<213> Homo sapiens 



<400> 764 

gacctgggct gcagttaggt tgtcgcctgc aggaggtgtg ctgttgaggg agaggaaaac 60 

cgtccgagga gaggatcaag tcttccatac cagggaccag gccaaaacca gactgccact 120 

gcccctactt gggccccact gtccccagaa agcagctgct gtgatcatgg gcaatatctt 180 

tggaaacctt ctcaagagcc tgattgggaa gaaggagatg cgcatcctga tggtgggcct 240 

ggatgccgca gaaagaccac catcctatac aagctgaaac tgggggagat cgtcaccacc 300 

atccctacca ttgggttcaa tgtggagaca gtggagtata agaacatcag ctttacagtg 360 

tgggatgtgg gtggccagga caagattcga cccctctgga gacactactt ccagaacacc 420 

caagggttga tatttgtggt cgacagcaat gatcgggagc gagtaaatga ggcccgggaa 480 

gagctgatga gaatgctggc 500 



<210> 765 
<211> 527 
<212> DNA 
<213> Homo sapiens 



<400> 765 

gctttgacaa aaccagggaa agagacagag agagataatg gcggctaaac taatctgttc 60 

gtctttaaca gtgcattcaa tggcaaataa gaagccatca ccttctgctg caacaagaac 120 

aataacctca aagaagagca cagcgctcca caggtgaaac tgctgacaag agtcgagcag 180 

ctcaagcttc tgaccaaagc cgagaaagca ggacttttgt ctttagcaga gaaatcaggt 240 

ttctctctat cgaccatcga gcgtcttgga ttgctgacca aagcagagga gttcggcgtt 300 

ttgtctgccg ccacaaaccc ggaaacgcct ggaacgttat tcactttgag cctcggttta 360 

ctccttcttg gaccggtttt tgcatatgtg gttcctgaag attacacttg ggaagtagtg 420 

attcaggttc ttgtggctct actctctgtt cttggtggct ctgctgcttt tgctgcttct 480 

ggttttgtct ccaatttgca gaaatctgat tagcccgatc gtttttt 527 



<210> 766 
<211> 921 
<212> DNA 
<213> Homo sapiens



<400> 766 

agtagggagg aaaaccggag gagagcgcag gaggaaacag taccggctgg aggccggtct 60 

tgcaggagcg ggggactgct gggggcgggg cttggtggtg accgctggcg gggcggggcc 120 

tggggctcag aggggtgggc tttggagatc agagggtcga cgctgcttcg ttgcctggac 180 

tctggtttcc gccctggagc aagccggggc ctggtcggca gctgggccgc catggagtcc 240 

acgctgggcg cgggcatcgt gatagccgag gcgctacaga accagctagc ctggctggag 300 

aacgtgtggc tctggatcac ctttctgggc gatcccaaga tcctctttct gttctacttc 360 

cccgcggcct actacgcctc ccgccgtgtg ggcatcgcgg tgctctggat cagcctcatc 420 

accgagtggc tcaacctcat cttcaagtgg tttctttttg gagacaggcc cttttggtgg 480 

gtccatgagt ctggttacta cagccaggct ccagcccagg ttcaccagtt cccctcttct 540

tgtgagactg gtccaggcag cccttctgga cactgcatga tcacaggagc agccctctgg 600 
cccataatga cagccctgtc ttcgcaggtg gccactcggg cccgcagccg ctgggtaagg 660

gtgatgccta gcctggctta ttgcaccttc cttttggcgg ttggcttgtc gcgaatcttc 720

atcttagcac atttccctca ccaggtgctg gctggcctaa taactggcgc tgtcctgggc 780

tggctgatga ctccccgagt gcctatggag cgggagctaa gcttctatgg gttgactgca 840

ctggccctca tgctaggcac cagcctcatc tattggaccc tctttacact gggcctggat 900

ctttcttggt ccatcagcct a 921



<210> 767 
<211> 570 
<212> DNA 

<213> Homo sapiens 

<400> 767 

aagggaatct ccgggcgggg tagtgcaggc gccgggtttc ccgcggtccg agctggcgcg 60 

ggcggaggag aatcgctctt aaagggccag cgcacacgcg ttcttttgtt ccggggccgc 120 

aggcggggca ggcccgactt tcgccgtctt cttgtctact ctccagaacg gccatgattt 180 

cccaattctt cattctgtcc tccaaggggg accgctcatc tacaaagact tccgcgggga 240 

cagtggcggc cgggatgtgg ccgagctctt ctaccggaag ctgacgggac tgccaggaga 300 

cgagtccccg gttgtcatga ggcagaggtt gcagtgagcc gagatccgga caccgcactc 360 

cagcctggga gacagagcga gactccctct cgggcatcac catggccgtc atttcattca 420 

catcagacac agcggcctct atttggtggt cacaacttca gaaaacgttt ctcccttcag 480

cctcctagaa ctgctctcca ggttggccac ccttctgggc gattactgtg gctccctggg 540 

cgaggggaca tctcccgmaa tgtggctctg 570 

<210> 768 
<211> 490 
<212> DNA 
<213> Homo sapiens 



<400> 768 

cactcgctgc ctcggcagcg ctgctcttct aagatggctg ccgctaccgg tgcggtggca 60

gcctcggccg cctcgggtca ggcggaaggt aaaaagatca ccgatctgcg ggtcatcgat 120

ctgaagtccg agctgaatcg gcggaactta gacatcaccg gagtcaagac cgtgctcatc 180

tcccgactca agcaggctat tgaagaggaa ggaggcgatc cagataatat tgaattaact 240

gtttcaactg atactccaaa caagaaacca actaaaggca aaggtaaaaa acatgaagca 300

gatgagttga gtggagatgc ttctgtggaa gatgatgctt ttatcaagga ctgtgaattg 360

gagaatcaag aggcacatga gcaagatgga aatgatgaac taaaggactc tgaagaattt 420

ggtgaaaatg aagaagaaaa tgtgcattcc aaggagttac tctctgcaga agaaaacaag 480

agagctcatg 490



<210> 769 
<211> 491 
<212> DNA 
<213> Homo sapiens 



<400> 769 

cactcgctgc ctcggcagcg gctgctcttc taagatggct gccgctaccg gtgcggtggc 60 

agcctcggcc gcctcgggtc aggcggaagg taaaaagatc accgatctgc gggtcatcga 120 

tctgaagtcc gagctgaatc ggcggaactt agacatcacc ggagtcaaga ccgtgctcat 180 

ctcccgactc aagcaggcta ttgaagagga aggaggcgat ccagataata ttgaattaac 240 

tgtttcaact gatactccaa acaagaaacc aactaaaggc aaaggtaaaa aacatgaagc 300 

agatgagttg agtggagatg cttctgtgga agatgatgct tttatcaagg actgtgaatt 360 

ggagaatcaa gaggcacatg agcaagatgg aaatgatgaa ctaaaggact ctgaagaatt 420 

tggtgaaaat gaagaagaaa atgtgcattc caaggagtta ctctctgcag aagaaaacaa 480 

gagagctcat g 491 

<210> 770 

<211> 403 
<212> DNA 
<213> Homo sapiens 

<400> 770 

cgcacacgcg gaactcgaac tcctgcaggg cctcgtagtc cagcgacctr agggcgaaca 60 
ggtggccgtt gtctgtgttg atggagacca gggaggcgag gggcagaatc gcggtcttga 120

atgctgaaaa gagctgcaac ggtctcagga gaattttcgc ggatggggct tgtgaatgaa 180 

gccatggtca gttccggagc gttatcgttg atatctacca cctgtattat gacagtgcat 240 

tttccagaaa gacctgcccc atcagaggct ttaatatcca cctgatatga cacaatttww 300 

wmaaaatcta gttttctgat tattttaatt tctccccttc gttcattaag tgcaaatgtc 360 

ttagaaatct cttcatcacc ataaaaaatg agtagaataa aaa 403 



<210> 771 
<211> 374 
<212> DNA 

<213> Homo sapiens



<400> 771 

attcttttcc cccttctcac cggccaggat gcagacataa tgatggcagg agctggagca 60 

gccacctgag gacccagagc tcaaagccac atgttgagaa gggcagagat aactgtatcc 120 

actctggact gctgaccttt gaactattat gttatttcca gggaaatgca aaccaaagga 180 

tgtggtctct gatctaatcc ttagagaatg tgaccatgaa gacacttttc ctacctggta 240 

aacaaaagat aatgagaaaa gtgaggttgg aagttggttt actgagccag gagctataac 300 

aggtgctgga gcaggggtcg tgatctgaat gaccagaggg aaggactgat ggaattggat 360 

ggtgakagct ccag 374 

<210> 772 
<211> 380 
<212> DNA 
<213> Homo sapiens 



<400> 772 

agaaacagtc tctccccagc tcacttggct gaagatacta aagagaacta gatgttatct 60

agtgttatcc tgctgatccc aatttcctat agttctggca ccagaaaatc ctgaatgata 120

ggtttttctg aagctgctga catcgggtca gccctcaaca ccatcacctt tggccatgtt 180

caggaagtac tcaaagatca tggctcaaaa gtctgcagag cagagacaag agattgacaa 240

agaggtgttg atgagggcaa taaaagcaaa ggtggcttta tcacggtgag ctacttctca 300

caggagtcgg gatccacatc tgcagactac acaaaaacaa acaacacaga ttaaaagcac 360

aatcaccatt gaaaaaaaar 380



<210> 773 
<211> 472 
<212> DNA 
<213> Homo sapiens 



<400> 773 

agatggctca ggtgcttcgg ctgccacggc agaggttctc tgctctcttt tctctgctgg 60 

aactgggccc tagagactgg ggcaggcggt tctgactcag aagatctttt ggaagaattg 120 

ctgtgctgtt tgatgcagtt gatcactgat attccactct tagatattac atatgaaata 180 

tcagtagaag ctatatcaac aatggttgtt ttcctttcct gccaactctt ccacaaagaa 240 

gttttgcgac agagcatcag ccacaagtat ttgatgcgag gtccatgtct tccatacacc 300 

agcaaacttg tgaagacctt attatataac tttatcagac aagaaaagcc acctcctcca 360 

ggggcccatg ttttccctca gcagtcggat gggggaggac tgctttatgg acttgcatca 420 

ggagtagcaa caggactctg gactgtcttc acactaggtg gtgtgggcag ca 472 



<210> 774 
<211> 521 
<212> DNA 
<213> Homo sapiens 



<400> 774 

attccaggcg cctgcaacta aacgtggccg ggtctgcaag ctaggtgcca gcggggaaag 60

tttccctgct tcttatcgtc tgctttaacg ccttaaatag cccgctgaag gctgcagcag 120

gtgctaggta gcagcctccc ggccctcggg aaaggcgggg tggggaggcg agagcagctt 180

agcctcctcg acctcccttc ctggtgacgg acgaacagtt cccgtagaat ttcgcttcac 240

cgagtgacct tgagcccagg gcgacggtca gcttgttaat tcctggctgc aggaactttg 300

tgagaatttt aatgcatgga aaagctgtcc atgttccaac tgctgtacat ccaaaagtct 360

cagtgtaata gcaggaccaa aatattctgt caatcagctg accatatact taatgactcc 420

taaaatctcg tggacttcta agaaagcgcc atggcctgtg ctgctgttat gattcctggg 480

ttgttgcggt gctctgttgg agccatccgt attgaggctg y 521


<210> 775
<211> 447
<212> DNA
<213> Homo sapiens

<400> 775

acttagctga tagctagcat tagctgctgg ccaggtgcat aagccatctt agatgtccag 60

cccaggttca tagaatggga tgcccactga tgacaatgcc aggccctcca 120

gtaagagagg ccgaagagga gaatctggtc ctcctggaca gcctggtcct cagggccctc 180

ctggtccaaa aggcgataag ggagaacaag gtgatcaggg acctaggatg stgtttccta 240

aaatcaatca tgggtttctc tctgctgatc agcagctcat taaacgccgc ctgattaagg 300

gtgaccaagg acaggcaggg cctccaggac cccctggccc tccaggccca agagggccac 360

ctggggacac agggaaagat ggcccccgtg gaatgccagg agtacccggt gaaccaggaa 420

agccaggaga acaaggcttg atgggtc 447



<210> 776 
<211> 491 
<212> DNA 
<213> Homo sapiens 



<400> 776 

attctacagg aagcaatttt ttggttgtga agggagaaat gggctcctca tggaggttcc 60 

tgggctccac tgaggaagag cccttcctcc cagcagtgtt gtataaagta gagaagataa 120 

atggatggag aacgcctgct ccaagccgga cgacgacatt ctagacatcc cgctggacga 180 

tcccggcgcc aacgcggccg ccgccaaaat ccaggcgagt tttcggggcc acatggcgcg 240 

gaagaagata aagagcggag actaggccag aactgagcat tttcaaagtt cccgaggaga 300 

gatggatgcc gcgtcccctt cgcagcgacg agacttccct gccgtgtttg tgaccccctc 360 

ctgcccagca acctgccagc tacaggagcc ccctgcgtcc cagagactcc ctcacccagg 420 

caggctccgt cgcggagtcg ctgagtccgt gcccttttag ttagttctgc agtctagtat 480

ggtcccawtt g 491

<210> 777 
<211> 389 
<212> DNA 
<213> Homo sapiens 

<400> 777 

tgtgtaatgt tgaatctgga aattgatcag cattaaaggg cacatgaagc agtgtctgca 60 

ggcgttcagt gctgcggasc tgttaaaggt cactcagatg tgcaggtgtt aatcttctct 120 

aaaagcctgg tgatacagct ctggctttct gagcacacta cggatctgga aaatactgga 180 

aaatgtgata cttagaatac tttggctgct aaggaaactt cctctccatt gcagaatagc 240 

tgagccaagt gagtgagttt gcagaaagca ggtggtgagc tcctgcctgc tggaggttgc 300 

catggagggc cattcctgcc cggcaacagc accgtcctgc agggagccac ttggcagaag 360 

ggtgcagggc tgctggtgtc agagcaaga 389 

<210> 778 
<211> 440 
<212> DNA 

<213> Homo sapiens 



<400> 778 

tgtagtgttt gcagtagcgt ttctcacctt ccctgatgaa gggtcttttc agttgcaaac 60 

caacttgcca cctcagcggc agaagttgag tgttgttcgt gtcaccacag agtctcgttc 120 

tgtcacccag cctggtgtgc agtggttcar gccattctcc tgcctcagcc tcccaagtag 180 

ctgggattac aggcgcccac caccaagccc agctaatttt tgtatgttcg gtagagacgg 240 

ggtttcacca tgttggccag gctggtctca aactcctgac ctcaagcgac ccacccgcct 300 

cggcctccca gagttttgcg attacaggcg tgacaccacg cccagcggaa tttcttattc 360 

tttttgatct atcaaaacaa cctcattgtt ctcaatgact gaccttataa taagtaaatg 420 

ttatagaaca taaaaaaaaa 440 
<210> 779
<211> 418 
<212> DNA 
<213> Homo sapiens 



<400> 779 

atagattaca acggtgatgg cgggcaagcg gtccggctgg agccgggcgg ctctcctcca 60 

gctccttctc ggcgtgaacc tggtggtgat gccgcccacc cgggcccgga gtctgcgctt 120 

cgttaccttg gaggggatgc tacagcactg ggaactgggc caggccctgc ggcasgctat 180 

cacggcttcc taaacacctc ttatcaccgg caagaggttt atgtgcgasr gacttgaccg 240 

gactctcatg agtgctgagg ccaacctggc tggactcttc cctcccaacg gatgcagcgc 300 

ttcaacccga acatctcgtg gcagcctatt cctgtgcaca ctgtsccatc actgaggaca 360 

ggctgctgaa gttcccgttg ggcccatgtc cccgttatga gcagctgcag aacgagac 418 



<210> 780 
<211> 362 
<212> DNA 
<213> Homo sapiens 



<400> 780 

aaacttgccc ttctcaaaca tggccgccac ggcgcctctg gaagggaacc gctctgggcc 60 

ccgcctttga tctcgttggt ggggctgggg gatgagagct gcaccgcgcg ggacaagtcg 120 

ccggcggccc gacggagcag aagagagagc atggagctgg agaggatcgt cagtgcagcc 180 

ctccttgcct ttgtccagac acacctcccg gagccgacct cagtggcttg gatgaggtca 240

tcttctccta tgtgcttggg gtcctggagg actgggcccc tcggcattca gaggagaact 300 

tcgatatgga ggctttcact gagatgatgg aggcctatgt gcctggcttc gccacatccc 360 

ca 362 



<210> 781 
<211> 379 
<212> DNA 
<213> Homo sapiens



<400> 781 

agattatttc tacttggttg ccttccagat gtttcacact tggacagcaa actgatttca 60

aaccactcyt tttcaaagat ctctgaggga gacattgcac ctggccactg cagcccagag 120

caggtctggc cacggccatg agcatgctga gccatcatgc ccaccgtgga tgacattctg 180

gagcaggttg gggagtctgg ctggttccag aagcaagcct tcctcatctt atgcctgctg 240

tcggctgcct ttgcgcccat ctgtgtgggc atcgtcttcc tgggtttcac acctgaccac 300

cactgccaga gtcctggggt ggctgagctg agccagcgct gtggctggag ccctgcgagg 360

agctgaacta tacagtgcc 379



<210> 782 
<211> 453 
<212> DNA 
<213> Homo sapiens



<400> 782 

ttctcatctt ttgtaatgtg tggtcctcac ttggcctctg ggacacctcc aagtcttagt 60 

tgtcttccta actcactagt tgctccttct cactctcctt tgctggttct tccttttctc 120 

cttgacctct taatgttaga gtcccagagt gtagtcatag tccttctctg tgatctattt 180 

ttaccctgtc tttgtgatct agtataatgg ccaccaccta taaacttata gctcacgaat 240 

ttatatcact agctctgaga tctctcacga actccagatg tgcacatctg actgcttact 300 

caacatctcg aatgcctgtt cttcaagagt ttgagtcatg aagtgtgaag cctagagatg 360 

gtgtgataca cagagatgaa actggttttc acaggtgata gatggaaggc tgttttcaaa 420 

agatcaagct tgaccacata ttatcccctc cac 453 



<210> 783 
<211> 553 
<212> DNA 
<213> Homo sapiens 



<400> 783 
aactacttcc gggggagcgg cgcggcggcg sggagttggt tctaaagagt ggtgagtcag 60

aagagacgtc aggcagcaag cgacttgggc catggctctg acctagactt ctcacctccg 120 

cagtgcccga gcccactttc ctggagaacc tgctacggta cggactcttc ctgggagcca 180 

tcttccagct catctgtgtg ctggccatca tcgtacccat tcccaagtcc cacgaggcgg 240

aggctgaacc gtctgagccc agaagtgctg aggtgacgag gaagcccaag gctgctgttc 300 

cttctgtgaa caagaggccc aagaaagaga ctaagaagaa gcggtagaag aggaggcctg 360 

aggagctggg cgggcaggga gagggtcttg gggacagccc tcctgggaat ctacattgtg 420 

ttcccccgca ttccaggctc agggtctgag gaggctgtga cgccctatga ccgcagagat 480 

ctagacagtc gtaacagtcc ccaggctcba gctgggcaat ccaccacttc ctcttccttc 540 

tgcttctgtg acg 553 

<210> 784 
<211> 472 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 445 

<223> n=a, g, c or t 

<400> 784 

ttccctttcg aattccaggg tatatctggg aggccggaga cgtgtctggt tattacacag 60 
atgcacagct ggacgtggga tccacacagc tcagaacagt tggatcttgc tcagtctctg 120 
tcagaggaag atcccttgga caagaggacs ctgccttggt gtgagagtga gggaagarga 180 
agctggaacg agggttaagg aaaaccttcc agtctggaca gtgactggag agctccaagg 240 
aaagcccctc ggtaacccag ccgctggcac catgaaccca gagagcagta tctttattga 300 
ggattacctt aagtatttcc aggaccaagt gagcagagag aatctgctac aactgctgac 360 
tgatgatgaa gcctggaatg gattcgtggc tgctgctgaa ctgcccaggg atgaggcaga 420 
tgagctccgt aaagctctga acaangcttg caagtcacat ggtcatgaag ga 472 

<210> 785 
<211> 157 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -29..-1

<220> 

<221> UNSURE 
<222> 120 

<223> Xaa = His, Gln



<400> 785 



Met 


Ala 


Thr 


Met 


Val 
-25 


Pro 


Ser 


Val 


Leu 


Trp 
-20 


Pro Arg 


Ala 


Cys 


Trp 

-15 


Thr 


Leu 


Leu 


Val 


Cys 
-10 


Cys 


Leu 


Leu 


Thr 


Pro 
-5 


Gly 


Val Gin 


Gly 


Gin 
1 


Glu 


Phe 


Leu 


Leu 

5 


Arg 


Val 


Glu 


Pro 


Gin 

10 


Asn 


Pro 


Val 


Leu Ser 
15 


Ala 


Gly 


Gly 


Ser 




Phe 


Val 


Asn 


Cys 


Ser 


Thr 


Asp 


Cys 


Pro 


Ser Ser 


Glu 


Lys 


He 


Ala 


20 










25 










30 








35 




Glu 


Thr 


Ser 


40 


Ser 


Lys 


Glu 




Val 
45 


Ala Ser 


Gly 


Met 


Gly 
50 


Trp 


Ala 


Ala 


Phe 


Asn 
55 


Leu 


Ser 


Asn 


Val 


Thr 
60 


Gly 


Asn Ser 


Arg 


He 
65 


Leu 


Cys 


Ser 


Val 


Tyr 
70 


Cys 


Asn 


Gly 


Ser 


Gin 
75 


He 


Thr 


Gly Ser 


Ser 
80 


Asn 


He 


Thr 


Val 


Tyr 
85 


Arg 


Leu 


Pro 


Glu 


Arg 
90 


Val 


Glu 


Leu 


Ala Pro 
95 


Leu 


Pro 


Pro 


Trp 


Gin 


Pro 


Val 


Gly 


Gin 




Phe 


Thr 


Pro 


Ala 


Leu Pro 


Ser 


Gly 


Gly 


Trp 
100

Val Ala Pro Asp Xaa 
120 

<210> 786 
<211> 151 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -16..-1

<400> 786

Met Val Met Leu Leu Leu Leu Leu Ser Ala Leu Ala Gly Leu Phe Gly
-15 -10 -5

Ala Ala Glu Gly Gln Ala Phe His Leu Gly Lys Cys Pro Asn Pro Pro
1 5 10 15

Val Gln Glu Asn Phe Asp Val Asn Lys Tyr Leu Gly Arg Trp Tyr Glu
20 25 30

Ile Glu Lys Ile Pro Thr Thr Phe Glu Asn Gly Arg Cys Ile Gln Ala
35 40 45

Asn Tyr Ser Leu Met Glu Asn Gly Lys Ile Lys Val Leu Asn Gln Glu
50 55 60

Leu Arg Ala Asp Gly Thr Val Asn Gln Ile Glu Gly Glu Ala Thr Pro
65 70 75 80

Val Asn Leu Thr Glu Pro Ala Lys Leu Glu Val Lys Phe Ser Trp Phe
85 90 95

Met Pro Ser Ala Pro Tyr Trp Ile Leu Ala Thr Asp Tyr Glu Asn Tyr
100 105 110

Ala Leu Val Tyr Ser Cys Thr Cys Ile Ile Gln Leu Phe His Val Asp
115 120 125

Phe Ala Trp Ile Leu Ala Arg
130 135



<210> 787 
<211> 143 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -17..-1



<400> 787 



Met 


Ala 


Leu 
-15 


Glu 


Val 


Leu 


Met 


Leu 

-10 


Leu 


Ala Val 


Leu 


He 

-5 


Trp 


Thr 


Gly 


Ala 


Glu 
1 


Asn 


Leu 


His 


Val 
5 


Lys 


He 


Ser 


Cys Ser 
10 


Leu 


Asp 


Trp 


Leu 


Met 
15 


Val 


Ser 


Val 


He 


Pro 


Val 


Ala 


Glu 


Ser Arg Asn 


Leu 


Tyr 


He 


Phe 


Ala 










20 










25 








30 




Asp 


Glu 


Leu 


His 


Leu 


Gly Met 


Gly 


Cys 


Pro Ala 


Asn 




He 


His 


Thr 








35 










40 








45 






Tyr 


Val 


Tyr 
50 


Glu 


Phe 


He 


Tyr 


Leu 
55 


Val 


Arg Asp 


Cys 


Gly 
60 


He 


Arg 


Thr 


Arg 


Val 


Val 


Ser 


Glu 


Glu 


Thr 


Leu 


Leu 


Phe Gin 


Thr 


Glu 


Leu 


Tyr 


Phe 


65 










70 








75 










Thr 


Pro 


Arg 


Asn 


He 


Asp 


His 


Asp 


Pro 


Gin Glu 


He 


His 


Leu 


Glu 


Cys 


80 










85 








90 










95 


Ser 


Thr 


Ser 


Arg 


Lys 
100 


Ser 


Val 


Trp 


Leu 


Thr Pro 
105 


Val 


Ser 


Thr 


Glu 

110 


Asn 


Glu 


He 


Lys 


Leu 
115 


Asp 


Pro 


Ser 


Pro 


Phe 
120 


He Ala 




Phe 


Gin 
125 


Thr 
105 110 115

Pro His Gly Gly Ala Ala Ser Leu
<210> 788
<211> 148 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -17..-1

<400> 788 



Met 


Ala 


Leu 
-15 


Glu 


Val 


Leu 


Met 


Leu 
-10 


Leu 


Ala 


Val 


Leu 


He 
-5 


Trp 


Thr 


Gly 


Ala 


Glu 


Asn 


Leu 


His 


Val 


Lys 


He 


Ser Cys 


Ser 


Leu Asp 


Trp 


Leu 


Met 




1 








5 










10 










15 


Val 


Ser 


Val 


lie 


Pro 


Val 


Ala 


Glu 


Ser Arg Asn Leu 


Tyr 


He 


Phe 


Ala 










20 










25 










30 




Asp 


Glu 


Leu 


His 


Leu 


Gly Met 


Gly 


Cys 


Pro 


Ala 


Asn 


Arg 


He 


His 


Thr 








35 










40 










45 






Tyr 


Val 


Tyr 


Glu 


Phe 


He 


Tyr 




Val 




Asp Cys 


Gly 


He 




Thr 






50 










55 










60 








Arg 


Val 
65 


Val 


Ser 


Glu 


Glu 


Thr 
70 


Leu 


Leu 


Phe 


Gin 


Thr 
75 


Glu 


Leu 


Tyr 


Phe 


Thr 


Pro 


Arg 


Asn 


He 


Asp 


His 


Asp 


Pro 


Gin 


Glu 


He 


His 


Leu 


Glu 


Cys 


80 










85 










90 










95 


Ser 


Thr 


Ser 


Arg 


Lys 
100 


Ser 


Val 


Trp 


Leu 


Thr 
105 


Pro 


Val 


Ser 


Thr 


Glu 
110 


Asn 


Glu 


lie 




Leu 
115 


Asp 


Pro 


Ser 


Pro 


Phe 
120 


He 


Ala 


Asp 


Phe 


Gin 
125 


Thr 


Thr 


Ala 


Glu 


Glu 
130 


Leu 



























<210> 789 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -24..-1

<220> 

<221> UNSURE 
<222> 51 

<223> Xaa = Glu, Lys 
<400> 789 

Met Leu Pro Pro Leu Pro Ser Arg Leu Gly Leu Leu Leu Leu Leu Leu
-20 -15 -10

Leu Cys Pro Ala His Val Gly Gly Leu Trp Trp Ala Val Gly Ser Pro
-5 1 5

Leu Val Met Asp Pro Thr Ser Ile Cys Arg Lys Ala Arg Arg Leu Ala
10 15 20

Gly Arg Gln Ala Glu Leu Cys Gln Ala Glu Pro Glu Val Val Ala Glu
25 30 35 40

Leu Ala Arg Gly Ala Arg Leu Gly Val Arg Xaa
45 50

<210> 790 
<211> 64 
<212> PRT 

<213> Homo sapiens 
<220>

<221> SIGNAL 
<222> -22..-1

<400> 790 

Met Arg Gly Met Lys Leu Leu Gly Ala Leu Leu Ala Leu Ala Ala Leu
-20 -15 -10

Leu Gln Gly Ala Val Ser Leu Lys Ile Ala Ala Phe Asn Ile Gln Thr
-5 1 5 10

Phe Gly Glu Thr Lys Met Ser Asn Ala Thr Leu Val Ser Tyr Ile Val
15 20 25

Gln Ile Leu Ser Arg Tyr Asp Ile Ala Leu Val Gln Glu Val Arg Asp
30 35 40

<210> 791 
<211> 117 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -33..-1

<220> 

<221> UNSURE 
<222> 82 

<223> Xaa = Glu, Gly 
<400> 791 



Met Ser Ala Gln Glu Ser Cys Leu Ser Leu Ile Lys Tyr Phe Leu Phe
-30 -25 -20

Val Phe Asn Leu Phe Phe Phe Val Leu Gly Ser Leu Ile Phe Cys Phe
-15 -10 -5

Gly Ile Trp Ile Leu Ile Asp Lys Thr Ser Phe Val Ser Phe Val Gly
1 5 10 15

Leu Ala Phe Val Pro Leu Gln Ile Trp Ser Lys Val Leu Ala Ile Ser
20 25 30

Gly Ile Phe Thr Met Gly Ile Ala Leu Leu Gly Cys Val Gly Ala Ser
35 40 45

Arg Ser Ser Ala Ala Ser Trp Ala Cys Ile Leu Gly Cys Cys Cys Ser
50 55 60

Cys Leu Pro His Arg Ser Pro Trp Glu Ser Ser Ser Pro Leu Ser Gly
65 70 75

Pro Ala Xaa Ala Lys
80

<210> 792 
<211> 82 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -32..-1

<220> 

<221> UNSURE 
<222> 35 

<223> Xaa = His, Gln



<400> 792 

Met Ala Lys Met Phe Asp Leu Arg Thr Lys Ile Met Ile Gly Ile Gly
-30 -25 -20

Ser Ser Leu Leu Val Ala Ala Met Val Leu Leu Ser Val Val Phe Cys
-15 -10 -5

Leu Tyr Phe Lys Val Ala Lys Ala Leu Lys Ala Ala Lys Asp Pro Asp
1 5 10 15

Ala Val Ala Val Lys Asn His Asn Pro Asp Lys Val Cys Trp Ala Thr
20 25 30

Asn Ser Xaa Ala Lys Ala Thr Thr Met Glu Ser Cys Pro Ser Leu Gln
35 40 45

Cys Cys
50



<210> 793 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19..-1



<400> 793 

Met Lys Ile Leu Val Ala Leu Ala Val Phe Phe Leu Val Ser Thr Gln
-15 -10 -5

Leu Phe Ala Glu Glu Ile Gly Ala Asn Asp Asp Leu Asn Tyr Trp Ser
1 5 10

Asp Trp Tyr Asp Ser Asp Gln Ile Lys Glu Glu Leu Pro Glu Pro Phe
15 20 25

Glu His Leu Leu Gln Arg Ile Ala Arg Arg Pro Lys
30 35 40



<210> 794 
<211> 79 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -28..-1



<400> 794 

Met Ile Tyr Thr Met Lys Lys Val His Ala Leu Trp Ala Ser Val Cys
-25 -20 -15

Leu Leu Leu Asn Leu Ala Pro Ala Pro Leu Asn Ala Asp Ser Glu Glu
-10 -5 1

Asp Glu Glu His Thr Ile Ile Thr Asp Thr Glu Leu Pro Pro Leu Lys
5 10 15 20

Leu Met His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys Lys
25 30 35

Ala Ile Met Lys Arg Phe Ser Ser Ile Phe Ser Leu Asp Ser Ala
40 45 50



<210> 795 
<211> 76 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -24..-1



<400> 795 

Met Gly Ser Ala Cys Ile Lys Val Thr Lys Tyr Phe Leu Phe Leu Phe
-20 -15 -10
Asn Leu


He 


Phe 


Phe 


He Leu Gly Ala Val 


He 


Leu 


Gly 


Phe 


Gly Val 






-5 




1 






5 






Trp He 


Leu 


Ala 


Asp 


Lys Ser Ser Phe He 


Ser 


Val 




Gin 


Thr Ser 


10 








15 




20 








Ser Ser 


Ser 


Leu 


Arg 


Met Gly Ala Tyr Val 


Phe 


He 


Gly Val 


Gly Ala 


25 








30 


35 








40 


Val Thr 


Met 


Leu 


Met 


Gly Phe Leu Gly Cys 


He 


Gly 














45 


50 












<210> 796
<211> 151
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -18..-1

<220>
<221> UNSURE
<222> 117
<223> Xaa = Ser,

<400> 796

Met Lys 


Ala 


Leu 


He 


Val Leu Gly Leu Val 


Leu 


Leu 


Ser 


Val 


Thr Val 






-15 




-10 








-5 




Gin Gly 


Lys 


Val 




Glu Arg Cys Glu Leu 


Ala 


Arg 


Thr 


Leu 


Lys Arg 




1 






5 




10 








Leu Gly 


Met 


Asp 


Gly 


Tyr Arg Gly He Ser 




Ala 




Trp 


Met Cys 


15 








20 


25 








30 


Leu Ala 


Lys 


Trp 


Glu 


Ser Gly Tyr Asn Thr 


Arg Ala 


Thr 


Asn 


Tyr Asn 








35 


40 










45 


Ala Gly 


Asp 


Arg 


Ser 


Thr Asp Tyr Gly He 


Phe 


Gin 


He 


Asn 


Ser Arg 






50 




55 








60 




Tyr Trp 


Cys 




Asp 


Gly Lys Thr Pro Gly 


Ala 


Val 


Asn 


Ala 


Cys His 




65 






70 






75 






Leu Ser 


Cys 


Ser 


Ala 


Leu Leu Gin Asp Asn 


He Ala Asp Ala 


Val Ala 


80 








85 




90 








Cys Ala 


Lys 


Arg 


Val 


Val Arg Asp Pro Pro 


Thr 


Ser 


Ala 


Ser 


Gin Ser 


95 








100 


105 








110 


Ala Gly 


He 


Thr 


Gly 


Val Xaa Asn Cys Ala 


Arg 


Pro 


His 


Ser 


Val Leu 








115 


120 










125 


He Lys 


Glu 


He 


Thr 


Gin Thr 
















130 
















<210> 797
<211> 82
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -29..-1

<400> 797

Met Asp 


Leu 


Gin 


Gly 


Arg Gly Val Pro Ser 


He Asp Arg Leu 


Arg Val 








-25 


-20 










-15 


Leu Leu 


Met 


Leu 


Phe 


His Thr Met Ala Gin 


He 


Met 


Ala 


Glu 


Gin Glu 






-10 




-5 








1 




Val Glu 


Asn 




Ser 


Gly Leu Ser Thr Asn 


Pro 


Glu 


Lys 


Asp 


He Phe 


5 








10 




15 








Val Val 


Arg 


Glu 


Asn 


Gly Thr Thr Cys Leu 


Met 


Ala 


Glu 


Phe 


Ala Ala 


20 








25 


30 








35 
Lys Phe Ile Val Pro Tyr Asp
40

Ile Thr

<210> 798 

<211> 60 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -24..-1

<400> 798 

Met Glu Glu Gly Gly Asn Leu 
-20 

Leu Val Leu Ser Gly Ala Trp 

-5 

Ser Gly Phe Leu Leu Phe Arg 

10 15 
Val Gln Ser Lys Leu Phe Pro
25 30 

<210> 799 
<211> 142 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -26..-1

<220> 

<221> UNSURE 
<222> 47 

<223> Xaa = Gly, Arg 



<400> 799 



Met 


Lys 


Thr 


Glu 


Gin Leu 


Asn 




-25 








-20 


Ala 


Ser 


Leu 


Phe 


Thr Glu 


Asn 


-10 








-5 




Arg 




Gin 


Cys 


Gin Val 


Asn 








10 






Pro 


Phe 


Thr 


Val 


He Asn 


He 






25 








Pro 


Arg 


Ala 


Leu 


Trp Lys 


Gly 




40 








45 


Val 


Thr 


Leu 


Gly 


Ala Glu Gly 


55 








60 




Arg 


Glu 


Val 


Leu 


His Lys 


Trp 










75 




Leu 






Ser 


Leu Thr 


Tyr 








90 






Ser 


Leu 


He 


Glu 


Thr Val 


Gin 






105 









<210> 800 
<211> 126 
<212> PRT 

<213> Homo sapiens 



PCT/IB01/00914


Val Trp Ala Ser Asn Tyr Val Asp Leu 



Gly Gly Leu He Lys Met Val His Leu 

-15 -10 
Gly Met Gin Met Trp Val Thr Phe Val 

1 5 
Ser Leu Pro Arg His Thr Phe Gly Leu 
20 

Phe Tyr Phe His He 
35 



Arg Phe Ala Gly Phe Gly He Gly Leu 
-15 

Val Leu Ala His Pro Cys He Val Leu 

1 5 
Tyr His Ala Gin His Tyr His Leu Thr 

15 20 
Met Tyr Ser Phe Asn Lys Thr Gin Gly 
30 35 
Met Xaa Ser Thr Phe He Val Gin Gly 
50 

He He Ser Glu Phe Thr Pro Leu Pro 

65 70 
Ser Pro Lys Gin He Gly Glu His Leu 
80 85 

Val Val Ala Met Leu Phe Tyr Ser Ala 

95 100 
Ser Glu He He Arg Asp Asn 
110 115 



<220> 
<400> 800
















Met 


His 


Glu 


Glu 


Glu He Tyr 


Thr Ser 


Leu 


Gin 


Trp Asp Ser 


Pro Ala 










-40 




-35 






-30 


Pro 


Asp 


Thr 


Tyr 


Gin Lys Cys 


Leu Ser 


Ser 


Asn 


Lys Cys Ser 


Gly Ala 








-25 




-20 






-15 




Cys 


Cys 


Leu 


Val 


Met Val He 


Ser Cys 


Val 


Phe 


Cys Met Gly Leu Leu 






-10 






-5 






1 




Thr 


Ala 


Ser 


lie 


Phe Leu Gly Val Lys 


Leu 


Leu 


Gin Val Ser 


Thr He 


5 








10 






15 




20 


Ala 


Met 


Gin 


Gin 


Gin Glu Lys 


Leu He 


Gin 


Gin 


Glu Arg Ala 


Leu Leu 










25 




30 






35 


Asn 


Phe 


Thr 


Glu 


Trp Lys Arg 


Ser Cys 


Ala 


Leu 


Gin Met Lys 


Tyr Cys 








40 




45 






50 




Gin 


Ala 


Phe 


Met 


Gin Asn Ser 


Leu Ser 


Ser 


Ala 


His Asn Ser 


Ser Pro 






55 






60 






65 




Cys 


Pro 




Asn 


Trp He Gin 


Asn Arg 


Glu 


Ser 


Cys Tyr Tyr 






70 






75 








80 





<210> 801 
<211> 52 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -20..-1



<400> 801 

Met Gly Arg Ala Arg Glu Val Gly Trp Met Ala Ala Gly Leu Met Ile
-20 -15 -10 -5

Gly Ala Gly Ala Cys Tyr Cys Val Tyr Lys Leu Thr Ile Gly Arg Asp
1 5 10

Asp Ser Glu Lys Leu Glu Glu Glu Gly Glu Glu Glu Trp Asp Asp Asp
15 20 25

Gln Glu Leu Asp
30



<210> 802 
<211> 154 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -15..-1



<400> 802 


















Met Asn Ile Leu Met Leu Thr Phe Ile Ile Cys Gly Leu Leu Thr Arg
-15 -10 -5 1

Val Thr Lys Gly Ser Phe Glu Pro Gln Lys Cys Trp Lys Asn Asn Val
5 10 15

Gly His Cys Arg Arg Arg Cys Leu Asp Thr Glu Arg Tyr Ile Leu Leu
20 25 30

Cys Arg Asn Lys Leu Ser Cys Cys Ile Ser Ile Ile Ser His Glu Tyr
35 40 45

Thr Arg Arg Pro Ala Phe Pro Val Ile His Leu Glu Asp Ile Thr Leu
50 55 60 65

Asp Tyr Ser Asp Val Asp Ser Phe Thr Gly Ser Pro Val Ser Met Leu
70 75 80

Asn Asp Leu Ile Thr Phe Asp Thr Thr Lys Phe Gly Glu Thr Met Thr
85 90 95

Pro Glu Thr Asn Thr Pro Glu Thr Thr Met Pro Pro Ser Glu Ala Thr
100 105 110

Thr Pro Glu Thr Thr Met Pro Pro Ser Glu Thr Ala Thr Ser Glu Thr
115 120 125

Met Pro Pro Pro Ser Gln Thr Ala Leu Thr
130 135



<210> 803 

<211> 165 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -39..-1



Met 


Met 


Thr 


He 


Thr 


Phe 


Leu Pro Tyr 


Thr 


Phe 


Ser Leu Met 


Val 


Thr 










-35 






-30 






-25 




Phe 


Pro 


Asp 


Val 


Pro 


Leu 


Gly He Phe 


Leu 


Phe 


Cys Val Cys 


Val 


He 








-20 






-15 






-10 






Ala 


He 


Gly 


Val 


Val 


Gin 


Ala Leu He 
1 


Val 


Gly 


Tyr Ala Phe 


His 


Phe 


Pro 


His 


-5 
Leu 


Leu 


Ser 


Pro 


Gin He Gin 


Arg 


Ser 


Ala His Arg Ala 


Leu 


10 










15 






20 






25 


Tyr 


Arg 


Arg 


His 


Val 


Leu 


Gly He Val 


Leu 


Gin 


Gly Pro Ala 


Leu 


Cys 










30 






35 






40 




Phe 


Ala 


Ala 


Ala 


He 


Phe 


Ser Leu Phe 


Phe 


Val 


Pro Leu Ser 


Tyr 


Leu 








45 






50 






55 






Leu 


Met 


Val 


Thr 


Val 


He 


Leu Leu Pro 


Tyr 


Val 


Ser Lys Val 


Thr 


Gly 






60 








65 






70 






Trp 


Cys 


Arg 


Asp 


Arg 


Leu 


Leu Gly His 


Arg 


Glu 


Pro Ser Ala 


His 


Pro 




75 










80 






85 






Val 


Glu 


Val 


Phe 


Ser 


Phe 


Asp Leu His 


Glu 


Pro 


Leu Ser Lys 


Glu 


Arg 


90 










95 






100 






105 


Val 


Glu 


Ala 


Phe 


Ser 


Asp 


Gly Val Tyr 


Ala 


He 


Val Ala Thr 


Leu 


Leu 










110 






115 






120 




lie 


Leu 


Asp 


He 


Trp 






















125 



















<210> 804 
<211> 145 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -22..-1



<220> 

<221> UNSURE 
<222> 84 

<223> Xaa = Ala, Pro, Ser, Thr 



<220> 

<221> UNSURE 

<222> -1 

<223> Xaa = Ala, Ser 



<220> 

<221> UNSURE 
<222> 21 
<223> Xaa = Cys, Phe, Ile, Asn, Ser, Thr, Tyr

<220> 

<221> UNSURE 
<222> 46 

<223> Xaa = Cys, Phe, Ser, Tyr 



<400> 804 



Met Gly Arg Trp Ser Leu Thr Ala Ser Pro Val Thr Leu Thr Ser Leu
-20 -15 -10

Leu Pro Met Val Thr Xaa Gly Pro Ala Ala Gly Lys Ser Ser Ser Ser
-5 1 5 10

Thr Ser Trp Asp Thr Val Leu Thr Ala Ile Xaa Cys Ala Ser Thr Val
15 20 25

Gln Leu Ile Ser Thr Thr Leu Gly Ala Ser Ala Ser Gly Ala Arg Met
30 35 40

Pro Thr Thr Xaa Cys Ser Gly Thr Thr Val Phe Leu Thr Ala Leu Gln
45 50 55

Asp Thr Met Gln Arg Glu Glu Leu Val Lys Asn Ala Thr Pro Pro Ala
60 65 70

Glu Pro Ala Arg Ala Glu Asp Leu Ser Xaa Ala Pro His Val Thr Pro
75 80 85 90

Thr Ser Cys Cys Pro Thr Leu Ala Pro Ala Ala Pro Pro Ala Ser Leu
95 100 105

Gly Thr Ile Leu Met Thr Ile Met Phe Ala Ser Val Gly Phe Ser Asn
110 115 120

Glu





























<210> 805 
<211> 109 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -30..-1

<220> 

<221> UNSURE 
<222> 64 

<223> Xaa = Ile, Met
<220> 

<221> UNSURE 
<222> 58 

<223> Xaa = Leu, Val 
<220> 

<221> UNSURE 
<222> 42 

<223> Xaa = Lys, Arg 



<400> 805 

Met Ala Gly Lys Arg Ser Gly Trp Ser Arg Ala Ala Leu Leu Gln Leu
-30 -25 -20 -15

Leu Leu Gly Val Asn Leu Val Val Met Pro Pro Thr Arg Ala Arg Ser
-10 -5 1

Leu Arg Phe Val Thr Leu Glu Gly Met Leu Gln His Trp Glu Leu Gly
5 10 15

Gln Ala Leu Arg Gln Arg Tyr His Gly Phe Leu Asn Thr Ser Tyr His
20 25 30

Arg Gln Glu Val Tyr Val Arg Xaa Tyr Phe Asp Arg Thr Leu Met Ser
35 40 45 50

Ala Glu Ala Asn Leu Ala Gly Xaa Phe Pro Pro Asn Gly Xaa Gln Arg
55 60 65

Phe Asn Pro Asn Ile Ser Trp Gln Pro Ile Pro Val His



<210> 806 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -32..-1



<400> 806 

Met Ala Thr Pro Pro Phe Arg Leu Ile Arg Lys Met Phe Ser Phe Lys
-30 -25 -20

Val Ser Arg Trp Met Gly Leu Ala Cys Phe Arg Ser Leu Ala Ala Ser
-15 -10 -5

Ser Pro Ser Ile Arg Gln Lys Lys Leu Met His Lys Leu Gln Glu Glu
1 5 10 15

Asn Ala Phe Arg Glu Glu Met Lys Ile
20 25



<210> 807 
<211> 79 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -19..-1



<400> 807 




















Met 


Val 


Gly 


He 


Leu 
-15 


Pro 


Leu 


Cys 


Cys 


Ser Gly Cys Val Pro 
-10 


Ser 
-5 


Leu 


Cys 


Cys 


Ser 


Ser 


Tyr Val 


Pro 


Ser 


Val 


Ala Pro Thr Ala Ala 


His 


Ser 








1 








5 




10 






Val 


Arg 


Val 


Pro 


His 


Ser 


Ala 


Gly His 


Cys Gly Gin Arg Val 




Ala 




15 










20 






25 






Cys 


Ser 


Leu 


Pro 


Gin 


Val 


Phe 


Leu 


Lys 


Pro Trp He Phe Val 


Glu 


His 


30 










35 








40 




45 


Phe 


Ser 


Ser 


Trp 


Leu 


Ser 


Leu 


Glu 


Leu 


Phe Ser Phe Leu Arg 


Tyr 





50 55 60 



<210> 808 
<211> 147 
<212> PRT 

<213> Homo sapiens



<220> 

<221> SIGNAL 
<222> -20..-1



<400> 808 

Met Asp Ser Ser Thr Ala His Ser Pro Val Phe Leu Val Phe Pro Pro
-20 -15 -10 -5

Glu Ile Thr Ala Ser Glu Tyr Glu Ser Thr Glu Leu Ser Ala Thr Thr
1 5 10

Phe Ser Thr Gln Ser Pro Leu Gln Lys Leu Phe Ala Arg Lys Met Lys
15 20 25

Ile Leu Gly Thr Ile Gln Ile Leu Phe Gly Ile Met Thr Phe Ser Phe
30 35 40

Gly Val Ile Phe Leu Phe Thr Leu Leu Lys Pro Tyr Pro Arg Phe Pro
45 50 55 60

Phe Ile Phe Leu Ser Gly Tyr Pro Phe Trp Gly Ser Val Leu Phe Ile
65 70 75

Asn Ser Gly Ala Phe Leu Ile Ala Val Lys Arg Lys Thr Thr Glu Thr
80 85 90

Leu Ile Ile Leu Ser Arg Ile Met Asn Phe Leu Ser Ala Leu Gly Ala
95 100 105

Ile Ala Gly Ile Ile Leu Leu Thr Phe Gly Phe His Pro Arg Ser Lys
110 115 120

Leu His Leu
125


























<210> 809
<211> 96
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -33..-1

<400> 809

Met Lys Leu Lys Leu Lys Asn Val Phe Leu Ala Tyr Phe Leu Val Ser
-30 -25 -20

Ile Ala Gly Leu Leu Tyr Ala Leu Val Gln Leu Gly Gln Pro Cys Asp
-15 -10 -5

Cys Leu Pro Pro Leu Arg Ala Ala Ala Glu Gln Leu Arg Gln Lys Asp
1 5 10 15

Leu Arg Ile Ser Gln Leu Gln Ala Glu Leu Arg Arg Pro Pro Pro Ala
20 25 30

Pro Ala Gln Pro Pro Glu Pro Glu Ala Leu Pro Thr Ile Tyr Val Val
35 40 45

Thr Pro Thr Tyr Ala Arg Leu Val Gln Lys Ala Glu Leu Val Arg Leu
50 55 60








<210> 810
<211> 169
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -16..-1

<400> 810

Met Ser Gly Cys Gly Leu Phe Leu Arg Thr Thr Ala Ala Ala Arg Ala
-15 -10 -5

Cys Arg Gly Leu Val Val Ser Thr Ala Asn Arg Arg Leu Leu Arg Thr
1 5 10 15

Ser Pro Pro Val Arg Ala Phe Ala Lys Glu Leu Phe Leu Gly Lys Ile
20 25 30

Lys Lys Lys Glu Val Phe Pro Phe Pro Glu Val Ser Gln Asp Glu Leu
35 40 45

Asn Glu Ile Asn Gln Phe Leu Gly Pro Val Glu Lys Phe Phe Thr Glu
50 55 60

Glu Val Asp Ser Arg Lys Ile Asp Gln Glu Gly Lys Ile Pro Asp Glu
65 70 75 80

Thr Leu Glu Lys Leu Lys Ser Leu Gly Leu Phe Gly Leu Gln Val Pro
85 90 95

Glu Glu Tyr Gly Gly Leu Gly Phe Ser Asn Thr Met Tyr Ser Arg Leu
100 105 110






Gly Glu He He 


Ser 


Met 


Asp Gly 


Ser 


He 


Thr 


Val 


Thr 




Ala 


Ala 
115 120 125

His Gln Ala Ile Gly Ser Arg Gly Ser Ser Trp Leu Ala Leu Arg Ser
130 135 140

Arg Lys Pro Asn Asn Leu Pro Lys Leu
145 150

<210> 811 
<211> 136 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -36..-1



<400> 811 



Met Glu Glu 


Leu 


Gin 


Glu Pro Leu 


Arg Gly Glu 


Leu 






Cys 


Phe 


-35 






-30 






-25 










Thr Gin Ala 


Ala 


Arg 


Thr Ser Leu 


Leu 


Leu Leu 


Arg 


Leu 


Asn 


Asp 


Ala 


-20 






-15 














-5 


Ala Leu Arg 


Ala 




Gin Glu Cys 


Gin 


Arg Gin 


Gin 


Val 


Arg 


Pro 


Val 






1 




5 








10 






lie Ala Phe 


Gin 


Gly 


His Arg Gly 


Tyr 


Leu Arg 


Leu 


Pro 


Gly 


Pro 


Gly 


15 






20 








25 








Trp Ser Cys 


Leu 


Phe 


Ser Phe He 


Val 


Ser Gin 


Cys 


Cys 


Gin 


Glu 


Gly 


30 






35 






40 










Ala Gly Gly Ser Leu 


Asp Leu Val 


Cys 


Gin Arg 


Phe 


Leu 


Arg 


Ser 


Gly 


45 






50 




55 










60 


Pro Asn Ser 


Leu 


His 


Cys Leu Gly 


Ser 


Leu Arg 


Glu 


Arg 


Leu 


He 


He 






65 






70 








75 




Trp Ala Ala 


Met 


Asp 


Ser He Pro 


Ala 


Pro Ser 


Ser 


Val 


Gin 


Gly 


His 




80 






85 








90 






Asn Leu Thr 


Glu Asp 


Ala Arg His 
















95 






100 
















<210> 812
<211> 109
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -23..-1

<220>
<221> UNSURE
<222> 81
<223> Xaa = Pro, Ser

<400> 812
Met Cys Tyr Gly Lys Cys Ala Arg Cys Ile Gly His Ser Leu Val Gly
-20 -15 -10
Leu Ala Leu     Cys Ile Ala Ala Asn Ile Leu Leu Tyr Phe Pro Asn
-5 1 5
Gly Glu Thr Lys Tyr Ala Ser Glu Asn His Leu Ser Arg Phe Val Trp
10 15 20 25
Phe Phe Ser Gly Ile Val Gly Gly Gly Leu Leu Met Leu Leu Pro Ala
30 35 40
Phe Val Phe Ile Gly Leu Glu Gln Asp Asp Cys Cys Gly Cys Cys Gly
45 50 55
His Glu Asn Cys Gly Lys Arg Cys Ala Met Leu Ser Ser Val Leu Ala
60 65 70
Ala Leu Ile Gly Ile Ala Gly Xaa Gly Tyr Cys Val Ile
75 80 85

<210> 813
<211> 74
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -17..-1

<400> 813
Met Trp Pro Asn Leu Leu Pro Leu Cys Gln Arg Ile His Cys Ile Ile
-15 -10 -5
Gly Arg Phe Arg Cys Asn Gly Phe Glu Asp Cys Pro Asp Gly Ser Asp
1 5 10 15
Glu Glu Asn Cys Thr Ala Asn Pro Leu Leu Cys Ser Thr Ala Arg Tyr
20 25 30
His Cys Lys Asn Gly Leu Cys Ile Asp Lys Ser Phe Ile Cys Asp Gly
35 40 45
Gln Asn Asn Cys Gln Asp Asn Ser Asp Glu
50 55

<210> 814
<211> 137
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -47..-1

<220>
<221> UNSURE
<222> 87
<223> Xaa = Pro, Arg

<400> 814
Met Thr Leu Gln Trp His Leu Tyr Glu Met Ala Arg Asn Leu Lys Val
-45 -40 -35
Gln Asp Met Leu Arg Ala Glu Val Leu Ala Ala Arg His Gln Ala Gln
-30 -25 -20
Gly Asp Met Ala Thr Met Leu Gln Leu Val Pro Leu Leu Lys Ala Ser
-15 -10 -5 1
Ile Lys Glu Thr Leu Arg Leu His Pro Ile Ser Val Thr Leu Gln
5 10 15
Tyr Leu Val Asn Asp Leu Val Leu Arg Asp Tyr Met Ile Pro Ala Lys
20 25 30
Thr Leu Val Gln Val Ala Ile Tyr Ala Leu Gly Arg Glu Pro Thr Phe
35 40 45
Phe Phe Asp Pro Glu Asn Phe Asp Pro Thr Arg Trp Leu Ser Lys Asp
50 55 60 65
Lys Asn Ile Thr Tyr Phe Arg Asn Leu Gly Phe Gly Trp Gly Val Arg
70 75 80
Gln Cys Leu Gly Arg Xaa Ile Ala Glu
85 90

<210> 815
<211> 98
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -20..-1

<400> 815
Met Asp Lys Asp Ser Gln Gly Leu Leu Asp Ser Ser Leu Met Ala Ser
-20 -15 -10 -5
Gly Thr Ala Ser Arg Ser Glu Asp Glu Glu Ser Leu Ala Gly Gln Lys
1 5 10
Arg Ala Ser Ser Gln Ala Leu Gly Thr Ile Pro Lys Arg Arg Ser Ser
15 20 25
Ser Arg Phe Ile Lys Arg Lys Lys Phe Asp Asp Glu Leu Val Glu Ser
30 35 40
Ser Leu Ala Lys Ser Ser Thr Arg Ala Lys Gly Ala Ser Gly Val Glu
45 50 55 60
Gln Gly Ala Val Arg Val Val Asn Pro Pro Pro Val Arg Arg Arg Arg
65 70 75
Tyr Gln

<210> 816
<211> 113
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -36..-1

<400> 816
Met Met Thr Glu Lys His Glu Asp His Cys Leu Lys Ser Cys Ser Gly
-35 -30 -25
His Ser Tyr Ile Arg Lys Asn Trp Leu Gly Cys Ile Val Phe Pro Phe
-20 -15 -10 -5
Ser Ala Leu Leu Gln Gln Ser Glu Ile Ser Gly Thr Phe Gln Val Thr
1 5 10
Ile Pro Pro Val Leu Leu Gly Tyr Thr Trp Ser Asn Thr Tyr Val Phe
15 20 25
Pro Lys Glu Asp Ser Asn Glu Gln Asn Leu Lys Glu Cys Thr Phe Leu
30 35 40
Asn Ile Phe Ala Thr Ile Glu Pro Gln Ile Ser Tyr Val Thr Cys Asn
45 50 55 60
Pro Thr Leu Asp Lys Phe Leu Asp Gln Thr Glu Val Leu Gln Arg Ala
65 70 75
Gln

<210> 817
<211> 136
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -41..-1

<220>
<221> UNSURE
<222> 91
<223> Xaa = Phe, Leu

<400> 817
Met Lys Ala Ile Lys Lys Ser Leu Thr Glu Glu Glu Tyr Leu Tyr Leu
-40 -35 -30
Asp Phe Ser His Gln Thr Glu Gly Cys Ile Phe Pro Leu His Thr Ser
-25 -20 -15 -10
Val Thr Leu Phe Leu Leu Ser Tyr Cys Asp Cys Lys Ile Phe Lys Ile
-5 1 5
Cys Leu Val Val Thr Lys Glu Val Ser Arg Asp Ser Ser Leu Leu Arg
10 15 20
Asp Asp Leu Ile Gln Asp Val Glu Ile Gln Ile Ile Ser Arg Gln Glu
25 30 35
Leu Pro Pro Ile Val Gln Asn Cys Cys Leu Pro Ala Val Val Glu Arg
40 45 50 55
Ser Asp Asn Phe Cys Arg Ala Gly Leu Ala Val Val Leu Arg His Ile
60 65 70
Ile Gln Lys Ser Tyr Glu Ala Asp Pro Leu Lys Lys Glu Leu Leu Glu
75 80 85
Leu Leu Gly Xaa Lys Arg Leu Ala
90 95

<210> 818
<211> 137
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -16..-1

<220>
<221> UNSURE
<222> 57
<223> Xaa = Asp, Gly

<220>
<221> UNSURE
<222> 58
<223> Xaa = Cys, Ser

<400> 818
Met Glu Asn Phe Ser Leu Leu Ser Ile Ser Gly Pro Pro Ile Ser Ser
-15 -10 -5
Ser Ala Leu Ser Ala Phe Pro Asp Ile Met Phe Ser Arg Ala Thr Ser
1 5 10 15
Leu Pro Asp Ile Ala Lys Thr Ala Val Pro Thr Glu Ala Ser Ser Pro
20 25 30
Ala Gln Ala Leu Pro Pro Gln Tyr Gln Ser Ile Ile Val Arg Gln Gly
35 40 45
Ile Gln Asn Thr Val Leu Ser Pro Xaa Xaa Ser Leu Gly Asp Thr Gln
50 55 60
His Gly Glu Lys Leu Arg Arg Asn Cys Thr Ile Tyr Arg Pro Trp Phe
65 70 75 80
Ser Pro Tyr Ser Tyr Phe Val Cys Ala Asp Lys Glu Ser Gln Leu Glu
85 90 95
Ala Tyr Asp Phe Pro Glu Val Gln Gln Asp Glu Gly Lys Trp Asp Asn
100 105 110
Cys     Ser Glu Asp Met Ala Glu Asn
115 120

<210> 819
<211> 113
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 34
<223> Xaa = Ala, Glu

<400> 819
Met Leu Gly Ser Arg Val Asp Arg Pro Lys Leu Ser Val Ala Pro Ser
1 5 10 15
Val Val Leu Glu Glu Asp Gln Val Leu Val Ser Pro Ala Val Asp Leu
20 25 30
Glu Xaa Gly Cys Arg Leu Arg Asp Phe Thr Glu Lys Ile Met Asn Val
35 40 45
Lys Gly Lys Val Ile Leu Ser Met Leu Val Val Ser Thr Val Ile Ile
50 55 60
Val Phe Trp Glu Phe Ile Asn Ser Thr Glu Gly Ser Phe Leu Trp Ile
65 70 75 80
Tyr His Ser Lys     Pro Glu Val Asp Asp Ser Ser Ala Gln Lys Gly
85 90 95
Trp Trp Phe Leu Ser Trp Phe Asn Asn Gly Ile His Asn Tyr Gln Gln
100 105 110
Gly

<210> 820
<211> 174
<212> PRT
<213> Homo sapiens

<400> 820
Met Ala Ala Gly Leu Phe Gly Leu Ser Ala Arg Arg     Leu Ala Ala
1 5 10 15
Ala Ala Thr Arg Gly Leu Pro Ala Ala Arg Val Arg Trp Glu Ser Ser
20 25 30
Phe Ser Arg Thr Val Val Ala Pro Ser Ala Val Ala Gly     Arg Pro
35 40 45
Pro Glu Pro Thr Thr Pro Trp Gln Glu Asp Pro Glu Pro Glu Asp Glu
50 55 60
Asn Leu Tyr Glu Lys Asn Pro Asp Ser His Gly Tyr Asp Lys Asp Pro
65 70 75 80
Val Leu Asp Val Trp Asn Met     Leu Val Phe Phe Phe Gly Val Ser
85 90 95
Ile Ile Leu Val Leu Gly Ser Thr Phe Val Ala Tyr Leu Pro Asp Tyr
100 105 110
Arg Met Lys Glu Trp Ser Arg Arg Glu Ala Glu Arg Leu Val Lys Tyr
115 120 125
Arg Glu Ala Asn Gly Phe Pro Ser Trp Asn Pro Thr Ala Ser Thr Pro
130 135 140
Ala Arg Ser Ser Cys Gln Arg Met Ser Asp Gln Leu     Ser Gly Ala
145 150 155 160
Gln Glu Ala Pro Pro Ser Pro Pro Pro Ala Cys His Ser Asp
165 170

<210> 821
<211> 159
<212> PRT
<213> Homo sapiens

<400> 821
Met Glu Arg Tyr Ala Ala Ala Leu Glu Glu Val Ala Asp Gly Ala Arg
1 5 10 15
Gln Gln Glu Arg His Tyr Gln Leu Leu Ser Ala Leu Gln Ser Leu Val
20 25 30
Lys Glu Leu Pro Ser Ser Phe Gln Gln Arg Leu Ser Tyr Thr Thr Leu
35 40 45
Ser Asp Leu Ala Leu Ala     Leu Asp Gly Thr Val Phe Glu Ile Val
50 55 60
Gln Gly Leu Leu Glu Ile Gln His Leu Thr Glu Lys Ser Leu Tyr Asn
65 70 75 80
Gln Arg Leu Arg Leu Gln Asn Glu His Arg Val Leu Arg Gln Ala Leu
85 90 95
Arg Gln Lys His Gln Glu Ala Gln Gln Ala Cys Arg Pro His Asn Leu
100 105 110
Pro Val Val Gln Ala Ala Gln Gln Arg Glu Leu Glu Ala Val Glu His
115 120 125
Arg Ile Arg Glu Glu Gln Arg Ala Met Asp His Lys Ile Ile Leu Glu
130 135 140
Leu Asp Arg Lys Val Ala Asp Gln Gln Ser Thr Leu Glu Lys Ala
145 150 155

PCT/IB01/00914

<210> 822
<211> 143
<212> PRT
<213> Homo sapiens

<400> 822
Met Ser Gly Phe Glu Asn Leu Asn Thr Asp Phe Tyr Gln Thr Ser Tyr
1 5 10 15
Ser Ile Asp Asp Gln Ser Gln Gln Ser Tyr Asp Tyr Gly Gly Ser Gly
20 25 30
Gly Pro Tyr Ser Lys Gln Tyr Ala Gly Tyr Asp Tyr Ser Gln Gln Gly
35 40 45
Arg Phe Val Pro Pro Asp Met Met Gln Pro Gln Gln Pro Tyr Thr Gly
50 55 60
Gln Ile Tyr Gln Pro Thr Gln Ala Tyr Thr Pro Ala Ser Pro Gln Pro
65 70 75 80
Phe Tyr Gly Asn Asn Phe Glu Asp Glu Pro Pro Leu Leu Glu Glu Leu
85 90 95
Gly Ile Asn Phe Asp His Ile Trp Gln Lys Thr Leu Thr Val Leu His
100 105 110
Pro Leu Lys Val Ala Asp Gly Ser Ile Met Asn Glu Thr Asp Leu Ala
115 120 125
Gly Pro Met Val Phe Cys Leu Ala Phe Trp Ser Thr Leu Leu Leu
130 135 140

<210> 823
<211> 138
<212> PRT
<213> Homo sapiens

<400> 823
Met Val Lys Thr Lys Leu Lys Thr Val Val Ile Asp Ile Thr Val Ile
1 5 10 15
Pro Phe Ala Asp Asp Phe Phe Cys Pro Gly Ala Pro Ala Gly Val Glu
20 25 30
Ile Ser Pro Ala Ala Asp Gly Pro Ser Pro Glu Gly Pro Ile Ala Glu
35 40 45
Pro Asp Gly Glu Ser Asp Gly Glu Glu Glu Val Glu Pro Pro Gly Ala
50 55 60
Asn Asp Gly Glu Thr Val Gly Glu Phe Glu Thr Gly Asp Glu Asp Leu
65 70 75 80
Gly Ala Ile Asp Pro Gly Gly Glu Val Ile Gly Ala Glu Asp Gly Gly
85 90 95
Ile Val Glu Asp Asp Leu Gly Ala Ile Gly Ala Gly Gly Glu Val Ile
100 105 110
Gly Ala Glu Asp Gly Glu Thr Gly Asp Val Asp Leu Gly Gly Ala Gly
115 120 125
Gly Glu Val Val Gly Glu Thr Gly Glu Leu
130 135

<210> 824
<211> 74
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 53
<223> Xaa = *, Ala, Asp, Glu, Phe, Leu, Ser, Val, Tyr

<220>
<221> UNSURE
<222> 28
<223> Xaa = His, Asn, Tyr

<220>
<221> UNSURE
<222> 26
<223> Xaa = Ile, Lys, Met, Asn

<220>
<221> UNSURE
<222> 27
<223> Xaa = any one of the twenty amino acids

<400> 824
Met Arg Ser His Thr Ile Thr Met Thr Thr Thr Ser Val Ser Ser Trp
1 5 10 15
Pro Tyr Ser Ser His Arg Met Arg Phe Xaa Xaa Xaa His Ser Asp Gln
20 25 30
Pro Pro Gln Asn Phe Ser Ala Thr Pro Asn Val Thr Thr Cys Pro Met
35 40 45
Asp Glu Lys Leu Xaa Thr Thr Val Leu Thr Thr Ser Tyr Ser Val Ile
50 55 60
Phe Ile Val Gly Leu Val Gly Asn Ile Ile
65 70

<210> 825
<211> 135
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 8
<223> Xaa = Glu, Gln

<220>
<221> UNSURE
<222> 63
<223> Xaa = Glu, Gly

<220>
<221> UNSURE
<222> 133
<223> Xaa = His, Pro

<400> 825
Met Lys Glu Ala Gly Gln Met Xaa Asn Leu Glu Ser Ala Arg Ala Gly
1 5 10 15
Arg Ser Val Ser Thr Gln Thr Gly Ser Met Thr Gly Gln Ile Pro Arg
20 25 30
Leu Ser Lys Val Asn Leu Phe Thr Leu Leu Ser Leu Trp Met Glu Leu
35 40 45
Phe Pro Ala Glu Ala Gln Arg Gln Lys Ser Gln Lys Asn Glu Xaa Gly
50 55 60
Lys His Gly Pro Leu Gly Asp Asn Glu Glu Arg Thr Arg Val Ser Thr
65 70 75 80
Asp Lys Arg Gln Val Lys Arg Thr Gly Leu Val Val Val Lys Asn Met
85 90 95
Lys Ile Val Gly Leu His Cys Ser Ser Glu Asp Leu His Ala Gly Gln
100 105 110
Ile Ala Leu Ile Lys His Gly Ser Arg Leu Lys Asn Cys Asp Pro Tyr
115 120 125
Phe Ser Arg Lys Xaa Val Leu
130 135

<210> 826
<211> 61
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 44
<223> Xaa = *, Glu, Gln

<400> 826
Met Pro Gly Ala Ala Gly Arg Arg Cys Pro Asp Arg Ser Pro Pro Phe
1 5 10 15
Gly Pro Asp Phe Gly Ser Ser Gly Val Leu Met Leu Leu Lys Pro Glu
20 25 30
Ser Thr Cys Pro Asp Pro Ser Leu Tyr Asp Ala Xaa Leu Cys Pro Val
35 40 45
Pro Glu Pro Lys Pro Cys Arg Gly Val Ala Ser Asp Pro
50 55 60

<210> 827
<211> 130
<212> PRT
<213> Homo sapiens

<400> 827
Met Arg Ser His Thr Ile Thr Met Thr Thr Thr Ser Val Ser Ser Trp
1 5 10 15
Pro Tyr Ser Ser His Arg Met Arg Phe Ile Thr Asn His Ser Asp Gln
20 25 30
Pro Pro Gln Asn Phe Ser Ala Thr Pro Asn Val Thr Thr Cys Pro Met
35 40 45
Asp Glu Lys Leu     Ser Thr Val Leu Thr Thr Ser Tyr Ser Val Ile
50 55 60
Phe Ile Val Gly Leu Val Gly Asn Ile Ile Ala Leu Tyr Val Phe Leu
65 70 75 80
Gly Ile His Arg Lys Arg Asn Ser Ile Gln Ile Tyr Leu Leu Asn Val
85 90 95
Ala Ile Ala Asp Leu Leu Leu Ile Phe Cys Leu Pro Phe Arg Ile Met
100 105 110
Tyr His Ile Asn Gln Asn Lys Trp Thr Leu Gly Val Ile Leu Cys Lys
115 120 125
Val Val
130

<210> 828
<211> 67
<212> PRT
<213> Homo sapiens

<400> 828
Asn Phe Leu Ser Ala Leu Gly Ala Ile Ala Gly Ile Ile Leu
1 5 10 15
Thr Phe Gly Phe Ile Leu Asp Gln Asn Tyr Ile Cys Gly Tyr Ser His
20 25 30
Gln Asn Ser Gln Cys Lys Ala Val Thr Val Leu Phe Leu Gly Ile Leu
35 40 45
Ile Thr Leu Met Thr Phe Ser Ile Ile Glu Leu Phe Ile Ser Leu Leu
50 55 60
Ser Gln Phe
65

<210> 829
<211> 55
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 26
<223> Xaa = His, Gln

<400> 829
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg Gln
1 5 10 15
Tyr Thr Ser Pro Glu Glu Ile Asp Ala Xaa Cys Arg Leu Arg Ser Arg
20 25 30
Arg Pro Gly Lys Lys Arg Ser Lys Lys Gly Gly Asp Gly Ala Ala Gly
35 40 45
Asp Pro Lys Lys Lys Lys Lys
50 55

<210> 830
<211> 55
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 26
<223> Xaa = His, Gln

<400> 830
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg Gln
1 5 10 15
Tyr Thr Ser Pro Glu Glu Ile Asp Ala Xaa Cys Arg Leu Arg Ser Arg
20 25 30
Arg Pro Gly Lys Lys Arg Ser Lys Lys Gly Gly Asp Gly Ala Ala Gly
35 40 45
Asp Pro Lys Lys Lys Lys Lys
50 55

<210> 831
<211> 86
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 9,72
<223> Xaa = Gly, Arg, Trp

<220>
<221> UNSURE
<222> 67
<223> Xaa = Lys, Met, Arg, Thr

<220>
<221> UNSURE
<222> 2
<223> Xaa = Phe, Leu

<220>
<221> UNSURE
<222> 85
<223> Xaa = Ser, Tyr

<400> 831
Met Xaa Leu Val Asp Trp Phe Tyr Xaa Val Leu Ser Ser Leu Gly Leu
1 5 10 15
Trp Gln Lys Glu Ala Lys Ile Leu Phe Leu Gly Leu Asp Asn Ala Gly
20 25 30
Lys Thr Thr Leu Leu His Met Leu Lys Asp Glu Arg Leu Val Gln His
35 40 45
Gln Pro Thr Gln Tyr Pro Thr Ser Glu Glu Leu Ser Ile Gly Arg Ile
50 55 60
Lys Phe Xaa Ala Phe Asp Leu Xaa Gly His Gln Ile Ala Arg Arg Val
65 70 75 80
Trp Lys Asp Tyr Xaa Ala
85

<210> 832
<211> 131
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 129
<223> Xaa = Leu, Val

<400> 832
Met Ser Pro Leu Glu Cys Ser Glu Cys Phe Gly Asp Gln Leu Leu His
1 5 10 15
Arg Thr Tyr Thr Trp Gln Leu Thr Leu His Ser Arg Pro Asn Tyr Thr
20 25 30
Arg Lys Arg Asp Thr Arg Ser Glu Ser Leu Glu Ile Pro Ile Ser Val
35 40 45
Val Leu Pro Gln Arg Gly Thr Ala Glu Pro Phe Pro Arg Leu His Asn
50 55 60
Leu Tyr Ser Thr Pro Arg Cys Ala Gln Gln Ala Ala Leu Pro Arg Leu
65 70 75 80
Ser     Met Ala Ser Gln His Ser Tyr Pro Leu Asn Arg Phe Ser
85 90 95
Ser Val Pro Leu Asp Pro Met Glu Arg Pro Met Ser Gln Ala Asp Leu
100 105 110
Glu Leu Asp Tyr Asn Pro Pro Arg Val Gln Leu Ser Asp Glu Met Phe
115 120 125
Xaa Phe Gln
130

<210> 833
<211> 99
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 32
<223> Xaa = *, Lys, Leu, Met

<220>
<221> UNSURE
<222> 26
<223> Xaa = Asp, Glu

<220>
<221> UNSURE
<222> 70,78
<223> Xaa = His, Gln

<220>
<221> UNSURE
<222> 27
<223> Xaa = Leu, Val

<400> 833
Met Ala Thr Val Met Ala Ala Thr Ala Ala Glu Arg Ala Val Leu Glu
1 5 10 15
Glu Glu Phe Arg Trp Leu Leu His Asp Xaa Xaa His Ala Val Leu Xaa
20 25 30
Gln Leu Gln Asp Ile Leu Lys Glu Ala Ser Leu Arg Phe Thr
35 40 45
Gly Ser Gly Thr Glu Gly Pro Ala Lys Gln Glu Asn Phe Ile Leu Gly
50 55 60
Ser Cys Gly Thr Asp Xaa Val Lys Gly Val Leu Thr Leu Xaa Gly Asp
65 70 75 80
Ala Leu Ser Gln Ala Asp Val Asn Leu Lys Met Pro Arg Asn Asn Gln
85 90 95
Leu Leu His

<210> 834
<211> 96
<212> PRT
<213> Homo sapiens

<400> 834
Met Val Ser His Thr Phe His Met Arg Thr Glu Glu Ser Asp Ala Ser
1 5 10 15
Gln Glu Gly Asp Asp Leu Pro Lys Ser Ser Ala Asn Thr Ser His Pro
20 25 30
Lys Gln Asp Asp Ser Pro Lys Ser Ser Glu Glu Thr Ile Gln Pro Lys
35 40 45
Glu Gly Asp Ile Pro Lys Ala Pro Glu Glu Thr Ile Gln Ser Lys Lys
50 55 60
Glu Asp Leu Pro Lys Ser Ser Glu Lys Ala Ile Gln Pro Lys Glu Ser
65 70 75 80
Asn Ile Pro Lys Ser Ser Ala Lys Pro Ile Gln Pro Lys Leu Ala Ile
85 90 95

<210> 835
<211> 97
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 94
<223> Xaa = Leu, Met

<400> 835
Met Ser Leu Leu Asp Gly Leu Ala Ser Ser Pro Arg Ala Pro Leu Gln
1 5 10 15
Ser Ser Lys Ala Arg Met Lys Lys Leu Pro Lys Lys Ser Gln Asn Glu
20 25 30
Lys Tyr Arg Leu     Tyr Leu Arg Leu Arg Lys Ala Ala Lys Ala Thr
35 40 45
Val Phe Glu Asn Ala Ala Ile Cys Asp Glu Ile Ala Arg Leu Glu Glu
50 55 60
Lys Phe Leu Lys Ala Lys Glu Glu Arg Arg Tyr Leu Leu Lys Lys Leu
65 70 75 80
Leu Gln Leu Gln Ala Leu Thr Glu Gly Val Ser Thr Gly Xaa Gln Leu
85 90 95
Leu

<210> 836
<211> 107
<212> PRT
<213> Homo sapiens

<400> 836
Met Ala Gln Glu Val Ser Glu Tyr Leu Ser Gln Asn Pro Arg Val Ala
1 5 10 15
Ala Trp Val Glu Ala Leu Arg Cys Asp Gly Glu Thr Asp Lys His Trp
20 25 30
Arg His Arg Arg Asp Phe Leu Leu Arg Asn Ala Gly Asp Leu Ala Pro
35 40 45
Ala Gly Gly Ala Ala Ser Ala Ser Thr Asp Glu Ala Ala Asp Ala Glu
50 55 60
Ser Gly Thr Arg Asn Arg Gln Leu Gln Gln Leu Ile Ser Phe Ser Met
65 70 75 80
Ala Trp Ala Asn His Val Phe Leu Gly Cys Arg Tyr Pro Gln Lys Val
85 90 95
Met Asp Lys Ile Leu Ser Met Ala Glu Gly Ile
100 105

<210> 837
<211> 65
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 20
<223> Xaa = Pro, Arg

<400> 837
Met Arg Leu Leu Gln Pro Arg Ser Arg Leu Gln Arg Thr Ser Gln Lys
1 5 10 15
Trp Leu Phe Xaa Leu His Gly Asp Glu Ser Leu Ala Pro Lys His Pro
20 25 30
Gly Ser Arg Lys Asn Met Asp Cys Tyr Trp Gln Asn Thr Ser Val His
35 40 45
Cys Arg Arg Thr Ser Leu Trp Asn Leu His Leu Pro Gly Lys Thr Leu
50 55 60
Gly
65

<210> 838
<211> 81
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 10
<223> Xaa = *, Trp

<220>
<221> UNSURE
<222> 9
<223> Xaa = Ala, Pro

<220>
<221> UNSURE
<222> 16
<223> Xaa = Arg, Ser

<220>
<221> UNSURE
<222> 12
<223> Xaa = Gly, Trp

<220>
<221> UNSURE
<222> 8
<223> Xaa = Lys, Arg

<400> 838
Met Leu Ala Arg Gly Leu Val Xaa Xaa Xaa Asn Xaa Gly Asn Arg Xaa
1 5 10 15
Pro Val Phe Thr Ile Ser Phe His His Leu Pro Thr Trp Asn Leu Tyr
20 25 30
Ile Ser Pro Ser Leu Ser Phe Leu Ile Cys Glu Thr Ala Ala Asn Thr
35 40 45
Gly Phe Gln Arg Val Val Lys Asn Pro Cys Glu His Leu Leu Gln Leu
50 55 60
Pro Ser Thr Gln Asn Arg Leu Phe His Trp Arg Arg Arg Arg Asp Gly
65 70 75 80

<210> 839
<211> 50
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 47
<223> Xaa = Phe, Ser

<400> 839
Met Val Arg Pro Arg Arg Ala Pro Tyr Arg Ser Gly Ala Gly Gly Pro
1 5 10 15
Leu Gly Gly Arg Gly Arg Pro Pro Arg Pro Leu Val Val Arg Ala Val
20 25 30
Arg Ser Arg Ser Trp Pro Ala Ser Pro Pro Arg Pro Ala Ala Xaa Ala
35 40 45
Asp Pro
50

<210> 840
<211> 85
<212> PRT
<213> Homo sapiens

<400> 840
Met Pro Ser Phe Val Asp Arg Ile Phe Gly Gly Glu Leu Thr Ser Met
1 5 10 15
Ile Met Tyr Asp Gln Cys Arg Thr Val Ser Leu Val His Glu Ser Phe
20 25 30
Leu Asp Leu Ser Leu Pro Val Leu Asp Asp Gln Ser Gly Lys Lys Ser
35 40 45
Val Asn Asp Lys Asn Leu Lys Lys Thr Val Glu Asp Glu Asp Gln Asp
50 55 60
Ser Glu Glu Glu Lys Asp Asn Asp Ser Tyr Ile Lys Glu Arg Ser Asp
65 70 75 80
Ile Pro Ser Gly Thr
85

<210> 841
<211> 66
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 59
<223> Xaa = Lys, Asn

<400> 841
Met Ala Ser Ile Phe Ser Lys Leu Leu Thr Gly Arg Asn Ala Ser Leu
1 5 10 15
Leu Phe Ala Thr Met Gly Thr Ser Val Leu Thr Thr Gly Tyr Leu Leu
20 25 30
Asn Arg Gln Lys Val Cys Ala Glu Val Arg Glu Gln Pro Arg Leu Phe
35 40 45
Pro Pro Ser Ala Asp Tyr Pro Asp Leu Arg Xaa Thr Thr Thr Ala Trp
50 55 60
Pro Ser
65

<210> 842
<211> 74
<212> PRT
<213> Homo sapiens

<400> 842
Met Ala Ala Ala Ala Gly Leu Gly Leu Gln Arg Glu Ala Ala Glu Gly
1 5 10 15
Lys Trp Ser Ala Trp Arg Met Thr Glu Glu Thr Arg Ile Val Tyr Trp
20 25 30
Ile Lys Asp Arg Gln Leu Thr Asn Arg Asp Ser Thr Ile Leu Glu Leu
35 40 45
Gln Lys Val Leu Lys Thr Cys Trp Phe Val Cys Trp Ser Ile Phe Gln
50 55 60
Leu Lys Phe Leu Phe Leu Phe Phe Phe Phe
65 70

<210> 843 


















<211> 68 


















<212> PRT 


















<213> Homo sapiens 


















<400> 843 


















Met Lys Lys Asn lie 


Trp Leu 


Leu Ser Gly 


Ser 




Gin 


Gin 


Tyr 


Leu 


1 5 




10 










15 




lie Ala Leu Trp Arg 


Leu Leu 


Leu Leu Glu 


He 


Tyr 


He 


Ser 


Glu 


Gly 


20 




25 








30 
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Thr His Ser Thr Glu 
35 

Arg Val Ala Ala Ala 
50 

Ser Val Ser Leu 
65 

<210> 844 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 844 

Met Phe Pro Pro Pro 
1 5 
Arg Asp His Pro Pro 
20 

He Thr Leu Thr Thr 
35 

Leu Ala Trp Thr Pro 
50 

<210> 845 
<211> 73 

<212> PRT 

<213> Homo sapiens 

<400> 845 

Met Cys Pro Gly Asn 
1 5 
Phe Ser Arg Ser Ser 
20 

Glu Glu Met Pro Thr 
35 

Glu Arg Leu Leu Tyr 
50 

Val Trp Pro Gly Phe 
65 

<210> 846 
<211> 143 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 109 

<223> Xaa = Lys,Arg 



<400> 846 



Met 


Thr 


Ser 


Gly 


Ser 


1 








5 


Tyr 


He 


Ala 


Thr 


Phe 








20 




Gin 


Asn 


Pro 


He 


Gin 






35 






Thr 


Leu 


Gin 


Arg 


Asn 




50 








Tyr 


Val 


Asn 


Leu 


Ala 


65 










Leu 


Gly 


Ser 


Gly 


Ala 










85 


Phe 


Asn 


Asp 




Ala 



PCT/lBOl/00914 



Glu Asp He Asn Lys Gin He Asn Asp Lys Glu 

40 45 
Met Glu Asn Pro Asn Leu Arg Glu He Val Asp 



Ala Ala Ser Leu Cys Ser His Val Gly Cys Ser 

10 15 

Val Gly Pro Gly Gin Thr Asn Arg Lys Gly Pro 

25 30 

Ala Glu Leu Thr Ser Thr Thr Val Ser Val Ala 
40 45 



Trp Leu 


Trp Ala 


Ser 


Met Thr 


Phe 


Met 


Ala 


Arg 






10 








15 




Ser Arg 


Ser Pro 


Val 


Arg Thr 


Arg 


Gly 


Thr 


Leu 




25 








30 






Val Gln


His Pro 


Phe 


Leu Asn 


Val 


Phe 


Glu 


Leu 




40 






45 








Thr Gly 


Lys Thr 


Ala 


Cys Asn 


His 


Ala 


Asp 


Glu 


55 






60 










Tyr Leu 


Gly Asp 














70 

















Lys 


Cys 


Pro 


Ser Thr Asp Ser Gly Lys 


Glu 


Glu 








10 


15 






Gly 


Ser 


Glu Tyr Phe Cys Tyr Asp 


Leu 


Ser 








25 30 






Ser 


Ser 


Ser 


Asp Glu Ile Thr Leu Ser


Phe 


Lys 






40 


45 






Gly 


Leu 


Met 


Leu His Thr Gly Lys Ser 


Ala 


Asp 




55 




60 






Leu 


Lys 


Asn 


Gly Ala Val Ser Leu Val 


Ile


Asn 


70 






75 




80 


Phe 


Glu 


Ala 


Leu Val Glu Pro Val Asn 


Gly 


Lys 








90 


95 




Trp 


His 


Asp 


Val Lys Val Thr Xaa Asn 


Leu 


Arg 
100 105
Gln Val Thr Ile Ser Val Asp Gly Ile

115 120 
Gln Glu Asp Tyr Thr Met Leu Gly Ser
130 135 

<210> 847 
<211> 112 
<212> PRT 

<213> Homo sapiens 



<400> 847 



Met 


Gln


Leu 


Tyr 


Leu 


Tyr 


Ser 


Met 


Ser 


1 








5 










Glu 


Gln


Ala 


Arg 


Thr 


Leu 


Glu 


Arg Glu 








20 










25 


Ala 


Ile


Arg 


Ser 


Asp Arg 


Cys 


Val 


Lys 






35 










40 




Ala 


Asn 


Asn 


Ile


Ser 


Phe 


Tyr 




Cys 




50 










55 






Met 


Leu 


Arg 


Phe 


Ser 


Ser 


Ala 


Ser 


Ala 


65 










70 








Pro 


Leu 


Leu 


Leu 


Leu 
85 


Phe 


Leu 


Leu 


Leu 


Met 


Arg 


Met 


Lys 


Thr 


Phe 


Met 


Met 


Thr 



100 105 



<210> 848 
<211> 54 
<212> PRT 

<213> Homo sapiens 
<400> 848 

Met Ile Gln Arg Asn Leu Ser Asn Asp
1 5 

Met Asp Ala Asp Met Asp Ala Val Ser 

20 25 
Arg Asn Ala Arg Cys Asp Pro Ala Ser 

35 40 
Ile Phe Ser Thr His Ser
50 

<210> 849 
<211> 138 
<212> PRT 

<213> Homo sapiens 



<400> 849 



Met 


Lys 


Lys 


Leu 


Lys Ala Arg 


Met 


His 


1 








5 








Asp 


Lys 


Met 


Gln


Asn 


Ala Glu 


Ser Gly 








20 








25 


Leu 


Ala 


Glu 




Glu 


Asp Asp 


Phe 


Lys 






35 








40 




Ala 


Ala 


Tyr 


Tyr 


Glu 


Glu Gln


His 


Pro 




50 








55 






Lys 


Glu 


Arg 


Asp 


Gly Leu Arg 


Cys 


Arg 


65 










70 






Asp 


Val 


Glu 


Asp 


Pro 


Ala Thr 


Glu 


Glu 








85 








Lys 


Val 


Met 


Arg 


Trp 


Phe Gln


Ala 


Met 








100 








105 


Trp 


His 


Gly 


Val 


Leu 


Ala Trp 


Val 


Lys 












110 










Thr 






Thr 






125 








Asp Asp 


Phe 


Phe 


Tyr 


Val 






140 










Thr Ala 


Ile


Met 


Gly 


Leu 


Asp 


10 








15 




Glu Lys 


Lys 


Glu 


Asn 


Ser 


Gly 








30 






Ala Ser 


Phe 


Gln


Gln


Gly 


Leu 






45 








Pro Pro 


Gly 


Tyr 


Ser 


Pro 


Arg 




60 










Ala Leu 


Arg 


Gln


Gln


Asp 


Gln


75 










80 








Val 






90 








95 




Arg Ile


His 


Leu 


Met 


Asn 


Ser 








110 






Asp Asn Ser 


Ser 


Pro 


Ala 


Cys 


10 








15 




His Ser 


Tyr 


His 


Trp 


Met 


Ser 








30 






Leu Tyr Ser 


Thr 


Leu 


Ser 


Ser 






45 








Gln Ala


Ile


Glu 


Arg 


Phe 


Tyr 


10 








15 




Arg Gly Gln


Val 


Met 


Ser 


Ser 








30 






Glu Gly 


Tyr 


Leu 


Glu 


Thr 


Val 






45 








Glu Leu 


Thr 


Pro 


Leu 


Leu 


Glu 




60 










Gly Asn Arg Ser 








75 










80 


Pro Gly Glu 


Ser 


Phe 


Cys 


Asp 


90 








95 




Leu Gln Arg


Leu 


Gln


Thr 


Trp 








110 






Glu Lys 


Val 


Val 


Ala 




Val 
115 120 125

His Ala Val Gln Ala Leu Trp Lys Gln Phe
130 135 



<210> 850 
<211> 70 
<212> PRT 

<213> Homo sapiens 
<400> 850 



Met 


Ile


Cys 


Thr 


Phe 


Leu Arg Ala 


1 








5 






Arg 


Ser 


Ser 


Ala 
20 


Lys 




Leu 


Ala 


Cys 


Leu Lys 


Thr 


Glu Pro 


Ser 






35 








40 


Lys 


Lys 


Thr 


Leu Arg 


Pro Arg 


Cys 




50 








55 




Ile


Trp 


Thr 


Gln


Gly 


Pro 




65 










70 





Val Gln Tyr Thr


Glu 


Lys 


Leu 


His 


10 






15 




Leu Pro Tyr Ile


Val 


Leu 


Asn 


Lys 


25 




30 






Leu Arg Cys Gly 


Leu 


Gln


Tyr Gln




45 








Ile Leu Gly Val


Thr 


Gln


Lys 


Thr 



60 



<210> 851 
<211> 152 
<212> PRT 
<213> Homo sapiens



<400> 851 








Met 


Asn 


Val Thr Pro 


Glu 


Val 


Lys 


1 




5 








Glu 


Gln


Leu Leu Lys 


His 


Gly Trp 






20 








Lys 


Glu 


Asn Gly Ile


Thr 


Gln


Ala 






35 






40 


Thr 


His 


Gly Val Gly 


His 


Asp 


Pro 




50 






55 




Trp 


Asn 


Glu Leu Phe 


Asn 


Lys 


Thr 


65 






70 






Gly 


Gln


Asp Gly Val 


Gln


Ile








85 








Tyr 


Asn 


His Pro Lys 


Pro 


Asn 


Leu 






100 








Ala 


Thr 


Leu Thr Ser 


Gly Gly Glu 






115 






120 


Cys 


Ser 


Asp Asp Asp 


Asn 


Gln


Gly 




130 






135 




Met 


Arg 


Cys Cys Ser 


Lys 


Pro 


Val 


145 






150 







Ser 


Arg 


Gly Met 


Lys 


Phe 


Ala 


Glu 




10 








15 




Thr 


Gln


Gly Lys 


Gly 


Leu 


Gly Arg 


25 








30 






Leu Arg Val Thr 


Leu 


Lys 


Gln Asp








45 








Ala 


Lys 


Glu Phe 


Thr 


Asn 


His 


Trp 






60 










Ala 


Ala 




Val 


Val 


Glu 


Thr 






75 








80 


Ser 


Leu 


Ser Lys 


Glu 


Thr 


Thr 


Arg 




90 








95 




Leu 


Tyr 


Gln Lys


Phe 


Val 


Lys 


Met 


105 








110 






Lys 


Pro 


Asn Lys 


Asp 


Leu 


Glu 


Ser 








125 








Ser 


Ser 


Pro Gln


Arg 


Phe 


Leu 


Leu 



140 



<210> 852 
<211> 120 
<212> PRT 

<213> Homo sapiens 



<400> 852 



Met 


Ala 


Lys Glu 


Glu 


Pro 


Gln


Ser 


Ile


Ser Arg Asp 


Leu 


Gln


Glu 


Leu 


1 






5 










10 










15 




Gln


Lys 


Lys Leu 


Ser 


Leu 


Leu 


Ile


Asp 


Ser 


Phe 


Gln


Asn 


Asn 


Ser 


Lys 






20 










25 










30 






Val 


Val 


Ala Phe 


Met 


Lys 


Ser 


Pro 


Val 


Gly 


Gln


Tyr 


Leu 


Asp 


Ser 


His 






35 








40 










45 








Pro 


Phe 


Leu Ala 


Phe 


Thr 


Leu 


Leu 


Val 


Phe 


Ile


Val 


Met 


Ser 


Ala 


Val 




50 








55 










60 










Pro 


Val 


Gly Phe 


Phe 


Leu 


Leu 


Ile


Val 


Val 




Thr 


Thr 


Leu 


Ala 


Ala 
65 70

Leu Leu Gly Val Ile Ile Leu Glu Gly

85 

Phe Ser Leu Leu Cys Ile Leu Cys Gly
100 105 
Met Ser Gly Met Met Ile Ala Ser
115 120 




75 80 
Leu Val Ile Ser Val Gly Gly
90 95 
Leu Ser Phe Val Ser Leu Ala 
110 



<210> 853 
<211> 103 
<212> PRT 

<213> Homo sapiens 



<400> 853 

Met Asp Gly Thr Val Leu Cys Lys Leu Ile Asn Ser Leu Tyr Pro Pro
1 5 10 15

Gly Gln Glu Pro Ile Pro Lys Ile Ser Glu Ser Lys Met Ala Phe Lys
20 25 30

Gln Met Glu Gln Ile Ser Gln Phe Leu Lys Ala Ala Glu Thr Tyr Gly
35 40 45

Val Arg Thr Thr Asp Ile Phe Gln Thr Val Asp Leu Trp Glu Gly Lys
50 55 60

Asp Met Ala Ala Val Gln Arg Thr Leu Met Ala Leu Gly Ser Val Ala
65 70 75 80

Val Thr Lys Asp Asp Gly Cys Tyr Arg Gly Glu Pro Ser Trp Phe His
85 90 95

Arg Lys Ala Gln Gln Asn Arg
100



<210> 854 

<211> 78 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 68 

<223> Xaa = Gly, Arg
<220>

<221> UNSURE

<222> 76

<223> Xaa = Lys, Asn
<400> 854 

Met Pro Leu Pro Arg Lys Asp Asn Pro Gln Cys Ser Pro Leu Leu Pro
1 5 10 15

Pro Leu Ala Ser Val Gly Lys Lys Met Ala Lys Lys Ile Ala Ala Ala
20 25 30

Pro Pro Pro Pro Pro Pro Pro Pro Ser Glu Arg Arg Gly Lys Arg Ala
35 40 45

Glu Gln Lys Gly Ala Val Gly Gln Arg Pro Asp Trp Ile Phe Ser Pro
50 55 60

Gln Pro Gln Xaa Met Phe Leu Met Thr Gln Gly Xaa Leu Val
65 70 75

<210> 855 
<211> 136 
<212> PRT 

<213> Homo sapiens 



<400> 855 

Met Ser Glu Pro Gly Lys Leu Ser Gln Lys Ile Lys Val Trp Leu Gln
1 5



Glu 


Tyr 


Trp Asn Ile Thr


Asp 


Leu 


Val 






20 






25 


Gly 


Ala 


Ile Leu Arg Leu


Gln


Asn 


Gln






35 




40 




Val 


Ile


Tyr Cys Val Asp 


Ile


Ile


Phe 




50 




55 






Ile


Phe 


Gly Val Asn Lys 


Tyr 


Leu 


Gly 


65 




70 








Lys 


Met 


Met Ile Asp Met


Leu 


Tyr 


Phe 






85 










Met 


Ser Phe Gly Val 


Ala 


Arg 


Gln






100 






105 


Lys 


Pro 


Ser Trp Lys Leu 


Ala 


Arg 


Asn 






115 




120 




Met 


Ile


Tyr Gly Glu Val 


Leu 


Gln






130 




135 








10 15 

Ala Ile Ser Thr Phe Met Ile

30 

Pro Tyr Met Gly Tyr Gly Arg 
45 

Trp Tyr Ile Arg Val Leu Asp
60 

Pro Tyr Val Met Met Ile Gly

75 80 
Val Val Ile Met Leu Val Val

90 95 
Ala Ile Leu His Pro Glu Glu
110 

Ile Phe Tyr Met Pro Tyr Trp
125 



<210> 856 
<211> 118 
<212> PRT 

<213> Homo sapiens 
<400> 856 



Met 


Val 


Arg 


Gly 


Trp Ala 


Pro 


Thr 


Trp 


Ala 


Ile


Pro Glu 


Leu 


Cys 


Gly 


1 








5 








10 








15 




Val 


Trp 


Thr 


Leu 

20 


Thr Gly 


Thr 


Pro 


Ser 

25 


Met 


Ser 


Ser Leu 


Ala 

30 


Gln


Leu 


Thr 


Thr 


Ala 
35 


Val 


Val Ser 


Gly 


Thr 
40 


Val 


Lys 


Gln


Ser Gly 
45


Glu 


Val 




Val 


Asn 
50 


Val 


Lys 


Glu His 


Ser 
55 


Arg 


Gln


Ile


Asn 


Asp Ile
60 


Gln


Leu 


Ser 


Arg 


Asp 


Met 


Thr 


Met Phe 


Val 


Thr 


Ala 


Ser 


Lys 


Asp Asn 


Thr 


Ala 


Lys 


65 








70 










75 








80 


Leu 


Phe 


Asp 


Ser 


Thr Thr 
85 




Glu 


His 


Gln
90 


Lys 


Thr Phe 


Arg 


Thr 
95 


Glu 


Arg 


Pro 


Val 


Asn 
100 


Ser Ala 


Ala 


Leu 


Ser 
105 


Pro 


Asn 


Tyr Asp 


His 
110 


Val 


Val 




Gly 


Gly 
115 


Gly 


Gln Glu





















<210> 857 
<211> 122 
<212> PRT 

<213> Homo sapiens 



<400> 857 










Met 


Arg 


Pro 


Ser 


Cys Ala 


Pro 


Gly 


1 








5 






Tyr 


Gly 


Tyr 


Phe 


Arg Lys 


Leu 


Arg 








20 








Cys 


Gln


Ser 


Tyr 


Gly Asn 


Gly 


Ala 






35 








40 


Lys 


Glu 


Ala 


Ser 


Thr Ile


Ala 


Glu 


50 








55 




Gln


Pro 


Ile


Trp 


Ile Gly


Leu 


His 


65 








70 






Gln


Trp 


Ile


Asp 


Gly Ala 

85 


Met 


Tyr 


Ser 


Met 


Gly 


Gly 


Asn Lys 


His 


Cys 








100 








Phe 


Leu 


Thr 


Trp 


Ser Ser 


Asn 


Glu 






115 








120 



Trp 


Phe 


Tyr 


His Lys 


Ser Asn 


Cys 




10 






15 




Asn 


Trp 


Ser 


Asp Ala 


Glu Leu 


Glu 


25 








30 




His 


Leu 


Ala 


Ser Ile


Leu Ser 


Leu 








45 






Tyr 


Ile


Ser 


Gly Tyr 


Gln Arg


Ser 








60 






Asp 


Pro 


Gln


Lys Arg Gln Gln


Trp 






75 






80 


Leu 


Tyr 


Arg 


Ser Trp 


Ser Gly 


Lys 




90 






95 




Ala 


Glu 


Met 


Ser Ser 


Asn Asn 


Asn 


105 








110 




Cys 


Asn 
<210> 858
<211> 107 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -23..-1



Met 


Leu 


Leu 


Gln Ser
-20 


Gly 


Leu 


Ser 


Phe Arg 
-15 


Arg Leu 


Leu 


-10 


Leu 


Ser 


Leu 


Val 


Ser 


Gly Ala 


Leu 


Gly 


Leu 


Gly Gly 


Ala Val 


Leu Gly 


Val 


Gly 






-5 








1 




5 










Leu 


Ser 


Leu Gly Pro 


Val 


Pro 




Thr Pro 


Trp Val 


Phe 


Gly 


Val 


Thr 


10 








15 








20 








25 


Ala 


Gly 


Val 


Phe Leu 


Tyr Val 


Ala 


Leu Val 


Asp Met 


Leu 


Pro 


Ala 


Leu 








30 








35 








40 




Leu 


Arg 


Pro 


Pro Glu 


Pro 


Leu 


Pro 


Thr Pro 


His Val 


Leu 




Gln


Gly 






45 








50 






55 






Leu 


Gly 


Leu 


Leu Leu 


Gly 


Gly Gly Leu Met 


Leu Ala 


Ile


Thr 


Leu 


Leu 






60 








65 






70 








Glu 


Glu 

75 


Arg 


Leu Leu 


Pro 


Val 
80 


Thr 


Thr Glu 


Gly 











<210> 859 
<211> 92 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -24..-1

<220> 

<221> UNSURE 
<222> 52 

<223> Xaa = Arg, Thr 
<400> 859 

Met Val Met Gly Leu Gly Val Leu Leu Leu Val Phe Val Leu Gly Leu
-20 -15 -10

Gly Leu Thr Pro Pro Thr Leu Ala Gln Asp Asn Ser Arg Tyr Thr His
-5 1 5

Phe Leu Thr Gln His Tyr Asp Ala Lys Pro Gln Gly Arg Asp Asp Arg
10 15 20

Tyr Cys Glu Ser Ile Met Arg Arg Arg Gly Leu Thr Ser Pro Cys Lys
25 30 35 40

Asp Ile Asn Thr Phe Ile His Gly Asn Lys Arg Xaa Ser Arg Pro Ser
45 50 55

Val Lys Thr Arg Met Glu Thr Leu Thr Glu Lys Thr
60 65

<210> 860 
<211> 80 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -32..-1
WO 02/083898

<220> 

<221> UNSURE 
<222> 44 

<223> Xaa = Ala, Pro 
<400> 860 

Met Met Leu Val Gly 
-30 

Gln Cys Met Leu Gly
-15 

Ile Glu Ile Ala Ala
1 5 
Ile Lys Glu Val Gln
20 

Thr Lys Asp Glu Pro 
35 

<210> 861 
<211> 166 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -15..-1

<220> 

<221> UNSURE 
<222> 142 

<223> Xaa = Ser,Thr 
<400> 861 



Met 


Leu 


Pro 


Leu 


Leu 


Ile


Ile


Cys 


Leu 


Leu 


Pro 


Ala 


Ile


Glu 


Gly Lys 


-15 










-10 










-5 








1 


Asn 


Cys 


Leu 


5 


Cys 


Trp 


Pro 


Glu 


Leu 

10 


Ser 


Ala 


Leu 


Ile


Asp 
15 


Tyr Asp 


Leu 


Gln


Ile
20 


Leu 


Trp 


Val 


Thr 


Pro 
25 


Gly 


Pro 


Pro 


Thr 


Glu 
30 


Leu 


Ser Gln


Asn 


Arg 
35 


Asp 


His 


Leu 


Glu 


Glu 
40 


Glu 


Thr 


Ala 


Lys 


Phe 
45 


Phe 


Thr 


Gln Val


His 


Gln


Ala 


Ile


Lys 


Thr 


Leu 


Arg 


Asp Asp 


Lys 


Thr 


Val 


Leu 


Leu Glu 


50 










55 










60 








65 


Glu 


Ile


Tyr 


Thr 


His 
70 




Asn 




Phe 


Thr 
75 


Glu 


Arg 




Asn 


Lys Ile
80 


Ser 


Asp 


Gly 


Leu 
85 


Lys 


Glu 


Lys 


Asp 


Ile
90 


Gln


Ser 


Thr 


Leu 


Lys 
95 


Val Thr 


Ser 


Cys 


Ala 


Asp 


Cys 


Arg 


Thr 


His 


Phe 


Leu 


Ser Cys


Asn 


Asp 


Pro Thr 






100 










105 










110 






Phe 


Cys 


Pro 


Ala 


Arg 


Asn 






Thr 


Ser 


Leu 


Trp Ala 


Val 


Ser Leu 




115 










120 










125 








Ser 


Ser 


Ala 


Leu 


Leu 


Leu 


Ala 


Ile


Ala 


Gly Asp 


Val 


Xaa 


Phe 


Thr Gly 


130 










135 










140 








145 


Lys 


Gly 


Arg 


Arg Arg 


Gln





















150 



<210> 862 
<211> 54 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26..-1






Phe 




Gly 


Cys Cys Gly Ala 


Val 


Gln


Glu 


Ser 






-25 




-20 








Leu 


Phe 


Phe 


Gly Phe Leu Leu 


Val 


Ile


Phe 


Ala 




-10 




-5 










Ala 


Ile


Trp 


Gly Tyr Ser His 


Lys Asp 


Glu 


Val 








10 






15 




Glu 


Phe 


Tyr 


Lys Asp Thr Tyr Asn 


Lys 


Leu 


Lys 








25 




30 






Gln Arg Glu


Thr Leu Lys Xaa 


Ser 


Thr 


Met Arg 



40 45 






<400> 862 

Met Val Lys Val Met Trp Val Ser Val Leu Ala Leu Ala Ala Ala Ile
-25 -20 -15

Leu Leu Leu Thr Val Pro Val Ala Glu Gly Val Thr Cys Ser Pro Met
-10 -5 1 5

Gln Leu Ala Ser Cys Ala Ala Ala Met Thr Ser Ser Ser Pro Pro Ser
10 15 20

Glu Ala Val Ala Gln Ser
25

<210> 863 

<211> 55 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -26..-1

<400> 863 

Met Lys Pro Val Ser Arg Arg Thr Leu Asp Trp Ile Tyr Ser Val Leu
-25 -20 -15

Leu Leu Ala Ile Val Leu Ile Ser Trp Gly Cys Ile Ile Tyr Ala Ser
-10 -5 1 5

Met Val Ser Ala Arg Arg Gln Leu Arg Lys Lys Tyr Pro Asp Lys Ile
10 15 20

Phe Gly Thr Asn Glu Asn Leu
25

<210> 864 
<211> 101 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -22..-1



Met 


Arg 


Thr 


Leu 


Ala 


Ile


Leu Ala Ala 


Ile


Leu 


Leu 


Val 


Ala 


Leu 


Gln






-20 








-15 








-10 








Ala 


Arg 




Ser 


His 


Ser 


Arg Gln Glu


Leu 


Met 


Arg 


Leu 


Leu 


Gln


Pro 




-5 










1 




5 










10 


Arg 


Ser 


Arg 


Leu 


Gln


Arg 


Thr Ser Gln


Lys 


Trp 




Phe 


Pro 




His 










15 






20 










25 




Gly 


Thr 


Lys 


Ala 


Trp 


Leu 


Gln Ser Ile


Gln


Ala 


Gln Gly


Lys 


Thr Trp 








30 






35 










40 






Pro 


Ala 


Ile


Ala 


Glu 


Tyr 


Gln Arg Ala




Gln


Glu 




Val 


Ala 


Met 






45 








50 








55 








Glu 


Pro 


Ala 


Ser 


Thr Arg 


Glu Asp Ser Gly His 


Ser 


Ala 


Ala 


Glu 


Leu 




60 










65 






70 










Ala 


Glu 


Lys 


Glu 


Lys 





















75 

<210> 865 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -21..-1
<400> 865

Met Val Leu Leu Ser Ser Thr Met Ser Val Leu Ile Phe Cys Leu Leu
-20 -15 -10

Asn Leu Ala Ile Ser Asp Arg Arg Val Met Lys Ser Pro Thr Ile Ile
-5 1 5 10

Val Ile His Leu Phe Phe Pro His Ser Ser Thr Ser Ile Cys Ile Glu
15 20 25

Asn Leu Val Thr Leu Leu Leu Gly Thr
30 35

<210> 866 
<211> 129 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26..-1



Met 


Val 


Ser 


Arg Met 


Val 


Ser 


Thr Met 


Leu 


Ser Gly Leu Leu 


Phe 


Trp 




-25 








-20 






-15 








Ala 


Ser 


Gly Trp 


Thr 


Pro 


Ala Phe 


Ala 


Tyr Ser Pro Arg 


Thr 


Pro 


-10 








-5 








1 


5 




Asp 


Arg 


Val 


Ser Glu 


Ala 


Asp 


Ile Gln




Leu Leu His Gly 


Val 


Met 








10 






15 




20 






Glu 


Gln


Leu 


Gly Ile


Ala 


Arg 


Pro Arg 


Val 


Glu Tyr Pro Ala 


His 


Gln






25 








30 




35 






Ala 


Met 


Asn 


Leu Val 


Gly 


Pro 


Gln Ser


Ile


Glu Gly Gly Ala 


His 


Glu 




40 








45 






50 






Gly 


Leu 


Gln


His Leu 


Gly 


Pro 


Phe Gly 


Asn 


Ile Pro Asn Ile


Val 


Ala 


55 








60 








65 




70 


Glu 


Leu 


Thr 


Gly Asp 


Asn 


Ile


Pro Lys 


Asp 


Phe Ser Glu Asp 


Gln


Gly 








75 








80 




85 




Tyr 


Gln


Thr 


Leu Gln


Ile


Pro 


Val Leu 




Glu Lys Gln Met


Met 


Asp 








90 






95 




100 







Val 

<210> 867 
<211> 99 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 



<220> 

<221> UNSURE 
<222> -35 

<223> Xaa = Ala, Cys, Gly, Ser, Trp 
<400> 867 

Met Ala Ala Ala Val Xaa Ala Ala Pro Gly Ala Leu Gly Ser Leu His 

-40 -35 -30 

Ala Gly Gly Ala Arg Leu Val Ala Ala Cys Ser Ala Trp Leu Cys Pro 
-25 -20 -15 -10 

Gly Leu Arg Leu Pro Gly Ser Leu Ala Gly Arg Arg Ala Gly Pro Ala 

-5 1 5
Ile Trp Ala Gln Gly Trp Val Pro Ala Ala Gly Gly Pro Ala Pro Lys

10 15 20 

Arg Gly Tyr Ser Ser Glu Met Lys Thr Glu Asp Glu Leu Arg Val Arg 
His Leu Glu Glu Glu Asn Arg Asp Arg Asp Lys Asn Met Val Lys Gly
40 45 50 55 



<210> 868 

<211> 108 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -19..-1



<400> 868 

Met Ala Ala Ala Thr Gly Ala Val Ala Ala Ser Ala Ala Ser Gly Gln
-15 -10 -5

Ala Glu Gly Lys Lys Ile Thr Asp Leu Arg Val Ile Asp Leu Lys Ser
1 5 10

Glu Leu Lys Arg Arg Asn Leu Asp Ile Thr Gly Val Lys Thr Val Leu
15 20 25

Ile Ser Arg Leu Lys Gln Ala Ile Glu Glu Glu Gly Gly Asp Pro Asp
30 35 40 45

Asn Ile Glu Leu Thr Val Ser Thr Asp Thr Pro Asn Lys Lys Pro Thr
50 55 60

Lys Gly Lys Gly Lys Lys His Glu Ala Asp Glu Leu Ser Gly Asp Ala
65 70 75

Ser Val Glu Asp Asp Ala Phe Ile Lys Val Lys Cys
80 85



<210> 869 

<211> 60 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -41..-1

<400> 869 

Met Pro Glu Ser Leu Trp Gln His Ser His Trp Cys Gly Glu Ser Phe
-40 -35 -30

Gln Leu Lys Glu Ala Gly Phe Leu Gly Thr Leu Glu Val Phe Trp Met
-25 -20 -15 -10

Ser Phe Tyr Leu Ser Arg Val Asn Cys Thr Thr Lys Glu Thr Ser Pro
-5 1 5

Leu Pro Lys Pro Ala Arg His Leu Gly Leu Glu Pro
10 15



<210> 870 
<211> 60 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -45..-1



<400> 870 

Met His Ser Leu Phe Ile Ala Ser Leu Lys Val Leu Phe Tyr Tyr Ser
-45 -40 -35 -30 

Phe Ser Phe Arg Phe Asn Trp Phe Asp Cys Leu Leu His Asn Leu Gly 
-25 -20 -15 
Glu Asn Phe Leu Ser Leu Leu Ser Lys Ser Cys Ser Ala Asp Pro Ser

-10 -5 1 

Gly Ser Thr Phe Met Arg Asp Ile Glu Thr Asn Lys
5 10 15 

<210> 871 

<211> 133 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -13..-1



Met 


Ala 


His 


Cys 


Val Thr Leu Val 


Gln Leu Ser Ile


Ser Cys Asp 


His 








-10 




-5 


1 






Leu 


Ile


Asp 


Lys 


Asp Ile Gly Ser


Lys Ser Asp Pro 


Leu Cys 


Val 






5 






10 


15 








Leu 


Gln


Asp Val 


Gly Gly Gly Ser 


Trp Ala Glu Leu 


Gly Arg 


Thr 


Glu 


20 








25 


30 






35 


Arg 


Val 


Arg 


Asn 


Cys Ser Ser Pro 


Glu Phe Ser Lys 


Thr Leu 


Gln












40 


45 




50 




Glu 


Tyr 


Arg 


Phe 


Glu Thr Val Gln


Lys Leu Arg Phe 


Gly Ile


Tyr Asp 








55 




60 


65 






Ile


Asp 


Asn 


Lys 


Thr Pro Glu Leu 


Arg Asp Asp Asp 


Phe Leu 


Gly 


Gly 






70 




75 




80 






Ala 


Glu 


Cys 


Ser 


Leu Gly Gln Ile


Val Ser Ser Gln


Val Leu 


Thr 


Leu 




85 






90 


95 








Pro 


Leu 


Met 


Leu 


Lys Pro Gly Lys 


Pro Ala Gly Arg Gly Thr 


Ser 


Arg 


100 








105 


110 






115 


Ser 


Gln


Leu 


Arg 













<210> 872 
<211> 68 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -47..-1



<400> 872 














Met 


Val Lys 


Ser 


Ser Leu 


Gln


Arg 


Ile Leu


Asn Ser His Cys Phe Ala 




-45 








-40 




-35 


Arg 


Glu Lys 


Glu 


Gly Asp 


Lys 


Pro 


Ser Ala 


Thr Ile His Ala Ser Arg




-30 






-25 






-20 


Thr 


Met Pro 


Leu 


Leu Ser 


Leu 


His 


Ser Arg 


Gly Gly Ser Ser Ser Glu 


-15 






-10 








-5 1 


Ser 


Ser Arg 


Val 


Ser Leu 


His 


Cys 


Cys Ser 


Asn Pro Gly Pro Gly Pro 






5 








10 


15 


Arg 


Trp Cys 


Ser 













20 



<210> 873 
<211> 67 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -31..-1
<400> 873














Met 


Ser Arg 


Leu 


Gly 


Ser 


Ala Val 


Pro Asp Cys His Ile Asn Ala


Pro 




-30 








-25 


-20 




Glu 


Pro Ala 


Ser 


Trp 


Ala 


Leu Gly Arg Trp Gly Ser Cys Arg Ala 


Leu 


-15 








-10 




-5 


1 


Asp 


Lys Gly 


Glu 


Gly 


Gly Arg Glu 


Gly Asp Pro Lys His Pro Ser 


Leu 






5 








10 15 




Pro 


Gly Pro 


Arg 




Gln


Thr Cys 


Pro Met Arg Ser Thr Leu Met Asn 




20 








25 


30 




Cys 


Trp Ser 
















35 















<210> 874 

<211> 81 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -19..-1



Met 


Ala 


Ala Ala 


Val 


Gly 


Arg 


Leu 






Ala 


Ser 


Val 


Ala 


Arg 


His 








-15 










-10 










-5 




Val 


Ser 


Ala Ile


Pro 


Trp 


Gly 


Ile


Ser 


Ala 


Thr 


Ala 


Ala 




Arg 


Pro 






1 








5 










10 








Ala 


Ala 


Cys Gly 




Thr 


Ser 


Leu 


Thr 


Asn 


Leu 




Cys 


Ser 


Gly 


Ser 




15 








20 










25 










Ser 


Gln


Ala Lys 


Ile


Ile


Ser 


Ser 


Ile


Leu 


Trp 


Ile


Ser 


Pro 


Leu 


Cys 


30 








35 










40 










45 


Val 


Leu 


Gln Lys


Leu 


Leu 


Leu 


Leu 


Val 


Thr 


Lys 




Thr 


Asn 


Phe 


Thr 



Met 

<210> 875 
<211> 53 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -35..-1

<400> 875 

Met Glu Gly Arg Ser Arg Ser Arg Lys Ala Cys Leu Glu Val Ala Thr
-35 -30 -25 -20

Val Asn Gln Val Phe Val Gly Leu Val Glu Val Val Ala Val Gly Val
-15 -10 -5

Val Arg Ser Asp Gln Ile Lys Asn Met Tyr Leu Lys Tyr Ser Gln His
1 5 10

Asp Leu Leu Met Asp
15

<210> 876 

<211> 56 

<212> PRT 

<213> Homo sapiens 

<400> 876 

Met Gln Glu Met Ser Arg Pro Ile Asn Lys Lys Phe Leu Ser Leu Ala
1 5 10 15

Glu Asn Gly Trp Pro His Glu Ala Thr Thr Thr Gly Gly Lys Glu Gly

Cys Ser Thr Pro Arg Leu Ile Gly Lys Pro Asn Gln Ile Trp Asp Leu
35 40 45

Tyr Ser Val Ala Glu Ala Gly Arg



<210> 877 
<211> 53 
<212> PRT 
<213> Homo sapiens



<400> 877 

Met Lys Lys Pro Gln Ser Glu Asn Phe Gly Gly Gly Pro Arg Thr Trp
1 5 10 15

Ile Asp Leu Leu Val Val Phe Pro Lys Val His Val Leu Glu Ile Phe
20 25 30

Ser Leu Ser Tyr Tyr Ser Leu Lys Lys Lys Met Leu Thr Val Tyr Leu
35 40 45

Phe Ser Ser Ile Leu
50



<210> 878 

<211> 58 

<212> PRT 

<213> Homo sapiens 



<400> 878 

Met Ser Thr Lys Arg Arg Leu Glu Glu Glu Gln Glu Pro Leu Arg Lys
1 5 10 15

Gln Phe Leu Ser Glu Glu Asn Met Ala Thr His Phe Ser Gln Leu Ser
20 25 30

Leu His Asn Asp Gln Pro Leu Leu Gln Pro His His Asp Leu Leu Pro
35 40 45

Ser Pro Ala Pro Thr Gln Glu Pro Leu Leu
50 55



<210> 879 

<211> 61

<212> PRT 

<213> Homo sapiens 



<400> 879 

Met Ala Ser Asp Leu Asp Phe Ser Pro Pro Glu Cys Pro Ser Pro Leu
1 5 10 15

Ser Trp Arg Thr Cys Tyr Gly Thr Asp Ser Ser Trp Glu Pro Ser Ser
20 25 30

Ser Ser Ser Val Cys Trp Pro Ser Ser Tyr Pro Phe Pro Ser Pro Thr
35 40 45

Arg Arg Arg Leu Asn Arg Leu Ser Pro Glu Val Leu Arg
50 55 60



<210> 880 

<211> 61 

<212> PRT 

<213> Homo sapiens 

<400> 880 

Met Leu Ser Leu Asp Phe Leu Asp Asp Val Arg Arg Met Asn Lys Arg
1 5 10 15

Gln Leu Tyr Tyr Gln Val Leu Asn Phe Gly Met Ile Val Ser Ser Ala
20 25 30

Leu Met Ile Trp Lys Gly Leu Met Val Ile Thr Gly Ser Glu Ser Pro
35 40 45

Ile Val Val Val Ser Gly Lys Met Gly Ile Ser Ser Phe



<210> 881 
<211> 113 
<212> PRT 
<213> Homo sapiens 



<400> 881 



Met 


Ile


Val 


Ser 


Ser 


Ala Leu Met 


Ile


Trp Lys Gly Leu Met 


Val 


Ile


1 








5 






10 




15 




Thr 


Gly 


Ser 


Glu 


Ser 


Pro Ile Val


Val 


Val Leu Ser Gly 


Ser 


Met 


Glu 






20 






25 




30 






Pro 


Ala 


Phe 


His 


Arg 


Gly Asp Leu 


Leu 


Phe Leu Thr Asn 


Arg 


Val 


Glu 






35 






40 




45 








Asp 


Pro 


Ile


Arg 


Val Gly Glu Ile


Val 


Val Phe Arg Ile


Glu 


Gly Arg 




50 








55 




60 








Glu 


Ile


Pro 


Ile


Val 


His Arg Val 


Leu 


Lys Ile His Glu


Lys 


Phe 


Val 


65 










70 




75 






80 


Pro 


Tyr 


Ile


Gly 


Ile


Val Thr Ile


Leu 


Met Asn Asp Tyr 


Pro 


Lys 


Phe 










85 






90 




95 




Lys 


Tyr 


Ala 


Val 


Leu 


Phe Leu Leu 


Gly 


Leu Phe Val Leu 


Val 


His 


Arg 








100 






105 




110 







Glu 



<210> 882 
<211> 72 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 17 

<223> Xaa = Cys, Gly, Arg, Ser

<220> 

<221> UNSURE 
<222> 61 

<223> Xaa = His, Gin 
<400> 882 

Met Gly Ser Ser Arg Asp Thr Ser Arg Ser Leu Gly Gly Cys Ser Pro
1 5 10 15

Xaa Ser Ser Leu Asn Gly Phe Gly Phe Lys Asp Asp His Leu Arg Leu
20 25 30

Leu Gly Cys His Ser Ala Thr Asp His Leu Gln Leu Leu Ala Thr Leu
35 40 45

Gly Phe Leu His Phe Cys Leu Tyr Val Leu Leu Leu Xaa Arg Leu Ser
50 55 60

His Asp Ile Tyr Leu Ser Val Met
65 70

<210> 883 

<211> 70 

<212> PRT 

<213> Homo sapiens 

<400> 883 

Met Trp Tyr Ala Lys Lys Gln Gly Gln Lys Asn Lys Glu Ala Glu Arg
1 5 10 15

Gln Glu Arg Ala Val Val His Met Asp Glu Arg Arg Glu Glu Gln Ile
20 25 30

Val Gln Leu Leu Asn Ser Val Gln Ala Lys Asn Asp Lys Glu Ser Glu






35 








40 








45 




Ala Gln Ile


Ser Thr 


Pro 


Asp 


Leu 


Ala 


Val Arg 


Ser 


Ala 


Pro Ser Gly 


50 






55 








60 






Gln Pro Phe


Arg Ala 


Cys 
















65 




70 
















<210> 884 




















<211> 53 




















<212> PRT 




















<213> Homo sapiens 


















<400> 884 




















Met Ser Gly 


Phe Glu 


Asn 


Leu 


Asn 


Thr Asp Phe 


Tyr 


Gln


Thr Ser Tyr 


1 


5 










10 






15 


Ser Ile Asp


Asp Gln


Ser 


Gln


Gln


Ser 


Tyr Asp 


Tyr Gly 


Gly Ser Gly 




20 
















30 


Gly Pro Tyr 


Asn Ser 


Met 


Leu 


Ala 


Met 


Thr Ile


Arg 


Ser 


Lys Ala Asp 


35 








40 








45 




Leu Ser Leu 


Gln Thr


















50 




















<210> 885 




















<211> 56 




















<212> PRT 




















<213> Homo sapiens 


















<400> 885 




















Met Ile Phe


Cys Ser 


His 


Lys 


Leu 


Ala 


Leu Leu 


Ser 


Ile


Phe Pro Glu 


1 


5 










10 






15 


His Thr Pro 


Gly Val 


Cys 


Lys 


Tyr 


Ser 


Leu Pro 


Leu 


Leu 


His Thr His 




20 








25 








30 


Arg Val Phe 


Thr Ser 


Val 


Thr 


Phe 


Cys 


Ala Tyr 


Cys 


Ser 


Ala Pro Cys 


35 








40 








45 




Leu Phe His 


Leu Thr 


Val 


Ile


Ile












50 






55 














<210> 886 




















<211> 103 




















<212> PRT 




















<213> Homo sapiens 


















<400> 886 




















Met Met Gly 


Ser Lys 


Met 


Ala 


Ser 


Ala 


Ser Arg 


Val 


Val 


Gln Val Val


1 


5 










10 






15 


Lys Pro His 


Thr Pro 


Leu 


Ile




Phe 


Pro Asp 


Arg Arg 


Asp Asn Pro 




20 








25 








30 


Lys Pro Asn 


Val Ser 


Glu 


Ala 


Leu Arg Ser Ala 


Gly 


Leu 


Pro Ser His 


35 








40 








45 




Ser Ser Val 


Ile Ser


Gln


His 


Ser 


Lys 


Gly Ser 


Lys 


Ser 


Pro Asp Leu 


50 






55 








60 






Leu Met Tyr 


Gln Gly


Pro 


Pro 


Asp 


Thr 


Ala Glu 


Ile


Ile


Lys Thr Leu 


65 




70 








75 






80 


Pro Gln Lys


Tyr Arg 


Arg 




Leu 


Val 


Ser Gln


Glu 


Glu 


Met Glu Phe 




85 










90 






95 


Ile Gln Arg


Gly Gly 


Pro 


Glu 
















100 



















<210> 887 
<211> 124 
<212> PRT 
<213> Homo sapiens 



<400> 887 
Met Ala Arg Val Ile


Ile Asp Asn


Lys Asp Gly 


Pro Thr 


Gln


Lys Tyr 


1 5 










10 








15 


Leu Leu Ile Phe Gly


Ala 


Phe 


Val 


Ser 


Val 


Tyr 


Ile Gln


Glu 


Met Phe 


20 








25 








30 




Arg Phe Ala Tyr Tyr 


Lys 


Leu 


Leu Lys 


Lys 


Ala 


Ser Glu 


Gly 


Leu Lys 


35 






40 








45 






Ser Ile Asn Pro Gly


Glu 


Thr 


Ala 


Pro 


Ser 


Met 


Arg Leu 




Ala Tyr 


50 




55 










60 






Val Ser Gly Leu Gly 


Phe 




Ile


Met 


Ser 


Gly 


Val Phe 


Ser 


Phe Val 


65 


70 










75 






80 


Asn Thr Leu Ser Asp 


Ser 


Leu 


Gly 


Pro 


Gly 


Thr 


Val Gly 


Ile


His Gly 


85 










90 








95 


Asp Ser Pro Gln Phe


Phe 


Leu 


Leu 


Phe 


Ser 


Phe 


His Asp Ala Gly His 


100 








105 








110 




Tyr Leu Ala Ala Cys 


Ile


Leu 


Gly 


His 


Cys 


Ile


Phe 






115 






120 














<210> 888 




















<211> 62 




















<212> PRT 




















<213> Homo sapiens 




















<400> 888 




















Met Ala Ser Ser Thr 


Val 


Pro 


Val 


Ser 


Ala 


Ala 


Gly Ser 


Ala 


Asn Glu 


1 5 










10 








15 


Thr Pro Glu He Pro 


Asp 


Asn 


Val 


Gly 


Asp 


Trp 




Gly 


Val Tyr 


20 








25 








30 




Pro Leu Pro Leu He 


Gly Met 


Thr 


Ser Gly Glu 


Leu Asp 


Thr 


Lys Phe 


35 






40 








45 






Gly Thr Leu Cys Cys 


Gly Ser 


Leu 


Ala 


Gly Gin 


Glu Leu 


Glu 




50 




55 










60 






<210> 889
<211> 65
<212> PRT
<213> Homo sapiens

<400> 889

Met Leu Arg His Ser Pro Ser Leu Trp Glu Leu Val Glu Glu His Val
1 5 10 15

Pro Leu Arg Glu Arg Arg Glu Val Lys Arg Ile Leu Gly Gly Gly Gly
20 25 30

Gly Gly Pro Glu Pro Gly Ala Ala Gly Gly Gly Gly Asp Val Thr Gly
35 40 45

Thr Ala Pro Arg Gly Ser Ile Leu Ser Ser Pro Gln Leu Pro Pro His
50 55 60

Leu
65

<210> 890 




















<211> 140 




















<212> PRT 




















<213> Homo sapiens 




















<400> 890 






















Glu 


Met 


Leu 


Pro 


Phe 


Trp 


Met Asn 


Ser 


Thr Gly 


Met y ^ g 










10 








15 


Arg Arg Glu Gly Trp 


Gin 


Arg 


Gly 


Trp 


His 


Gly 


Tyr Asp Asn 


Glu Leu 


20 








25 








30 




Met Asp Met Arg Gly 


He 


Phe 




Ala 


Phe 


Gly 


Pro Asp 


Phe 




35 






40 








45 






Asn Phe Arg Ala Ala 


Pro 


He Arg 


Ser 


Val Asp 


Val Tyr 


Asn 


Val Met 



50 55 60 
Cys Asn Val


Val 


Gly 


He 


Thr 


Pro Leu 


Pro Asn 


Asn 


Gly 


Ser 


Trp 


Ser 


65 






70 






75 










80 


Arg Val Met 


Cys 


Met 


Leu 


Lys 


Gly Arg 


Ala Ala 


Leu 


Pro Arg 


Leu 


Ser 






85 








90 








95 




Gly Pro Ala 


Thr 


Val 


Pro 


Trp 


His Asp 


Ser Ser 


Leu 


Pro 


Ala 


Cys 


He 




100 








105 








110 






Thr Asp His 


He 


Ala 


Cys 


Leu 


Arg Lys 


Lys His 


His 


Gin 


Gin 


Ser Gly 


115 










120 






125 








Pro Pro Lys 


Pro Asp Asp 


Phe 


His Phe 


Met Cys 


Glu 










130 








135 






140 










<210> 891
<211> 62
<212> PRT
<213> Homo sapiens

<400> 891

Met Glu Met Ala Ser Ser Ala Gly Ser Trp Leu Ser Gly Cys Leu Ile
1 5 10 15

Pro Leu Val Phe Leu Arg Leu Ser Val His Val Ser Gly His Ala Gly
20 25 30

Asp Ala Gly Lys Phe His Val Ala Leu Leu Gly Gly Thr Ala Glu Leu
35 40 45

Leu Cys Pro Leu Ser Leu Trp Pro Gly Thr Tyr Pro Arg Arg
50 55 60

<210> 892 
























<211> 114 
























<212> PRT 
























<213> Homo sapiens 




















<400> 892 
























Met Asp lie 


Val 


Ala 


His 


Cys 


Glu Lys 


Ala Met 


Ala 


Glu 


Ser 


Val 


Lys 


1 




5 








10 








15 




Lys His Gin 


Phe 


Ser 


Thr 


Ser 


Glu Ser 


Tyr Leu 


Pro 


Ala 


Thr 


Leu 


He 




20 








25 








30 






Leu Asn Ser 


He 


Ser 


Phe 


Asn 


Thr Asp 


Gly Ser 


He 


Gin 


Tyr 


Lys 


Glu 


35 










40 






45 








Lys Leu Tyr 


Phe 


Ser 


Ala 


Ser 


Glu Ala 


Leu Asp Ala 


Tyr 


He 


Asp Asp 


50 








55 






60 










Phe His Leu 


Asn 


Tyr 


Glu 


Pro 


Pro Asp 


He Asp 


Thr 


Lys 


Val 


Asn 


Leu 


65 






70 






75 










80 


Asp Gin Ser 


Pro 




Glu 


Phe 


Leu Ala 


Lys Ser 


Asn 


Ser 


Gly 


Val 


His 






85 








90 








95 




Gly He Ala 


Pro 


Gly 


Gin 


Ser 


Gly Met 


Glu Lys 


He 


He 


Gin 


Trp 


Tyr 




100 








105 








110 






Ala Ala 

























<210> 893
<211> 55
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 27
<223> Xaa = Asn, Thr

<220>
<221> UNSURE
<222> 39
<223> Xaa = Leu, Val

<220>
<221> UNSURE
<222> 34,37
<223> Xaa = Lys,Asn

<400> 893

Met Arg Cys Arg Pro Ser Ser Val His Val Ala Ile Leu Thr Glu Ala
1 5 10 15

Ile Pro Pro Lys Met Ile Ser Phe Ser Thr Xaa Ser Lys Arg Ile Phe
20 25 30

Lys Xaa Ser Pro Xaa Met Xaa Lys Ser Gln Val Cys Pro Leu Ile Gly
35 40 45

Glu Ile Ser Gly Ser Trp Met
50 55

<210> 894 
















<211> 123 
















<212> PRT 
















<213> Homo sapiens 












<220> 
















<221> UNSURE 














<222> 26 
















<223> Xaa = His,Gln












<400> 894 
















Met Pro Lys 


Gly 


Gly Arg 


Lys Gly Gly 


His 


Lys Gly Arg 


Ala 


Arg Gin 


1 




5 




10 






15 


Tyr Thr Ser 


Pro 


Glu Glu 


He Asp Ala 


Xaa 


Cys Arg Leu 


Arg 


Ser Arg 




20 




25 






30 




Arg Pro Gly 


Lys 


Lys Arg 


Ser Lys Lys 


Gly 


Gly Asp Gly 


Ala Ala Gly 


35 






40 




45 






Asp Pro Lys 


Lys 


Glu Lys 


Lys Ser Leu Asp 


Ser Asp Glu 


Ser 


Glu Asp 


50 






55 




60 






Glu Glu Asp 


Asp 


Tyr Gin 


Gin Lys Arg 


Lys 


Gly Val Glu 


Gly Leu He 


65 




70 






75 




80 


Asp lie Glu 


Asn 


Pro Asn 


Arg Val Ala 


Gin 


Thr Thr Lys 


Lys 


Val Thr 






85 




90 






95 


Gin Leu Asp 


Leu 


Asp Gly 


Pro Arg Ser 


Phe 


Arg Gly Glu 




Glu Lys 




100 




105 






110 




Arg Leu Arg 


Ser 


Arg Arg 


Gin Lys Ser 


Val 


Thr 






115 






120 










<210> 895 
















<211> 123 
















<212> PRT 
















<213> Homo sapiens 












<220> 
















<221> UNSURE 














<222> 26 
















<223> Xaa = His,Gln












<400> 895 
















Met Pro Lys 


Gly 


Gly Arg 


Lys Gly Gly His 


Lys Gly Arg 


Ala 


Arg Gin 


1 




5 




10 






15 


Tyr Thr Ser 


Pro 


Glu Glu 


He Asp Ala 


Xaa 


Cys Arg Leu 


Arg 


Ser Arg 




20 




25 






30 




Arg Pro Gly 


Lys 


Lys Arg 


Ser Lys Lys Gly 


Gly Asp Gly 


Ala Ala Gly 


35 






40 




45 






Asp Pro Lys 


Lys 


Glu Lys 


Lys Ser Leu 


Asp 


Ser Asp Glu 


Ser 


Glu Asp 


50 






55 




60 






Glu Glu Asp 


Asp 


Tyr Gin 


Gin Lys Arg 




Gly Val Glu 


Gly 


Leu He 
65 70 75 80

Asp Ile Glu Asn Pro Asn Arg Val Ala Gln Thr Thr Lys Lys Val Thr
85 90 95

Gln Leu Asp Leu Asp Gly Pro Arg Ser Phe Arg Gly Glu Asn Glu Lys
100 105 110

Arg Leu Arg Ser Arg Arg Gln Lys Ser Val Thr
115 120



<210> 896
<211> 81
<212> PRT
<213> Homo sapiens

<220>
<221> UNSURE
<222> 78
<223> Xaa = Ser, Thr

<400> 896

Met Val Gly Leu Lys Lys Lys Ser Ala Asp Asp Val Lys Lys Val Phe
1 5 10 15

His Met Leu Asp Lys Asp Lys Ser Gly Phe Ile Glu Glu Asp Glu Leu
20 25 30

Gly Phe Ile Leu Lys Gly Phe Ser Pro Asp Ala Arg Asp Leu Ser Ala
35 40 45

Lys Glu Thr Lys Met Leu Met Ala Ala Gly Asp Lys Asp Gly Asp Gly
50 55 60

Lys Ile Gly Val Asp Glu Phe Ser Thr Leu Val Ala Glu Xaa Lys Lys
65 70 75 80



<210> 897 
<211> 143 
<212> PRT 

<213> Homo sapiens 



<400> 897 
















Met 


Tyr 


Thr Thr Val 


Asp 


Ala 


Asn 


Gly Tyr 


Leu Lys Asn 


Gly 


Ser Ala 


1 




5 








10 






15 


Gly 


Gin 


Leu Ser Gin 


Ser 


Ser 


His 


Leu Ala 


Leu Gin Leu 


Pro 


Tyr Asn 






20 








25 




30 




Val 


Leu 


Gly Leu Gly 


Arg 


Ser 


Ala 


Asn Phe 


Leu Asp His 




Tyr Val 






35 






40 




45 






Gly 


He 


Pro Arg Pro 


Ser 


Gly Glu 


Lys Ser 


He Arg Lys 


Gin 


Glu Trp 




50 






55 






60 






Thr 


Ala 


He He Pro 


Asn 


Ser 


Gin 


Leu He 


Val He Pro 


Tyr 


Pro His 


65 






70 








75 




80 




Val 


Pro Arg Ser 


Trp 


Ser 


Ala 


Lys Leu 


Tyr Leu Thr 


Pro 


Ser Asn 






85 








90 






95 


He 


Val 


Leu Leu Thr 


Ala 


He 


Ala 


Leu He 


Gly Val Cys 


Val 


Phe He 






100 








105 




110 




Leu 


Ala 


He He Gly 


He 


Leu 


His 


Trp Gin 


Glu Lys Lys 


Ala 


Asp Asp 






115 






120 




125 






Arg 


Glu 


Lys Arg Gin 


Glu 


Ala 


His 


Arg Phe 


His Phe Asp Ala 


Met 




130 






135 






140 







<210> 898 

<211> 68 
<212> PRT 

<213> Homo sapiens 



<400> 898 

Met Met Asp Thr Met Asp Pro Phe Trp Phe Asn Ala Phe Lys Arg Thr 
1








5 








10 








15 




Asn 


Thr 


He 


Leu 


His 


His 


Leu 


Arg 


Met Ser 


Lys 


His 


Thr 


Asp Ala 


Ala 








20 










25 








30 




Glu 


Glu 


Val 






Glu 


Lys 


Lys 


Gly Cys 


Thr 


Gly 


Val 


He Thr 








35 










40 








45 






Asn 


Arg 


Pro 


Lys 


Phe 




Asn 


Ala 


Leu Thr 


Leu 


Asn 


Met 


He Arg 


Gin 




50 










55 








60 








He 


Tyr 


His 


Ser 






















65 





























<210> 899
<211> 74
<212> PRT
<213> Homo sapiens

<400> 899

Met Ala Ser Ser Thr Val Pro Val Ser Ala Ala Gly Ser Ala Asn Glu
1 5 10 15

Thr Pro Glu Ile Pro Asp Asn Val Gly Asp Trp Leu Arg Gly Val Tyr
20 25 30

Arg Phe Ala Thr Asp Arg Asn Asp Phe Arg Arg Asn Leu Ile Leu Asn
35 40 45

Leu Gly Leu Phe Ala Ala Gly Val Trp Leu Ala Arg Asn Leu Ser Asp
50 55 60

Ile Asp Leu Met Ala Pro Gln Pro Gly Val
65 70

<210> 900 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 19 

<223> Xaa = Arg, Ser 
<220> 

<221> UNSURE 
<222> 75 

<223> Xaa = Leu, Arg 

<220> 

<221> UNSURE 
<222> 74 

<223> Xaa = Lys, Asn 



<400> 900 

Met Gly Ala Thr Gly Pro Lys Leu Gly Ala Cys Gly Leu Arg Leu Gly
1 5 10 15

Gly Arg Xaa Gly Gly Ser Glu Ala Ala Gly Ala Glu Gln Ala Leu Val
20 25 30

Arg Pro Arg Gly Arg Ala Val Pro Pro Phe Val Phe Thr Arg Arg Gly
35 40 45

Ser Met Phe Tyr Asp Glu Asp Gly Asp Leu Ala His Glu Phe Tyr Glu
50 55 60

Glu Thr Ile Val Thr Lys Asn Gly Gln Xaa Xaa Ala Lys Leu Arg Ala
65 70 75 80



<210> 901 
<211> 68 
<212> PRT 
<213> Homo sapiens
<400> 901

Met Trp Ala Phe Pro Trp Glu Glu Pro Leu Tyr Val Ile Thr Phe Tyr
1 5 10 15

Ile Val Ala Val Glu Ala Lys Trp Thr Glu Ser Pro Arg Gln His Glu
20 25 30

Val Val Leu Gln Val Ile Leu Glu Lys Leu Thr Tyr His Thr Thr His
35 40 45

Gln Pro Met Gln Lys Pro Val Tyr Val Gln Ser Ala Glu Cys Leu Gly
50 55 60

Pro Pro Lys Lys
65



<210> 902 
<211> 107 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 100 

<223> Xaa = * ,Trp 
<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = * ,Tyr 
<220> 

<221> UNSURE 

<222> 23 

<223> Xaa = Arg, Ser 

<220> 

<221> UNSURE 
<222> 19 

<223> Xaa = Glu, Gly 
<220> 

<221> UNSURE 
<222> 74, 107 
<223> Xaa = Gly, Val 

<220> 

<221> UNSURE 
<222> 102, 104 
<223> Xaa = His, Gln



<220> 

<221> UNSURE 
<222> 28 

<223> Xaa = Lys, Thr 



<400> 902 

Met Pro Gln Leu Lys Leu Thr Ser Val Trp Thr Thr Thr His Ser Tyr
1 5 10 15

Tyr Thr Xaa Thr Arg Thr Xaa Ser Pro Gly Xaa Xaa Pro Gly Pro Pro
20 25 30

Pro Met Asp Ile Gln Met Ala Asn Asn Phe Thr Pro Pro Ser Ala Thr
35 40 45

Pro Gln Gly Asn Asp Cys Asp Leu Tyr Ala His His Ser Thr Ala Ser
50 55 60

Ile Val Met Pro Leu His Tyr Ser Leu Xaa Phe Ile Ile Gly Leu Val
65 70 75 80

Gly Asn Leu Leu Ala Leu Val Val Ile Val Gln Asn Arg Lys Lys Ile
85 90 95

Ile Ser Pro Xaa Ala Xaa Pro Xaa Pro Ile Xaa
100 105

<210> 903 
<211> 72 
<212> PRT 

<213> Homo sapiens 
<400> 903 

Met Leu Asp Ser Ile Trp Lys Pro Asp Leu Phe Phe Ala Asn Glu Lys
1 5 10 15

Gly Ala Asn Phe His Glu Val Thr Thr Asp Asn Lys Leu Leu Arg Ile
20 25 30

Phe Lys Asn Gly Asn Val Leu Tyr Ser Ile Arg Leu Thr Leu Thr Leu
35 40 45

Ser Cys Pro Met Asp Leu Lys Asn Phe Pro Met Asp Val Gln Thr Cys
50 55 60

Ile Met Gln Leu Glu Ser Leu Val
65 70



<210> 904 

<211> 57 

<212> PRT 

<213> Homo sapiens 

<400> 904 

Met Gly Lys Gln Arg Asn Leu Thr Leu Phe Phe Ser Phe Pro Phe Cys
1 5 10 15

Glu Thr Val Phe Leu Asn His Leu Leu Asn Ser Val Phe Phe Pro Ser
20 25 30

Tyr Ser Phe Val Glu Pro Leu Leu Ser Tyr Leu Arg Phe Pro Gln Thr
35 40 45

His Ile Ser Leu Leu Phe Gln Leu Tyr
50 55

<210> 905
<211> 102
<212> PRT
<213> Homo sapiens



<400> 905 












Met 


Glu 


Arg 


Leu 


Thr 


Leu 


Pro 


Leu 


1 








5 








Tyr 


Leu 


Glu 


Tyr 

20 


Arg 


Arg 


He 


Val 


Phe 


Thr 


Pro 
35 


Glu 


Glu 


Tyr 


Glu 


Glu 
40 


Arg 


Leu 
50 


Gin 


Asn 




Leu 


Phe 
55 


Val 


Asp 


Cys 


Lys 


Leu 


Val 


Gly 


Pro 


Glu 


65 










70 






Tyr 


Lys 


Gin 


His 


Lys 
85 


Thr 


Asp 




He 


Asp 


Pro 


Pro 


Ala 


Lys 







100 



Gly 


Gly Ala Ala Ala Val 


Asp 


Glu 




10 


15 




Gly 


Glu Asp Asp Gly Gly 


Lys 


Leu 


25 


30 






Tyr 


Lys Arg Lys Val Leu 


Pro 


Leu 




45 






Ser 


Trp Arg Ser Pro Thr 


Gly 


Met 




60 






Thr 


Leu Cys Phe Cys Thr 


His 


Arg 




75 




80 


Glu 


Ala He Pro Gin Gin 


Cys 


Pro 




90 


95 





<210> 906 
<211> 104 
<212> PRT 
<213> Homo sapiens
<220>

<221> UNSURE
<222> 64

<223> Xaa = His, Gln
<400> 906 



Met 


Ser 


Thr 






He 


Ser 


Ser 


Ser 


Asp Ala 


Leu 


Leu 


Glu 


Glu 


Leu 


1 








5 










10 










15 




Glu 


Arg 


Ser 


Thr 
20 


Leu 


Gin 


Asp 


Ser 


Asp 

25 


Glu 


Tyr 


Ser 


Asn 


Pro 
30 


Ala 


Pro 


Leu 


Pro 


Leu 


Asp 


Gin 


His 


Ser 


Arg 


Lys 


Glu 


Thr 


Asn 


Leu Asp Glu 


Thr 






35 










40 










45 








Ser 


Glu 


He 


Leu 


Ser 


He 


Gin Asp Asn 


Thr 


Ser 


Pro 


Leu 


Pro 


Ala 


Xaa 




50 










55 










60 










Ser 


Cys 


He 


Leu 


Pro 


He 


Ser 


Arg 


Ser 


Ser 


Met 


Ser 


Thr 


Val 


Lys 


Pro 


65 










70 










75 










80 


Lys 


Ser 


Gin 


Arg 


Asn 


His 


His 


Arg 


Leu 


Leu 


Lys Arg 


Gin 


Gin 


Leu 


Leu 










85 










90 










95 




Ser 


Trp 


Met 


Ser 
100 


Ser 


Leu 




Thr 



















<210> 907 
<211> 63 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 57 

<223> Xaa = Ala, Pro 
<220> 

<221> UNSURE 
<222> 60 

<223> Xaa = Gly,Val 
<220> 

<221> UNSURE 
<222> 39 

<223> Xaa = Phe,Val 

<220> 

<221> UNSURE 
<222> 56 

<223> Xaa = Pro, Ser 

<400> 907 

Met Asp Tyr Gly Tyr Val Gln Thr Thr Ser Thr Glu Met Leu Arg Asn
1 5 10 15

Phe Ile Gln Thr Glu Ala Val Val Ser Lys Pro Phe Ser Leu Phe Asp
20 25 30

Leu Ser Ser Val Gly Leu Xaa Gly Ala Glu Thr Gln Gln Ser Lys Val
35 40 45

Ala Pro Gln Gln Cys Ser Gln Xaa Xaa Arg Pro Xaa Gln Ser Leu
50 55 60

<210> 908 
<211> 65 
<212> PRT 

<213> Homo sapiens 
<400> 908

Met Leu Ser Ala Leu Pro Gly Trp Gly Pro Ala His Leu Gln Arg Pro
1 5 10 15

Leu Leu Gly Pro Ala Ser Cys Leu Gly Ile Leu Arg Pro Ala Met Thr
20 25 30

Ala His Ser Phe Ala Leu Pro Val Ile Ile Phe Thr Thr Phe Trp Gly
35 40 45

Leu Val Gly Ile Ala Gly Pro Trp Phe Val Pro Lys Asp Pro Thr Ala
50 55 60

Glu
65

<210> 909 




















<211> 83 




















<212> PRT 




















<213> Homo sapiens 
















<220> 




















<221> UNSURE 


















<222> 42 




















<223> Xaa = Asp, Glu
















<400> 909 




















Met Leu Val 


Glu 




Lys 


Val Asn Gly Leu Val Pro Arg 


Leu 


Met Pro 


1 




5 








10 






15 


Val He Ala 


Ala 


His 


Trp 


Glu 


Ala Gin 


Val 


Gly Ala Ser 


Leu 


Glu Gly 




20 








25 






30 




Ser Ser He 


Leu 


Asp Arg 


Ala 


Val He 


Xaa 


His Asn Leu 


Leu 


Ser Ala 


35 










40 




45 






Ser Lys Leu 


Tyr 


Asn 


Asn 


He 


Thr Phe 


Glu 


Glu Leu Gly Ala 


Leu Leu 


50 








55 






60 






Glu He Pro 


Ala 


Ala 


Lys 


Ala 


Glu Lys 


He 


Ala Ser Gin 


Met 


He Thr 


65 






70 








75 




80 


Glu Asp Val 




















<210> 910 




















<211> 94 




















<212> PRT 




















<213> Homo sapiens 
















<400> 910 




















Met Ala Ser 


Asn 


Glu 


Val 


He Ala Asp 


He 


Asn Cys Lys 


Gly 


Arg Ser 


1 




5 








10 






15 


Lys Ser Asn 


Leu 


Gly Trp 


Thr 


Pro Leu 


His 


Leu Ala Cys 


Tyr 


Phe Gly 




20 








25 






30 




His Arg Gin 


Val 


Val 


Gin 


Asp 


Leu Leu 


Lys 


Ala Gly Ala 


Glu 


Val Asn 


35 










40 




45 






Val Leu Asn 


Asp 


Met 


Gly 


Asp 


Thr Pro 


Leu 


His Arg Ala Ala 


Phe Thr 


50 








55 






60 






Gly Arg Lys 


Val 




He 


He 


Leu Cys 


Ser 


Met Phe Val 


Ser 


Glu Val 


65 






70 








75 




80 


Phe Gly Gly Val 


Val 


Thr 


He 


Val Phe 


Ser 


Val He Thr 


He 








85 








90 








<210> 911
<211> 75
<212> PRT
<213> Homo sapiens

<400> 911

Met Pro Gly Pro Thr Pro Ser Gly Thr Asn Val Gly Ser Ser Gly Arg
1 5 10 15

Ser Pro Ser Lys Ala Val Ala Ala Arg Ala Gly Asp Pro Leu Ser Gly
20 25 30

Arg Gly Lys Met Pro Ala Val Gly Gln Gly Val Gln Ala Ala Gln Pro
35 40 45

Arg Gln Ala Pro Gly Gly Cys Gly Asp Ser Thr Gln Lys Ile His Leu
50 55 60

Gly Ser Lys Leu Ala Leu Phe Gln Tyr Trp Leu
65 70 75

<210> 912 
<211> 160 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 107 

<223> Xaa = * , Trp 
<220> 

<221> UNSURE 
<222> 97 

<223> Xaa = Cys, Trp 



<400> 912 
















Met 


Arg 


Pro 


Leu 


Pro 


Pro 


Val 


Gly Asp Val Arg Leu Glu Leu 


Ser 


Pro 


1 








5 






10 


15 




Pro 


Pro 


Pro 


Leu 


Leu 


Pro 


Val 


Pro Val Val Ser Gly Ser Pro 


Val 


Gly 








20 








25 30 






Ser 


Ser 


Gly Arg 


Leu 


Met 


Ala 


Ser Ser Ser Ser Leu Val Pro 


Asp 


Arg 






35 










40 45 






Leu 


Arg 




Pro 


Leu 


Cys 


Phe 


Leu Gly Val Phe Val Cys Tyr 


Phe 


Tyr 




50 










55 


60 






Tyr 


Gly 


He 


Leu 


Gin 


Glu Lys 


He Thr Arg Gly Lys Tyr Gly 


Glu 


Gly 


65 










70 




75 




80 


Ala 


Lys 


Gin 


Glu 


Thr 


Phe 


Thr 


Phe Ala Leu Thr Leu Val Phe 


He 


Gin 








85 






90 


95 




Xaa 


Val 


He 


Asn 


Ala 


Val 


Phe 


Ala Lys He Xaa Trp Asp Arg 


Thr Arg 








100 








105 110 






Ser 


Trp 


Leu 


Tyr 


Ala 


Ala 


Cys 


Ser He Ser Tyr Leu Gly Ala 


Met 


Val 






115 










120 125 






Ser 


Ser 


Asn 


Ser 


Ala 


Leu 


Gin 


Phe Val Asn Tyr Pro Thr Gin 


Val 


Leu 




130 










135 


140 






Gly 


Lys 


Ser 


Cys 


Lys 


Pro 


He 


Pro Val Met Leu Leu Gly Val 


Thr 


Ser 


145 










150 




155 




160 



<210> 913 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<400> 913 

Met Arg Leu Ser Pro Phe His Leu Gln Leu Glu Phe His Met Lys Glu
1 5 10 15

Lys Arg Glu Asp Leu Gln Ile Ser Trp Ser Phe Ile Ser Val Pro Glu
20 25 30

Met Ala Val Asn Ile Gln Pro Lys His Trp Gly Arg Thr Arg Trp Leu
35 40 45

Arg Gln Val Arg Cys Leu Thr Phe Ser Arg Thr Ser
50 55 60



<210> 914 
<211> 137 
<212> PRT 
<400> 914






















Met Glu 


Tyr Leu 


He Gly 


He 


Gin 


Gly 


Pro Asp 


Tyr 


Val 


Leu 


Val Ala 


1 




5 










10 








15 


Ser Asp 


Arg Val 


Ala 


Ala 


Ser 


Asn 


He 


Val Gin 


Met 


Lys 


Asp Asp His 




20 










25 








30 




Asp Lys 


Met Phe 


Lys 


Met 


Ser Glu Lys 


He Leu 


Leu 


Leu 


Cys 


Val Gly 




35 








40 








45 






Glu Ala 


Gly Asp 


Thr 


Val 


Gin 


Phe 


Ala 


Glu Tyr 


He 


Gin 


Lys 


Asn Val 


50 








55 








60 








Gin Leu 




Met 


Arg 


Asn 


Gly 


Tyr 


Glu Leu 


Ser 


Pro 


Thr 


Ala Ala 


55 






70 








75 








80 


Ala Asn 


Phe Thr 


Arg 


Arg 


Asn 


Leu 


Ala 


Asp Cys 


Leu 


Arg 


Ser Arg Thr 






85 










90 








95 


Pro Tyr 


His Val 


Asn 


Leu 


Leu 


Leu 


Ala 


Gly Tyr Asp 


Glu 


His 


Glu Gly 




100 










105 








110 




Pro Ala 


Leu Tyr 


Tyr 


Met 


Asp 


Tyr 


Leu 


Ala Ala 


Leu 


Ala 


Lys 


Ala Leu 




115 








120 








125 






Leu Gin 


Pro Thr 


Ala 


Met 


Val 


Pro 


Ser 












130 








135 
















<210> 915
<211> 55
<212> PRT
<213> Homo sapiens

<400> 915

Met Ala Asp Arg Gly Gly Val Gly Glu Ala Ala Ala Val Gly Ala Ser
1 5 10 15

Pro Ala Ser Val Pro Gly Leu Asn Pro Thr Leu Gly Trp Arg Glu Arg
20 25 30

Leu Arg Pro Gly Trp Arg Gly Leu Gly Pro Arg Cys Gly Ser Trp Arg
35 40 45

Gly Trp Gly Cys Phe Thr Pro
50 55

<210> 916 






















<211> 132 






















<212> PRT 






















<213> Homo sapiens 




















<400> 916 






















Met He 


Ser He 


Thr 


Glu 


Trp 


Gin 


Lys 


He Gly Val 


Gly 


He 


Thr Gly 


1 




5 










10 








15 


Phe Gly 


He Phe 


Phe 


He 


Leu 


Phe 


Gly 


Thr Leu 


Leu 


Tyr 


Phe 


Asp Ser 




20 










25 








30 




Val Leu 


Leu Ala 


Phe 


Gly 


Asn 


Leu 


Leu 


Phe Leu 


Thr 


Gly 


Leu 


Ser Leu 




35 








40 








45 






He He 


Gly Leu 


Arg Lys 


Thr 


Phe 


Trp 


Phe Phe 


Phe 


Gin Arg 


His Lys 


50 








55 








60 








Leu Lys 


Gly Thr 


Ser 


Phe 


Leu 


Leu Gly 


Gly Val 


Val 


He 


Val 


Leu Leu 


65 






70 








75 








80 


Arg Trp 


Pro Leu 


Leu 


Gly 


Met 


Phe 


Leu 


Glu Thr 


Tyr 


Gly Phe 


Phe Ser 




85 










90 








95 


Leu Phe 


Lys Gly 


Phe 


Phe 


Pro 


Val 


Ala 


Phe Gly 


Phe 


Leu Gly Asn Val 




100 










105 








110 




Cys Asn 


He Pro 


Phe 


Leu 


Gly Ala 


Leu 


Phe Arg 




Leu Gin Gly Thr 


115 








120 








125 







Ser Ser Met Val 
130 



<210> 917 
<211> 143
<212> PRT 

<213> Homo sapiens 



<400> 917 



Met 


Leu 


Gin Ala Pro Arg 


Gly Arg 


1 




5 






Ala 


Val 


Ala Thr Arg Thr 


Met 


Ala 






20 






Glu 


Leu 


Arg Ser Leu Leu 


Ala 


Ser 






35 




40 


Arg 


Ser 


Arg Glu Glu Ala 


Ala 


Ala 




50 




55 




lie 


Pro 


Val Ser Glu Leu 


Glu 


Ser 


65 




70 






Phe 


Gin 


Ala Leu Tyr Ser 


Ala 


Glu 






85 








Val 


Phe Phe Cys Gin 


Met 


Gly 






100 






Trp 


Pro 


Gly Val Leu Asp 


Thr 


Leu 






115 




120 


He 


Glu 


Asn Gly Trp Arg 


Lys 


Arg 




130 




135 





<210> 918 

<211> 56 
<212> PRT 

<213> Homo sapiens 



Ala Arg 


Ala 


Ala 


Leu 


Arg 


Arg 


Leu 


10 










15 




Gly Ala 


Pro 


Thr 


Val 


Ser 


Leu 


Pro 


25 








30 






Gly Arg 


Ala 


Arg 


Leu 


Phe 


Asp Val 








45 








Gly Thr 


He 


Pro 


Gly 


Ala 


Leu 


Asn 






60 










Ala Leu 


Gin 


Met 


Glu 


Pro 


Ala 


Ala 




75 










80 


Lys Pro 


Lys 


Leu 


Glu Asp 


Glu 


His 


90 










95 




Lys Arg 


Gly 




Gin 


Ala 


Arg 


Ser 


105 








110 






Gly Leu 


Ala 


Thr 


Thr 


Leu 


Glu 


Pro 








125 








Val Arg 


Gin 


Glu 


Ala 


Ala 


Tyr 





140 



<400> 918 

Met Lys Met Ser Arg Asn Ser Ser Leu Gly Gly Ala Thr His Cys Thr
1 5 10 15

Gly Ala Ser Trp Glu Ala Lys His Gly Pro Ala Ser Gly Ala Ser Pro
20 25 30

Ser Ser Leu Gly Phe Arg Pro Leu Ser Pro Ser Leu Pro Ala Leu Gly
35 40 45

Leu Lys Glu Ala Pro Asp Ser Met
50 55
06-05-1999
JP 2001509004 T 10-07-2001
JP 2001510323 T 31-07-2001
JP 2002502234 T 22-01-2002
US 2001016650 A1 23-08-2001
WO 9739123 A2 23-10-1997
WO 9739102 A2 23-10-1997
WO 9739030 A2 23-10-1997
US 6280739 B1 28-08-2001



